This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Zillow Group’s Housing Trends Data Engineering team brings innovative analysis, creativity, and excellence to the U.S. housing market. We create, productionalize, analyze, and share a full collection of housing metrics and analysis for Zillow’s internal and external customers. Our work contextualizes Zillow’s unparalleled living database of all homes and hundreds of millions of customers to provide insight to everyday consumers and open data to hundreds of researchers and policymakers across the country. You’ll work in a fun, collaborative atmosphere with a high-performance team with diverse data engineering and analytical skills. Your work will be highly impactful. The data we create ends up in the nation’s most reputable publications, used by government institutions in policy-making and official reports, in public-facing tools used by millions of Zillow consumers, and in the work of hundreds of academic researchers. Small team = big impact. Engineering teams are highly decentralized in order to create the small team speed and autonomy of a start-up environment but are backed by big company resources.
Job Responsibility
Build data systems to link raw data and concrete Housing Trends insights
Build and extend our internal forecasting model development framework
Develop and maintain scalable data products and deliver pipelines built for speed, accuracy, and consistency that will scale as our stakeholders’ need
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability and performance, etc
Collaborate with data scientists to understand their data requirements and build systems/tools that enable efficient model training and experimentation
Implement feature stores and ETL pipelines to streamline feature engineering and model development
Build and maintain infrastructure for deploying machine learning models into production environments
Be a trusted partner for our stakeholders, such as Forecasting, Zestimates, and the ML teams within the AI organization to assist them for data investigation/publication, ML model development and infrastructure support
Requirements
Strong proficiency in Python programming
Hands-on experience with big data ecosystems, architectures and modern data platforms, including AWS cloud data services and Lakehouse/Lakebase platforms
Hands-on experience with Databricks is a significant advantage
Proven ability to design, build, and orchestrate batch and real-time data pipelines, processing large-scale datasets from diverse sources
Practical experience with cutting-edge data engineering technologies such as SparkSQL and Spark Streaming, PostgreSQL, EKS, and Kubernetes
Familiarity with Agile/DevOps software development processes and tools such as GitLab and CI/CD
A good understanding of data science concepts, fundamental machine learning algorithms/libraries, and development lifecycle is a must-to-have
Familiarity with LLM and Agentic AI technologies and engineering patterns is a big advantage
Strong ability in performing root cause analysis to address operational issues and identify opportunities for improvement
Exceptional communication skills to drive effective customer engagement, collaborate with cross-functional teams, and clearly articulate technical concepts to technical and non-technical stakeholders
Strong problem-solving skills and the ability to work in a fast-paced, collaborative environment
5+ years of experience in Big Data Engineering or Machine Learning Engineering roles
Nice to have
Hands-on experience with Databricks is a significant advantage
Familiarity with LLM and Agentic AI technologies and engineering patterns is a big advantage
What we offer
Eligible for equity awards based on factors such as experience, performance and location