CrawlJobs Logo

Data Engineer (Analytics)

United States, New York 182584.00 - 199650.00 USD / Year · Job Posted March 19, 2026
Apply Position
Job Link Share

Job Description

Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology.

Job Responsibility

  • Design, model, and implement data ware housing activities to deliver the data foundation that drives impact through informed decision making
  • Design, build and launch collections of sophisticated data models and visualizations that support multiple use cases across different products or domains
  • Collaborate with engineers, product managers and data scientists to understand data needs, representing key data insights visually in a meaningful way
  • Define and manage SLA for all data sets in allocated areas of ownership
  • Create and contribute to frameworks that improve the efficacy of logging data, while working with data infrastructure to triage issues and resolve
  • Determine and implement the security model based on privacy requirements, confirm safeguards are followed, address data quality issues, and evolve governance processes within allocated areas of ownership
  • Solve challenging data integration problems utilizing optimal ETL patterns, frameworks, query techniques, and sourcing from structured and unstructured data sources
  • Optimize pipelines, dashboards, frameworks, and systems to facilitate easier development of data artifacts
  • Influence product and cross-functional teams to identify data opportunities to drive impact
  • Work on problems of diverse scope where analysis of data requires evaluation of identifiable factors
  • Demonstrate good judgment in selecting methods and techniques for obtaining solutions

Requirements

  • Requires a Master’s degree (or foreign equivalent) in Computer Science, Engineering, Information Systems, Mathematics, Statistics, Data Analytics, Applied Sciences, or a related field
  • Requires completion of one graduate-level course, one research project, or one internship involving each of the following: Features, design, and use-case scenarios across a big data ecosystem
  • Custom ETL design, implementation, and maintenance
  • Object-oriented programming languages
  • Schema design and dimensional data modeling
  • Writing SQL statements
  • Analyzing data to identify deliverables, gaps, and inconsistencies
  • Managing and communicating data warehouse plans to internal clients
  • MapReduce or MPP system
  • Python

What we offer

  • bonus
  • equity
  • benefits

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Data Engineer (Analytics)

8 matching positions

Data Engineer (Analytics)

Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologie...
Location
Location
United States , Menlo Park
Salary
Salary:
250130.00 - 284900.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree (or foreign degree equivalent) in Computer Science, Engineering, Information Systems, Mathematics, Statistics, Data Analytics or related field
  • 5 years of progressive, post-baccalaureate work experience in the job offered or in a related occupation
  • 5 years experience in: Features, design, and use-case scenarios across a big data ecosystem
  • Custom ETL design, implementation, and maintenance
  • Object-oriented programming languages
  • Schema design and dimensional data modeling
  • Writing SQL statements
  • Analyzing data to identify deliverables, gaps and inconsistencies
  • Managing and communicating data warehouse plans to internal clients
  • MapReduce or MPP system
Job Responsibility
Job Responsibility
  • Design, model, and implement data warehousing activities to deliver the data foundation that drives impact through informed decision making
  • Design, build and launch collections of sophisticated data models and visualizations that support multiple use cases across different products or domains
  • Collaborate with engineers, product managers and data scientists to understand data needs, representing key data insights visually in a meaningful way
  • Define and manage SLA for all data sets in allocated areas of ownership
  • Create and contribute to frameworks that improve the efficacy of logging data, while working with data infrastructure to triage issues and resolve
  • Determine and implement the security model based on privacy requirements, confirm safeguards are followed, address data quality issues, and evolve governance processes within allocated areas of ownership
  • Solve challenging data integration problems utilizing optimal ETL patterns, frameworks, query techniques, and sourcing from structured and unstructured data sources
  • Optimize pipelines, dashboards, frameworks, and systems to facilitate easier development of data artifacts
  • Influence product and cross-functional teams to identify data opportunities to drive impact
  • Work on problems of diverse scope where analysis of data requires evaluation of identifiable factors
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Data Engineer, Enterprise Data, Analytics and Innovation

Are you passionate about building robust data infrastructure and enabling innova...
Location
Location
United States
Salary
Salary:
110000.00 - 125000.00 USD / Year
vaniamgroup.com Logo
Vaniam Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of professional experience in data engineering, ETL, or related roles
  • Strong proficiency in Python and SQL for data engineering
  • Hands-on experience building and maintaining pipelines in a lakehouse or modern data platform
  • Practical understanding of Medallion architectures and layered data design
  • Familiarity with modern data stack tools, including: Spark or PySpark
  • Workflow orchestration (Airflow, dbt, or similar)
  • Testing and observability frameworks
  • Containers (Docker) and Git-based version control
  • Excellent communication skills, problem-solving mindset, and a collaborative approach
Job Responsibility
Job Responsibility
  • Design, build, and operate reliable ETL and ELT pipelines in Python and SQL
  • Manage ingestion into Bronze, standardization and quality in Silver, and curated serving in Gold layers of our Medallion architecture
  • Maintain ingestion from transactional MySQL systems into Vaniam Core to keep production data flows seamless
  • Implement observability, data quality checks, and lineage tracking to ensure trust in all downstream datasets
  • Develop schemas, tables, and views optimized for analytics, APIs, and product use cases
  • Apply and enforce best practices for security, privacy, compliance, and access control, ensuring data integrity across sensitive healthcare domains
  • Maintain clear and consistent documentation for datasets, pipelines, and operating procedures
  • Lead the integration of third-party datasets, client-provided sources, and new product-generated data into Vaniam Core
  • Partner with product and innovation teams to build repeatable processes for onboarding new data streams
  • Ensure harmonization, normalization, and governance across varied data types (scientific, engagement, operational)
What we offer
What we offer
  • 100% remote environment with opportunities for local meet-ups
  • Positive, diverse, and supportive culture
  • Passionate about serving clients focused on Cancer and Blood diseases
  • Investment in you with opportunities for professional growth and personal development through Vaniam Group University
  • Health benefits – medical, dental, vision
  • Generous parental leave benefit
  • Focused on your financial future with a 401(k) Plan and company match
  • Work-Life Balance and Flexibility
  • Flexible Time Off policy for rest and relaxation
  • Volunteer Time Off for community involvement
  • Fulltime
Read More
Arrow Right

Senior AWS Data Engineer / Data Platform Engineer

We are seeking a highly experienced Senior AWS Data Engineer to design, build, a...
Location
Location
United Arab Emirates , Dubai
Salary
Salary:
Not provided
northbaysolutions.com Logo
NorthBay
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience in data engineering and data platform development
  • Strong hands-on experience with: AWS Glue
  • Amazon EMR (Spark)
  • AWS Lambda
  • Apache Airflow (MWAA)
  • Amazon EC2
  • Amazon CloudWatch
  • Amazon Redshift
  • Amazon DynamoDB
  • AWS DataZone
Job Responsibility
Job Responsibility
  • Design, develop, and optimize scalable data pipelines using AWS native services
  • Lead the implementation of batch and near-real-time data processing solutions
  • Architect and manage data ingestion, transformation, and storage layers
  • Build and maintain ETL/ELT workflows using AWS Glue and Apache Spark on EMR
  • Orchestrate complex data workflows using Apache Airflow (MWAA)
  • Develop and manage serverless data processing using AWS Lambda
  • Design and optimize data warehouses using Amazon Redshift
  • Implement and manage NoSQL data models using Amazon DynamoDB
  • Utilize AWS DataZone for data governance, cataloging, and access management
  • Monitor, log, and troubleshoot data pipelines using Amazon CloudWatch
  • Fulltime
Read More
Arrow Right

Enterprise Data Analytics Engineer

We are looking for a Data Analytics Engineer to help strengthen and scale analyt...
Location
Location
United States , Boston
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 1 year of experience in a data engineering, analytics engineering, or data analyst role with hands-on work in SQL and ETL processes
  • Practical experience with Databricks, Azure Databricks, Microsoft Fabric, or comparable modern cloud data platforms
  • Familiarity with building, maintaining, and troubleshooting data pipelines in a warehouse or lakehouse environment
  • Working knowledge of data modeling principles and an understanding of how curated datasets support reporting and business analysis
  • Ability to write clear, efficient SQL for data extraction, transformation, validation, and performance analysis
  • Experience collaborating with both technical and non-technical partners to gather requirements and deliver usable data assets
  • Exposure to tools or practices such as version control, testing, documentation, and data quality monitoring
  • Healthcare or participant-focused data experience is preferred but not required
Job Responsibility
Job Responsibility
  • Build and support data pipelines, curated datasets, and analytics-ready data structures used for reporting and performance tracking
  • Clean, join, and reshape data from multiple sources to produce dependable outputs for dashboards, KPI measurement, and deeper analysis
  • Partner with analysts, engineers, and business stakeholders to translate reporting needs into practical data models and scalable solutions
  • Investigate pipeline failures and data inconsistencies, then implement fixes that improve reliability, accuracy, and timeliness
  • Maintain clear documentation for data definitions, model logic, source mapping, and lineage to improve transparency across the analytics function
  • Create recurring and on-demand analyses that help teams monitor trends, evaluate outcomes, and support current business priorities
  • Contribute to the upkeep and optimization of analytics workflows within tools such as Databricks, Microsoft Fabric, and related cloud platforms
  • Provide data support to operational teams and assist with information requests tied to cross-functional initiatives or external-facing needs
What we offer
What we offer
  • Medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan
Read More
Arrow Right

Senior Data Analytics Engineer

We are looking for a Senior Data Analytics Engineer to help shape and scale mode...
Location
Location
United States , Reston
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 5 years of experience in data engineering, analytics engineering, or a comparable data-centric role
  • Demonstrated expertise in dimensional modeling, including the design of fact tables, dimension tables, and slowly changing dimensions
  • Strong hands-on capability with Python, PySpark, SparkSQL, and notebook-based data development
  • Experience delivering end-to-end data solutions built on medallion architecture principles
  • Working knowledge of modern big data technologies such as Apache Spark, Hadoop, Kafka, and ETL frameworks
  • Familiarity with machine learning and AI concepts within contemporary analytics environments
  • Experience using Git within team-based software or data development workflows
  • Strong communication skills with the ability to align technical delivery with business objectives
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable data pipelines that ingest, transform, and deliver trusted datasets for analytics and operational use
  • Develop curated data layers using medallion-style architecture to improve data quality, accessibility, and consistency across the platform
  • Create and optimize dimensional models, including fact and dimension structures, to support reporting, trend analysis, and business intelligence needs
  • Use Python, PySpark, SparkSQL, and notebook-based development environments to engineer efficient data processing workflows
  • Partner with business and technical stakeholders to gather requirements and translate them into practical data products and analytics solutions
  • Apply data governance, security, and enterprise data management standards to protect information and support compliant data usage
  • Contribute to collaborative development practices through version control, code review, and shared engineering standards using Git
  • Support advanced analytics initiatives by preparing data foundations that can be used for machine learning and AI-driven use cases
What we offer
What we offer
  • medical, vision, dental, and life and disability insurance
  • 401(k) plan
Read More
Arrow Right

Senior Data Analytics Engineer

We are looking for a Senior Data Analytics Engineer to help shape and scale mode...
Location
Location
United States , Reston
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 5 years of experience in data engineering, analytics engineering, or a comparable data-centric role
  • Demonstrated expertise in dimensional modeling, including the design of fact tables, dimension tables, and slowly changing dimensions
  • Strong hands-on capability with Python, PySpark, SparkSQL, and notebook-based data development
  • Experience delivering end-to-end data solutions built on medallion architecture principles
  • Working knowledge of modern big data technologies such as Apache Spark, Hadoop, Kafka, and ETL frameworks
  • Familiarity with machine learning and AI concepts within contemporary analytics environments
  • Experience using Git within team-based software or data development workflows
  • Strong communication skills with the ability to align technical delivery with business objectives
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable data pipelines that ingest, transform, and deliver trusted datasets for analytics and operational use
  • Develop curated data layers using medallion-style architecture to improve data quality, accessibility, and consistency across the platform
  • Create and optimize dimensional models, including fact and dimension structures, to support reporting, trend analysis, and business intelligence needs
  • Use Python, PySpark, SparkSQL, and notebook-based development environments to engineer efficient data processing workflows
  • Partner with business and technical stakeholders to gather requirements and translate them into practical data products and analytics solutions
  • Apply data governance, security, and enterprise data management standards to protect information and support compliant data usage
  • Contribute to collaborative development practices through version control, code review, and shared engineering standards using Git
  • Support advanced analytics initiatives by preparing data foundations that can be used for machine learning and AI-driven use cases
What we offer
What we offer
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan
  • Fulltime
Read More
Arrow Right

Cost Engineer – Data Analytics

The Program and Purchase Cost Optimization (PPCO) team at General Motors is seek...
Location
Location
United States , Warren
Salary
Salary:
Not provided
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Ability to translate ambiguous business questions into well-defined analytical and data engineering problems, and communicate findings and recommendations in a clear, structured manner to technical and non-technical stakeholders
  • Bachelor’s degree in computer science, engineering, statistics, mathematics, physics, or a related quantitative field (advanced degree preferred)
  • 5+ years in data analytics (could be data science, research, or machine learning)
  • Python proficiency (e.g., pandas, NumPy, scikit-learn, PySpark)
  • SQL proficiency
  • Experience with Databricks, Spark, and/or other cloud-based data platforms for large-scale data processing
  • Experience designing and implementing ETL/ELT pipelines that integrate data from multiple transactional and analytical systems
  • Strong skills in EDA to understand data quality, structure, and relationship
  • Practical experience developing predictive models and machine learning (regression methods, Clustering and segmentation, random forests, gradient boosting)
  • Solid problem-solving skills and the ability to convert business questions into data and modeling problems with clear hypotheses and success criteria
Job Responsibility
Job Responsibility
  • Design and maintain data models and curated datasets (relational schemas, dimensional models, and feature characterizations) that support PPCO analytics and efficient downstream consumption for reporting in tools such as Power BI (while not being primarily responsible for dashboard design)
  • Query, integrate, and engineer data from multiple enterprise systems and relational databases, and design, build, and operate robust ETL/ELT pipelines (e.g., Databricks/Spark) to produce unified, analysis-ready datasets with appropriate data quality checks, validation, and monitoring
  • Perform in-depth EDA on large, heterogeneous datasets, engineer derived features (characterizations, aggregations, encodings), and understand data distributions, identify anomalies, and uncover data quality risks and insight to address by the business team
  • Develop and validate predictive machine learning models and descriptive models (regression, clustering, decision trees, random forests, gradient boosting, time-series/panel models), utilizing existing data to predict product attributes (geometries, cost, economic indicators) from known input parameters and historical patterns
  • Enable system integration across the GM ecosystem by designing the data pipelines, workflow automation and data transformation to take data from one system and turning it into a consumable format for the destination system
  • Automate and optimize existing cost engineering workflows by building scalable, Python-based data and analytics tools
  • Work across functional boundaries, including Engineering, Program Management, R&D, Finance, and Purchasing, to understand data sources, business logic, and use case
  • Fulltime
Read More
Arrow Right

Lead Analytics Engineer - Data Modeling & Quality

Arcadia is the only healthcare data and software company dedicated to healthcare...
Location
Location
United States
Salary
Salary:
Not provided
themuse.com Logo
The Muse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced SQL: window functions, complex CTEs, aggregation patterns, performance tuning on columnar databases
  • DBT: hands-on experience authoring models, tests, macros, and yml documentation
  • familiarity with incremental strategies
  • Healthcare data literacy: working knowledge of claims data (professional, institutional, pharmacy), clinical data (EHR entities), and common quality dimensions (member months, coverage rates, null patterns)
  • Data quality mindset: ability to differentiate source data issues from transform issues, design systematic validation checks, and communicate data quality findings clearly
  • Clear communicator — able to translate technical findings for clients and non-technical stakeholders
  • Strong analytical judgment — you can look at a distribution and know when something is wrong
  • Ability to manage several projects simultaneously, leveraging AI tooling to stay organized and efficient
  • Genuine desire to learn and apply AI tools for operational efficiency
Job Responsibility
Job Responsibility
  • DATA MODELING & DBT DEVELOPMENT: Author, review, and maintain DBT models using Spark/Hudi from ingest through bronze and silver
  • Help clients understand their data model, assumptions, and limitations through intentional validation
  • Troubleshoot and fix issues, then write DBT tests to catch issues proactively
  • Optimize SQL performance for slow-running jobs
  • Partner with Data Engineering on Hudi table design, partition strategy, and incremental patterns
  • DATA QUALITY OWNERSHIP: Triage and classify data quality alerts, distinguishing source-level issues from transform-layer failures
  • Design and maintain volume monitors and DQ monitors (null rate, distribution, future-date checks)
  • Author and apply clinical DQ rules (entity volume, field coverage, LOINC coverage, referential integrity) and claims validation rules across silver and gold layers
  • Conduct quality reviews for connector promotions — evaluating silver entity coverage, validation rule pass rates, and bronze-to-silver transformation correctness
  • Own the ticket queue for DQ, attribution, hierarchy, and customer-specific data quality issues, writing clear customer-facing findings
What we offer
What we offer
  • Pet Insurance
  • Health Insurance
  • Dental Insurance
  • Vision Insurance
  • FSA
  • HSA
  • HSA With Employer Contribution
  • Life Insurance
  • Short-Term Disability
  • Long-Term Disability
Read More
Arrow Right