This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are looking for a Security Data Engineer to join our team in Costa Mesa, California in a contract capacity with the potential for a permanent role. This position is focused on advancing a well-established data lake environment by strengthening the ingestion layer, expanding connectivity to additional systems, and ensuring reliable movement of security-relevant data. The person in this role will take ownership of pipeline operations, partner with internal system stakeholders to enable access, and help keep incoming data accurate, timely, and usable for downstream needs.
Job Responsibility
Take ownership of the data ingestion layer for an existing data lake, ensuring steady performance and dependable delivery of incoming datasets
Reduce a defined backlog of pipeline work during the first 90 days by building and activating prioritized ingestion workflows
Assume responsibility for pipeline components transitioned from the lead engineer and continue their operational support without disrupting established foundations
Support and improve current ETL processes by monitoring health, resolving failures, and extending coverage where needed
Integrate new source systems, including platforms such as Rippling and Workday, into the broader data ecosystem
Coordinate with internal application and system owners to identify data sources, secure appropriate access, and obtain logs or source records required for ingestion
Apply data cleaning and analysis practices to improve data quality and maintain consistency across inbound datasets
Contribute to infrastructure and deployment workflows using AWS, AWS CDK, Infrastructure as Code, and CI/CD practices to support scalable pipeline operations
Requirements
Experience building and supporting data pipelines in a data lake or large-scale data engineering environment
Strong programming skills in Python and hands-on experience with Apache Spark, Apache Hadoop, Apache Kafka, and ETL development
Practical knowledge of Amazon Web Services (AWS) and infrastructure automation tools such as AWS CDK and Infrastructure as Code frameworks
Experience maintaining pipeline reliability, troubleshooting ingestion issues, and improving operational stability in production environments
Ability to work directly with internal stakeholders to locate source data, obtain access, and move integrations forward
Familiarity with data analysis and data cleaning techniques that improve downstream usability and trust in datasets
Exposure to CI/CD practices and deployment automation for data engineering workflows
Additional programming experience in Rust or Go is preferred
Nice to have
Additional programming experience in Rust or Go
What we offer
Medical, vision, dental, and life and disability insurance