CrawlJobs Logo

Clinical Data Engineer

United Kingdom, Livingston · Job Posted January 05, 2026
Apply Position
Job Link Share

Job Description

As Clinical Data Engineer, you will be joining the world’s largest & most comprehensive clinical research organisation, powered by healthcare intelligence. In this role, you will lead the design, development, and implementation of enterprise-level data pipelines that support clinical trial data integration and analysis. This role is instrumental in ensuring the efficient ingestion, transformation, and delivery of high-quality clinical data across multiple systems and sources to enable data-driven insights, regulatory compliance, and business excellence.

Job Responsibility

  • Serve as a technical expert in building data pipelines for the ingestion and delivery of clinical data at the study level, supporting study start-up, conduct, and close-out activities
  • Develop robust data pipelines for integrating heterogeneous data sources
  • Identify, design, and implement scalable data delivery solutions, automating manual processes whenever possible
  • Develop and implement comprehensive data integrity and quality checks throughout the data ingestion process
  • Design and build infrastructure for optimal data extraction, transformation, and loading (ETL/ELT) using cloud platforms such as AWS and Azure
  • Collaborate with downstream users—including statistical programmers, SDTM programmers, analytics, and clinical data programmers—to ensure deliverables meet end-user requirements
  • Appropriately escalate issues to CDE leadership as needed

Requirements

  • Bachelor’s degree in Computer Science, Statistics, Biostatistics, Mathematics, or a related field
  • advanced degree preferred
  • 8+ years of experience in data engineering or a related field, with at least 5 years focused on building pipelines for complex, multi-source data integration
  • Extensive experience developing ELT and ETL solutions for data warehouses and data lakes
  • Proficient with Python, R, RShiny, SQL, and NoSQL databases
  • Hands-on cloud experience with AWS, Azure, or GCP
  • Familiarity with GitLab, GitHub, and Jenkins for version control and CI/CD
  • Proven expertise in deploying data pipelines in cloud environments
  • Skilled in setting up and managing data warehouses and data lakes (e.g., Snowflake, Amazon Redshift)
  • Efficient in designing, developing, and maintaining scalable data pipelines for large datasets
  • Strong understanding of database concepts, with working knowledge of XML, JSON, and API integrations

What we offer

  • Various annual leave entitlements
  • A range of health insurance offerings to suit you and your family’s needs
  • Competitive retirement planning offerings to maximise savings and plan with confidence for the years ahead
  • Global Employee Assistance Programme, TELUS Health, offering 24-hour access to a global network of over 80,000 independent specialised professionals who are there to support you and your family’s well-being
  • Life assurance
  • Flexible country-specific optional benefits, including childcare vouchers, bike purchase schemes, discounted gym memberships, subsidised travel passes, health assessments, among others

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Clinical Data Engineer

8 matching positions

Sr. Data Engineer – Clinical Data Foundation

The Sr. Data Engineer is responsible for designing, building, maintaining, analy...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s /Bachelor’s degree with 9-12 years of experience in Computer Science, IT or related field
  • Hands on experience with big data technologies and platforms, such as Databricks, Apache Spark (PySpark, SparkSQL), workflow orchestration, performance tuning on big data processing
  • Hands on experience with various Python/R packages for data analysis, feature engineering and machine learning model training
  • Proficiency in data analysis tools (eg. SQL) and experience with data visualization tools
  • Excellent problem-solving skills and the ability to work with large, complex datasets
  • Strong understanding of data governance frameworks, tools, and best practices.
  • Knowledge of data protection regulations and compliance requirements (e.g., GDPR, CCPA)
Job Responsibility
Job Responsibility
  • Design, develop, and maintain data solutions for data generation, collection, and processing
  • Be a key team member that assists in design and development of the data pipeline
  • Create data pipelines and ensure data quality by implementing ETL processes to migrate and deploy data across systems
  • Contribute to the design, development, and implementation of data pipelines, ETL/ELT processes, and data integration solutions
  • Take ownership of data pipeline projects from inception to deployment, manage scope, timelines, and risks
  • Collaborate with cross-functional teams to understand data requirements and design solutions that meet business needs
  • Develop and maintain data models, data dictionaries, and other documentation to ensure data accuracy and consistency
  • Implement data security and privacy measures to protect sensitive data
  • Leverage cloud platforms (AWS preferred) to build scalable and efficient data solutions
  • Collaborate with Data Architects, Business SMEs, and Data Scientists to design and develop end-to-end data pipelines to meet fast paced business needs across geographic regions
Read More
Arrow Right

Data Engineer – Clinical Data Foundation

The Data Engineer is responsible for designing, building, maintaining, analyzing...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s /Bachelor’s degree with 5-8 years of experience in Computer Science, IT or related field
  • Hands on experience with various Python/R packages for data analysis, feature engineering and machine learning model training
  • Proficiency in data analysis tools (eg. SQL) and experience with data visualization tools
  • Excellent problem-solving skills and the ability to work with large, complex datasets
  • Strong understanding of data governance frameworks, tools, and best practices.
  • Knowledge of data protection regulations and compliance requirements (e.g., GDPR, CCPA)
  • Excellent critical-thinking and problem-solving skills
  • Strong communication and collaboration skills
  • Demonstrated awareness of how to function in a team setting
  • Demonstrated presentation skills
Job Responsibility
Job Responsibility
  • Design, develop, and maintain data solutions for data generation, collection, and processing
  • Be a key team member that assists in design and development of the data pipeline
  • Create data pipelines and ensure data quality by implementing ETL processes to migrate and deploy data across systems
  • Contribute to the design, development, and implementation of data pipelines, ETL/ELT processes, and data integration solutions
  • Take ownership of data pipeline projects from inception to deployment, manage scope, timelines, and risks
  • Collaborate with cross-functional teams to understand data requirements and design solutions that meet business needs
  • Develop and maintain data models, data dictionaries, and other documentation to ensure data accuracy and consistency
  • Implement data security and privacy measures to protect sensitive data
  • Leverage cloud platforms (AWS preferred) to build scalable and efficient data solutions
  • Collaborate with Data Architects, Business SMEs, and Data Scientists to design and develop end-to-end data pipelines to meet fast paced business needs across geographic regions
Read More
Arrow Right

Lead Data Scientist - Clinical Informatics (Clinical Data Standards)

We’re building a world of health around every individual — shaping a more connec...
Location
Location
United States , New York
Salary
Salary:
130295.00 - 260590.00 USD / Year
https://www.cvshealth.com/ Logo
CVS Health
Expiration Date
June 29, 2026
Flip Icon
Requirements
Requirements
  • 7+ years of relevant experience in clinical informatics, healthcare analytics, or clinical data management
  • Deep expertise in clinical data types and structures, including CCD data, lab results, clinical notes, and administrative healthcare data
  • Strong knowledge of clinical coding systems and terminologies, such as ICD-10, CPT, HCPCS, SNOMED-CT, LOINC, NDC, and RxNorm
  • Experience designing and documenting data models, taxonomies, or classification frameworks for clinical or healthcare data
  • Proven ability to enable and support downstream data consumers (analysts, data scientists, business users) through documentation, training, and consultative support
  • Experience leading cross-functional projects from concept to delivery by coordinating across clinical, technical, and business stakeholders
  • Proficiency with SQL and experience working with large-scale healthcare datasets
  • Experience using cloud-based data platforms, preferably Google Cloud Platform (GCP) tools including BigQuery, for querying, transforming, and managing data
  • Strong understanding of data quality principles, including validation, profiling, and monitoring of healthcare data
  • Excellent written and verbal communication skills, including the ability to explain complex clinical data concepts to both technical and non-technical audiences
Job Responsibility
Job Responsibility
  • Serve as a subject matter expert in clinical data, including CCD data, with deep understanding of how to structure and apply this data to solve healthcare problems
  • Design and maintain clinical data models, taxonomies, and classification frameworks that enable consistent interpretation and use of clinical data across the organization
  • Develop and govern the clinical data feature store, establishing standards, documentation, and best practices that accelerate adoption of clinical data for downstream analytics, reporting, and AI/ML use cases
  • Enable self-service analytics by building well-documented, validated, and reusable data assets (tables, views, features) that empower analysts and data scientists to work independently with clinical data
  • Create and maintain comprehensive data documentation, including data dictionaries, lineage, business logic, known limitations, and appropriate use guidelines for clinical datasets
  • Build queries, dashboards, and data visualizations to effectively communicate data quality metrics, data availability, and clinical insights to technical and non-technical stakeholders
  • Partner with clinical, operational, and business stakeholders to understand their data needs, translate requirements into data solutions, and ensure clinical data assets meet their analytical objectives
  • Lead and mentor data scientists, data analysts, and data engineers, providing guidance on clinical data interpretation, appropriate use, and best practices for working with healthcare data
  • Establish data quality frameworks for clinical data, including validation rules, anomaly detection, and monitoring processes to ensure data integrity and reliability
  • Translate clinical concepts into analytical frameworks, ensuring that business partners understand the capabilities and limitations of available clinical data
What we offer
What we offer
  • Medical, dental, and vision coverage
  • Paid time off
  • Retirement savings options
  • Wellness programs
  • CVS Health bonus, commission or short-term incentive program
  • Award target in the company’s equity award program
  • Fulltime
Read More
Arrow Right

Clinical Data Validation Engineer Specialist

We are currently seeking a Senior Clinical Data Science Programmer to join our d...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
iconplc.com Logo
iconplc
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced degree in a relevant field such as computer science, statistics, or life sciences
  • Extensive experience in programming for clinical trials, with proficiency in languages such as SAS, R, or Python
  • Strong problem-solving skills and the ability to work collaboratively in a fast-paced, cross-functional environment
  • Excellent attention to detail and organizational skills, with a commitment to delivering high-quality results
  • Strong communication and interpersonal skills, with the ability to effectively collaborate with diverse teams and influence outcomes
Job Responsibility
Job Responsibility
  • Developing, validating, and maintaining programming solutions for data analysis and reporting in clinical trials
  • Collaborating with clinical data scientists and biostatisticians to ensure the integration of programming solutions into the overall data management process
  • Overseeing the generation of statistical datasets, tables, listings, and figures to support regulatory submissions and study reports
  • Providing guidance on programming best practices, coding standards, and data quality control measures
  • Staying updated on advancements in programming languages and data management tools to enhance operational efficiencies
What we offer
What we offer
  • Various annual leave entitlements
  • A range of health insurance offerings to suit you and your family’s needs
  • Competitive retirement planning offerings to maximize savings and plan with confidence for the years ahead
  • Global Employee Assistance Programme, LifeWorks, offering 24-hour access to a global network of over 80,000 independent specialized professionals who are there to support you and your family’s well-being
  • Life assurance
  • Flexible country-specific optional benefits, including childcare vouchers, bike purchase schemes, discounted gym memberships, subsidized travel passes, health assessments, among others
Read More
Arrow Right
New

Senior Software Engineer, Clinical CMS

Location
Location
United States , Burlington; Durham
Salary
Salary:
100000.00 - 170000.00 USD / Year
themuse.com Logo
The Muse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Life Sciences, Computer Science or Engineering with 5 or more years of recent professional software development experience
  • 5 or more years building full-stack applications in Python and modern JavaScript frameworks
  • 5 or more years of recent experience with front-end technologies (e.g., React, Angular, Vue.js, JavaScript/TypeScript, HTML/CSS)
  • 5 or more years of recent experience with RESTful APIs and microservices architecture
  • 5 or more years of recent experience with CI/CD pipelines and DevOps practices
  • 5 or more years of recent experience with cloud platforms (AWS, Azure, or GCP)
Job Responsibility
Job Responsibility
  • Leverage scientific domain and technical knowledge to support genetic scientists with identifying relevant information and published knowledge
  • Establish, streamline, and automate knowledge-extraction pipelines for collecting, curating, and visualizing knowledge from scientific publications
  • Collaborate with data scientists, variant scientists, and software developers to design and implement biomedical solutions that meet stakeholder requirements
  • Monitor emerging technology trends in literature analysis and knowledge discovery
  • represent the team externally through conferences and publications
What we offer
What we offer
  • Medical
  • Dental
  • Vision
  • Life
  • STD/LTD
  • 401(k)
  • Paid Time Off (PTO) or Flexible Time Off (FTO)
  • Tuition Reimbursement
  • Employee Stock Purchase Plan
  • annual bonus under the Labcorp Bonus Plan
  • Fulltime
Read More
Arrow Right

Data Engineer – Lead

Data Engineer – Lead
Location
Location
India , Bengaluru Urban
Salary
Salary:
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong hands-on experience with Microsoft Fabric, including Lakehouse, Warehouse, OneLake, Pipelines, Dataflows Gen2, Notebooks, and Power BI integration
  • Expertise in ETL/ELT, data pipelines, distributed data processing, and cloud-scale data engineering
  • Strong SQL, Python, PySpark, and data modeling skills
  • Experience with Lakehouse, Warehouse, and Medallion Architecture
  • Understanding of Delta tables, dimensional modeling, star schema, facts, dimensions, and curated analytical datasets
  • Experience integrating structured, semi-structured, file-based, API-based, enterprise application, and cloud data sources
  • Experience with data quality, reconciliation, logging, monitoring, and error-handling frameworks
  • Experience leading technical teams and coordinating onshore/offshore delivery
  • Experience with Git, CI/CD, Azure DevOps, branching, code reviews, and release management
  • Good to Have: Experience with Azure Data Factory, Synapse, Databricks, ADLS Gen2, Azure SQL, Microsoft Purview, or related Azure services
Job Responsibility
Job Responsibility
  • Lead the design and implementation of scalable data pipelines and data processing frameworks in Microsoft Fabric
  • Define data engineering standards, development practices, naming conventions, coding guidelines, and reusable technical patterns
  • Lead implementation of Bronze, Silver, and Gold layers in the Medallion Architecture
  • Oversee ingestion, transformation, orchestration, validation, and publication of data from multiple enterprise, clinical, operational, and cloud-based sources
  • Guide development of Fabric Pipelines, Dataflows Gen2, Notebooks, Lakehouse tables, Warehouse objects, and curated datasets
  • Ensure scalability, performance, reliability, maintainability, security, monitoring, and optimization of data solutions
  • Define standards for data quality, reconciliation, logging, error handling, auditability, and lineage
  • Conduct technical design reviews, code reviews, performance reviews, and deployment readiness reviews
  • Mentor and guide data engineering teams across onshore/offshore locations
  • Collaborate with architects, platform engineers, BI teams, QA teams, AI/ML teams, functional consultants, and stakeholders
Read More
Arrow Right

Lead Analytics Engineer - Data Modeling & Quality

Arcadia is the only healthcare data and software company dedicated to healthcare...
Location
Location
United States
Salary
Salary:
Not provided
themuse.com Logo
The Muse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced SQL: window functions, complex CTEs, aggregation patterns, performance tuning on columnar databases
  • DBT: hands-on experience authoring models, tests, macros, and yml documentation
  • familiarity with incremental strategies
  • Healthcare data literacy: working knowledge of claims data (professional, institutional, pharmacy), clinical data (EHR entities), and common quality dimensions (member months, coverage rates, null patterns)
  • Data quality mindset: ability to differentiate source data issues from transform issues, design systematic validation checks, and communicate data quality findings clearly
  • Clear communicator — able to translate technical findings for clients and non-technical stakeholders
  • Strong analytical judgment — you can look at a distribution and know when something is wrong
  • Ability to manage several projects simultaneously, leveraging AI tooling to stay organized and efficient
  • Genuine desire to learn and apply AI tools for operational efficiency
Job Responsibility
Job Responsibility
  • DATA MODELING & DBT DEVELOPMENT: Author, review, and maintain DBT models using Spark/Hudi from ingest through bronze and silver
  • Help clients understand their data model, assumptions, and limitations through intentional validation
  • Troubleshoot and fix issues, then write DBT tests to catch issues proactively
  • Optimize SQL performance for slow-running jobs
  • Partner with Data Engineering on Hudi table design, partition strategy, and incremental patterns
  • DATA QUALITY OWNERSHIP: Triage and classify data quality alerts, distinguishing source-level issues from transform-layer failures
  • Design and maintain volume monitors and DQ monitors (null rate, distribution, future-date checks)
  • Author and apply clinical DQ rules (entity volume, field coverage, LOINC coverage, referential integrity) and claims validation rules across silver and gold layers
  • Conduct quality reviews for connector promotions — evaluating silver entity coverage, validation rule pass rates, and bronze-to-silver transformation correctness
  • Own the ticket queue for DQ, attribution, hierarchy, and customer-specific data quality issues, writing clear customer-facing findings
What we offer
What we offer
  • Pet Insurance
  • Health Insurance
  • Dental Insurance
  • Vision Insurance
  • FSA
  • HSA
  • HSA With Employer Contribution
  • Life Insurance
  • Short-Term Disability
  • Long-Term Disability
Read More
Arrow Right

Sr. Data Engineer

Join Amgen’s Mission of Serving Patients. At Amgen, if you feel like you’re part...
Location
Location
United States , Thousand Oaks
Salary
Salary:
138325.00 - 172744.00 USD / Year
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s degree (or foreign equivalent) in Computer Science, Computer Engineering, Clinical Informatics or related field and 2 years of experience in the job offered or in a Data Engineer – related occupation. The position requires 2 years of experience in the following: Manage and tune Hadoop, MapReduce, or Spark SQL
  • Build ETL pipelines to transfer data
  • Optimize database for usage
  • Manage data stewardship and governance
  • Configure cloud services to orchestrate, monitor, and run end-to-end data warehousing solutions
  • and Model data for translation, linking, and consumption from source to data warehouse.
Job Responsibility
Job Responsibility
  • Responsible for managing and optimizing the company's data infrastructure and architecture for Clinical Data Hub, including cloud-based distributed compute and storage
  • Design and implement data pipelines, develop data models, perform data integration, and ensure data quality and governance
  • Contributes to the effective storage, processing, and utilization of large-scale data sets
  • Work with Product Owner, Product Management, Agile Team, and Data Architects to plan and deliver prioritized Clinical Data Hub backlog items and optimize the technical environment
  • Ensure data requirements for applications, data scientists, and cross-functional use cases are met, and ensure high performance and reliability of the system
  • Provide insights and actionable solutions for business clients.
What we offer
What we offer
  • stock
  • retirement
  • medical
  • life and disability insurance
  • eligibility for an annual bonus
  • health and welfare plans
  • financial plans
  • work/life balance
  • career development opportunities
  • Retirement and Savings Plan
  • Fulltime
Read More
Arrow Right