CrawlJobs Logo

Biology Data Quality Engineer

bioptimus.com Logo

Bioptimus

Location Icon

Location:

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Bioptimus is building the first universal AI foundation model for biology to fuel breakthrough discoveries and accelerate innovation in biomedicine. We are looking for a meticulous and detail-oriented Biology Data Quality Engineer to ensure the integrity and usability of the various and complex datasets that are central to our mission. In this critical role, you'll leverage your expertise in biology, data science, and machine learning to ensure the quality and consistency of biological data used to train and evaluate our foundation models.

Job Responsibility:

  • Data Validation Pipeline Development: Develop and implement comprehensive data validation protocols for diverse biological datasets (histology, omics, clinical)
  • Ensure data integrity, consistency, and accuracy through rigorous quality checks
  • Design and implement automated data quality pipelines
  • Data Curation & Standardization: Establish and enforce data standardization practices
  • Curate datasets to enhance their usability for machine learning
  • Collaboration & Communication: Work closely with the R&D team to understand data requirements and address data quality concerns
  • Communicate data quality findings and recommendations effectively
  • Communicate and synchronize with external data providers
  • Documentation & Reporting: Maintain a detailed documentation of the data-quality assessment procedures, validation results, and data specifications
  • Generate regular reports on data quality metrics and trends
  • Data Source Evaluation: Evaluate and validate external public data sources
  • Continuous Improvement: Stay up-to-date with the latest data quality best practices and tools
  • Propose and implement improvements to our data-quality assessment processes and pipelines

Requirements:

  • MSc in Biology, Computational Biology, Bioinformatics
  • Deep understanding of transcriptomics data types (bulk, single-cell, spatial) and their specific quality considerations
  • Good knowledge of genomics and proteomics data
  • Proven experience in implementing data quality control procedures and pipelines
  • Familiarity with data validation tools and techniques
  • Strong analytical and problem-solving skills
  • Proficiency in Python
  • Good knowledge of data visualization libraries (e.g. matplotlib)
  • Excellent written and verbal communication skills

Nice to have:

  • Computational Pathology Data Expertise: Experience in machine learning analysis of histology images
  • Cloud expertise: Experience working with AWS
  • Data Annotation Experience: Experience with developing and implementing data annotation guidelines and processes
  • Experience with data ontologies
  • Proven experience building or contributing to large-scale data collections (e.g. Human Cell Atlas)
  • Spatial alignment of multimodal datasets (e.g. alignment between different imaging modalities)
What we offer:
  • Competitive salary and equity package
  • Flexible work arrangements, including remote options
  • Opportunities for professional growth and leadership development

Additional Information:

Job Posted:
February 16, 2026

Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Biology Data Quality Engineer

Software Developer / Data Engineer in Proteomics

The Chair of Proteomics and Bioanalytics at the Technical University of Munich l...
Location
Location
Germany , Freising
Salary
Salary:
Not provided
jobrxiv.org Logo
jobRxiv
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • MSc or PhD in Computer Science, Bioinformatics, Computational Biology, or a related quantitative field
  • Solid experience in programming and system architecture design (e.g., Python), workflow management tools (e.g., Nextflow) and containerization (e.g., Docker)
  • Strong knowledge of relational database systems and schema design (e.g., MySQL), and web-development (e.g., Vue.js) including data visualization frameworks (e.g., D3.js)
  • Excellent collaborative skills
  • ability to work in multidisciplinary teams
Job Responsibility
Job Responsibility
  • Develop and maintain scalable data-processing pipelines for mass spectrometry-based phosphoproteomics and related omics data
  • Design, build, and document database systems and interfaces for managing, visualizing, and mining complex multi-omics datasets
  • Integrate quantification, annotation, and pathway information with genomic and transcriptomic data relevant to cancer signaling and drug response
  • Collaborate closely with proteomics experts, cancer biologists, and clinicians to transform experimental data into biologically and clinically meaningful insights
  • Implement data quality control, and reproducibility workflows in accordance with regulatory requirements
  • Contribute to the development of APIs, web tools, and dashboards for internal users and collaborators
What we offer
What we offer
  • Join an interdisciplinary team of biochemists, cell biologists, bioinformaticians, and clinicians that uses the latest proteomic approaches to fight cancer
  • The Technical University of Munich is one of the best academic institutions in the world, and offers a stimulating work environment and excellent future perspectives
  • The position is initially available for two years but may be extended
  • The salary follows the TVL scale
  • Fulltime
Read More
Arrow Right

Process Engineer, Manufacturing Science & Technology

As we move our cell therapy programs through late-stage development toward BLA s...
Location
Location
United States , Philadelphia
Salary
Salary:
Not provided
cabalettabio.com Logo
Cabaletta Bio Inc
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • B.S. with 3 + years or M.S. with 2-3 years' experience within biologics process development, MSAT or manufacturing under cGMP processes or PhD degree
  • Previous experience and working knowledge of T-cells or immunological cell therapies
  • Experience with cell therapy manufacturing scale up, technology transfer, and process development and optimization is required
  • Experience supporting late-stage MSAT activities, including process characterization, PPQ planning/execution, scale-up/scale-out strategies, and continued process verification
  • Direct involvement in BLA-enabling MSAT deliverables such as Module 3 process descriptions, validation packages, tech transfer documentation, and inspection readiness strongly preferred
  • Good working knowledge in cGMP manufacturing of biological process and ICH regulations
  • Strong written and verbal communication skills
  • Highly organized and efficient
  • Able to work independently
  • Strong problem-solving skills
Job Responsibility
Job Responsibility
  • Support technology transfer of mature, optimized processes to CMOs for clinical and late-phase cGMP manufacturing, ensuring readiness for PPQ and commercial-scale operations
  • Provide manufacturing oversight at CMOs, including person-in-plant support, review of manufacturing performance, and real-time issue escalation to maintain phase-appropriate control strategies
  • Execute process development and characterization studies, generate high-quality protocols and reports, and present data to cross-functional teams to inform PPQ planning and BLA Module 3 content
  • Ensure timely and accurate data capture, supporting data integrity requirements for late-stage filings, validation packages, and regulatory inspections
  • Maintain all training requirements in a compliant state, aligning with expectations for late-stage manufacturing and inspection readiness
  • Support phase-appropriate cell therapy processes using QbD principles, including identification of CPPs/CMAs and contributing to control strategy refinement for BLA submission
  • Identify and evaluate new technologies that enhance scalability, robustness, cost efficiency, and process consistency in preparation for commercial readiness
  • Provide ongoing oversight of CMO operations, including batch record and testing documentation review, data trending, and deviation/CAPA support consistent with PPQ and late-stage expectations
  • Coordinate internal and external activities related to patient material and product logistics, ensuring compliant chain-of-identity/chain-of-custody processes critical for pivotal trial operations
  • Support MSAT planning for development and validation materials, maintaining inventory and coordinating procurement to enable process characterization and PPQ readiness
What we offer
What we offer
  • health and retirement, PTO, and stock option plans
  • Fulltime
Read More
Arrow Right

Senior Scientific Knowledge Engineer

The Onyx Research Data Tech organization is GSK’s Research data ecosystem which ...
Location
Location
United States , Cambridge, Massachusetts; Seattle, Washington; San Francisco, California; Collegeville, Pennsylvania
Salary
Salary:
145200.00 - 242000.00 USD / Year
us.gsk.com Logo
GSK
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Masters degree in Bioinformatics, Biomedical Science, Biomedical Engineering, Molecular Biology, or Computer Science (with a life science application focus)
  • 6+ years of relevant work experience
  • Experience contributing to Knowledge Graph development efforts, including entity modeling, relationship design, and schema governance
  • Experience in operating and leading across organizational boundaries a matrixed team
  • Experience with industry standard data management / metadata platforms e.g. Collibra, Datahub, Datum, Informatica
  • Proficiency in at least one programming language — preferably Python — for scripting vocabulary mappings, building data models, automating QC, and prototyping pipelines
  • Experience with bioinformatics pipelines and workflow management systems (e.g., Nextflow)
Job Responsibility
Job Responsibility
  • Definition of schemas and data models of scientific information required for the creation of value adding data products
  • Accountable for the quality control (through validation and verification) of mapping specifications to be industrialized by data engineering and maintained in platform provisioned tooling
  • Working with Product managers/engineers confidently convert business need into defined deliverable business requirements to enable the integration of large-scale biology data
  • Collaborate with external groups to align GSK data standards with industry/ academic ontologies
  • Support effective ingestion of data by GSK through understanding the entry requirements required by platform engineering teams
  • Provides bespoke subject matter expertise for R&D data to translate deep science into data for actionable insights
  • Champion data lineage, data quality, and FAIR data principles across the Onyx platform
  • Contribute to and maintain documentation of data standards, ontology decisions, and mapping rationale
  • Support self-service data enablement by ensuring metadata and knowledge products are accessible, well-documented, and usable by scientists and analysts
What we offer
What we offer
  • Competitive base salary
  • Annual bonus based on company performance
  • Flexible working options available for most roles
  • Learning and career development
  • Access to healthcare & wellbeing programmes
  • Employee recognition programmes
  • Health care and other insurance benefits (for employee and family)
  • Retirement benefits
  • Paid holidays
  • Vacation
  • Fulltime
Read More
Arrow Right

Senior Director – BioIntelligence

The AI & Data for Engineered Biologics (AIDE) organization at Amgen is seeking a...
Location
Location
United States , Thousand Oaks
Salary
Salary:
239775.00 - 295217.00 USD / Year
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate degree in Computational Biology, Machine Learning, Bioinformatics, Computer Science, Biophysics, or related field and 5 years of experience applying machine learning or computational modeling to biological systems
  • OR Master’s degree in Computational Biology, Machine Learning, Bioinformatics, Computer Science, Biophysics, or related field and 9 years of experience applying machine learning or computational modeling to biological systems
  • OR Bachelor’s degree in Computational Biology, Machine Learning, Bioinformatics, Computer Science, Biophysics, or related field and 11 years of experience applying machine learning or computational modeling to biological systems
  • At least 5 years experience directly managing people and/or leadership experience leading teams, projects, programs, or directing the allocation or resources
Job Responsibility
Job Responsibility
  • Lead the BioIntelligence Team within our Large Molecule Discovery organization, defining strategy and priorities for AI-driven biologics modeling
  • Develop and execute a roadmap for machine learning and AI approaches that accelerate engineered biologics discovery
  • Align BioIntelligence capabilities with broader Research and Large Molecule Discovery priorities
  • Oversee development of predictive models for key biologics properties, including developability, stability, manufacturability, and immunogenicity
  • Advance modeling approaches using modern AI techniques such as: protein language models
  • generative modeling and inverse folding
  • representation learning
  • active learning and Bayesian optimization
  • Guide the use of multimodal biological datasets including sequence, structure, and experimental assay data
  • Lead development of production-quality research software and deployable ML models used across discovery teams
What we offer
What we offer
  • A comprehensive employee benefits package, including a Retirement and Savings Plan with generous company contributions, group medical, dental and vision coverage, life and disability insurance, and flexible spending accounts
  • A discretionary annual bonus program, or for field sales representatives, a sales-based incentive plan
  • Stock-based long-term incentives
  • Award-winning time-off plans
  • Flexible work models where possible
  • Fulltime
Read More
Arrow Right

Data Scientist

The Sponsor provides training, tradecraft guidance and tools for the data scienc...
Location
Location
United States , McLean
Salary
Salary:
Not provided
leadingpath.com Logo
Leading Path Consulting
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Demonstrated experience with data engineering, to include designing and building data infrastructure, developing data pipelines, transforming/preparing data, ensuring data quality and security, and monitoring/optimizing systems
  • Demonstrated experience with data management and integration, including designing and operating robust data layers for application development across local and cloud or web data sources
  • Demonstrated work experience programming with Python
  • Demonstrated experience building scalable ETL and ELT workflows for reporting and analytics
  • Demonstrated experience with general Linux computing and advanced bash scripting
  • Demonstrated experience with SQL
  • Demonstrated experience constructing complex multi- data source queries with database technologies such as PostgreSQL, MySQL, Neo4J or RDS
  • Demonstrated experience processing data sources containing structured or unstructured data
  • Demonstrated experience developing data pipelines with NiFi to bring data into a central environment
  • Demonstrated experience delivering results to stakeholders through written documentation and oral briefings
Job Responsibility
Job Responsibility
  • Development and maintenance of training, publication and coordination of tradecraft guidelines and services
  • Development of programming packages and data services to support cross-enterprise needs
  • Completion of short-term prioritized data science projects, which require programmatic and technical support
  • Work within a team environment and requires constant iteration with stakeholders to provide services and tools
  • Fulltime
Read More
Arrow Right

Specialist Manufacturing: NPI, Upstream Process Owner

Be part of Amgen's newest and most advanced drug substance manufacturing plant. ...
Location
Location
United States , Holly Springs
Salary
Salary:
114990.00 - 139433.00 USD / Year
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • High school diploma / GED & 10 years of biotechnology operations experience
  • Associate’s degree & 8 years of biotechnology operations experience
  • Bachelor’s degree and 4 years of biotechnology operations experience
  • Master’s degree in chemistry, biology, or engineering & 2 years of biotechnology operations experience
  • Doctorate degree
  • Degree in Chemical Engineering, Industrial Engineering, Biology, or Biochemistry
  • Excellent cross-functional project management, meeting facilitation, and technical writing skills
  • Experience in Upstream GMP manufacturing operations
  • Strong technical knowledge of Upstream drug substance processing (media preparation, cell culture, harvest) and a broad understanding of related disciplinary areas in bioprocessing
  • Ability to organize, analyze and interpret technical data through trend analysis, forecasting, modeling, etc.
Job Responsibility
Job Responsibility
  • Communicate and interface between the GMP manufacturing teams in the Amgen North Carolina (ANC) Biologics Drug Substance Manufacturing plant and Process Development scientific groups
  • Ensure new products are successfully introduced into ANC’s biologics manufacturing facility and ownership of upstream unit operations
  • Host cross-functional meetings to drive to timelines to support the tech transfer of the program into the facility as well as process ownership for some upstream process unit operations
  • New Product Introduction (NPI) lead coordinating with Manufacturing, Process Development, Supply Chain, Planning, Facilities and Engineering, as well as Quality to introduce new Drug Substance products and/or advanced technologies into the plant using project management tools (i.e. Smartsheet)
  • Upstream biologics drug substance technical expert who leads or participates in projects, including aiding in commissioning and qualification and training staff on equipment and processes
  • Support Manufacturing in troubleshooting, problem solving and RCAs
  • Support CAPA development to prevent error recurrence
  • Owns New Product Introduction Change Controls and collaborates with stakeholders to drive on-time completion
  • Responds to regulatory questions and/or audit findings
  • Ensures that manufacturing production documents (e.g. Standard Operating Procedures) are accurate and up to date
What we offer
What we offer
  • Competitive and comprehensive Total Rewards Plans that are aligned with local industry standards
  • Fulltime
Read More
Arrow Right

Specialist Manufacturing: NPI, Process Owner

Be part of Amgen's newest and most advanced drug substance manufacturing plant. ...
Location
Location
United States , Holly Springs
Salary
Salary:
114990.00 - 139433.00 USD / Year
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • High school diploma / GED & 10 years of biotechnology operations experience
  • Associate’s degree & 8 years of biotechnology operations experience
  • Bachelor’s degree and 4 years of biotechnology operations experience
  • Master’s degree in chemistry, biology, or engineering & 2 years of biotechnology operations experience
  • Doctorate degree
  • Degree in Chemical Engineering, Industrial Engineering, Biology, or Biochemistry
  • Excellent cross-functional project management, meeting facilitation, and technical writing skills
  • Experience in Downstream GMP manufacturing operations
  • Strong technical knowledge of drug substance processing (harvest, chromatography, filtration, buffer preparation) and a broad understanding of related disciplinary areas in bioprocessing
  • Ability to organize, analyze and interpret technical data through trend analysis, forecasting, modeling, etc.
Job Responsibility
Job Responsibility
  • Communicate and interface between the GMP manufacturing teams in the Amgen North Carolina (ANC) Biologics Drug Substance Manufacturing plant and Process Development scientific groups
  • Ensure new products are successfully introduced into ANC’s biologics manufacturing facility and ownership of downstream unit operations
  • Host cross-functional meetings to drive to timelines to support the tech transfer of the program into the facility as well as process ownership for some downstream process unit operations
  • New Product Introduction (NPI) lead coordinating with Manufacturing, Process Development, Supply Chain, Planning, Facilities and Engineering, as well as Quality to introduce new Drug Substance products and/or advanced technologies into the plant using project management tools (i.e. Smartsheet)
  • Downstream biologics drug substance technical expert who leads or participates in projects, including aiding in commissioning and qualification and training staff on equipment and processes
  • Support Manufacturing in troubleshooting, problem solving and RCAs
  • Support CAPA development to prevent error recurrence
  • Owns New Product Introduction Change Controls and collaborates with stakeholders to drive on-time completion
  • Responds to regulatory questions and/or audit findings
  • Ensures that manufacturing production documents (e.g. Standard Operating Procedures) are accurate and up to date
What we offer
What we offer
  • Competitive and comprehensive Total Rewards Plans that are aligned with local industry standards
  • Fulltime
Read More
Arrow Right

Scientific Data Engineer

Solvd Inc. is a rapidly growing AI-native consulting and technology services fir...
Location
Location
Salary
Salary:
Not provided
solvd.com Logo
Solvd Inc.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Must have 3+ years of experience in Python and SQL
  • Must have a degree in biology or chemistry or relevant fields OR 3+ years of work experience in pharmaceutical or biotech industry in scientific data engineering, data modeling, or data analytics
  • Excellent communication skills, attention to detail, and the confidence to take control of project delivery
  • Quickly understand a highly technical product and effectively communicate with product management and engineering
  • Proactive problem-solving skills
  • High-bandwidth: thrives when managing multiple simultaneous projects
  • Intellectually curious: Unwavering drive to learn and know more every day
  • Ability to think creatively about how to solve project risks without reducing quality
  • Team player and ability to "roll up your sleeves" and do what it takes to make the team successful
Job Responsibility
Job Responsibility
  • Research data acquisition strategy for scientific lab instrumentation
  • Research and productionize file parsers for instrument output files (.xlsx, .pdf, .txt, .raw, .fid, many other vendor binaries)
  • Design and build data models and the corresponding data pipelines, unit tests, integration tests, and reusable utility functions
  • Cross-analyze instrument data with the same instrument type or scientific workflow to design common data model components
  • Build visualization, report, and dashboards using Streamlit, Tableau, Jupyter notebook, etc
  • Drive value for the customers - test and make sure the solution fulfills their requirements and provides value
What we offer
What we offer
  • Shape real-world AI-driven projects across key industries, working with clients from startup innovation to enterprise transformation
  • Be part of a global team with equal opportunities for collaboration across continents and cultures
  • Thrive in an inclusive environment that prioritizes continuous learning, innovation, and ethical AI standards
  • Fulltime
Read More
Arrow Right