Member of Technical Staff - Pretraining Text Data

United Kingdom, London · Job Posted February 13, 2026

Job Description

We are seeking engineers and researchers to join our Pretraining Text Data team, where we are building the next generation of foundation large language models. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you. You’ll work at the intersection of data and innovation, collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development.

Job Responsibility

  • Create high-quality datasets for training and evaluation.
  • Run experiments on new datasets (data ablations) to assess their impact and determine the most effective data.
  • Develop and maintain scalable data pipelines for text data ingestion, preprocessing, filtering, and annotation.
  • Analyze real-world text datasets to assess quality, diversity, and relevance, and to identify areas for improvement.
  • Build lightweight tools and workflows for dataset auditing, visualization, and versioning.
  • Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.
  • Embody our culture and values.

Requirements

  • Bachelor's Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.) OR equivalent experience.
  • 2+ years of experience in data analysis or data engineering, including work with large-scale datasets that are unstructured or semi-structured.
  • Proficiency in statistics and exploratory data analysis methods.

Nice to have

  • Master's Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
  • OR Bachelor's Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 12+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
  • OR equivalent experience.

Similar Jobs for Member of Technical Staff - Pretraining Text Data

8 matching positions

Member of Technical Staff - Data Infra - MAI Superintelligence Team

Help build the world’s most advanced multimodal dataset at Microsoft AI. We are ...
Location: United States, Mountain View
Salary: 139900.00 - 274800.00 USD / Year
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor’s Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling or data engineering work
  • OR Master’s Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ year(s) experience in business analytics, data science, software development, or data engineering work
  • OR equivalent experience
  • Bachelor’s Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 8+ years experience in business analytics, data science, software development, data modeling or data engineering work
  • OR Master’s Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years of business analytics, data science, software development, data modeling or data engineering work experience
  • OR equivalent experience
Job Responsibility
  • Design and develop data pipelines that ingest enormous amounts of multi-modal training data (text, audio, images, video)
  • Own and maintain critical data infrastructures, including spark, ray, vector databases, and others
  • Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models
  • Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation
  • Embody our culture and values
  • Fulltime

Member of Technical Staff - Data Infra - MAI Superintelligence Team

Help build the world’s most advanced multimodal dataset at Microsoft AI. We are ...
Location: United States, Mountain View
Salary: 163000.00 - 296400.00 USD / Year
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling, or data engineering
  • OR Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 8+ years experience in business analytics, data science, software development, data modeling, or data engineering
  • OR equivalent experience
  • Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 12+ years experience in business analytics, data science, software development, data modeling, or data engineering
  • OR Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 15+ years experience in business analytics, data science, software development, data modeling, or data engineering
  • OR equivalent experience
  • 4+ years experience with data governance, data compliance and/or data security
  • Passionate about the role of data in large-scale AI model training
  • Thrive in a highly collaborative, fast-paced environment
  • Have a high degree of expertise and pay close attention to details
Job Responsibility
  • Design and develop data pipelines that ingest enormous amounts of multi-modal training data (text, audio, images, video)
  • Own and maintain critical data infrastructures, including spark, ray, vector databases, and others
  • Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models
  • Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation
  • Embody our culture and values
  • Fulltime

Member of Technical Staff, AI Data - MAI Superintelligence Team

Help build the world’s most advanced multimodal dataset at Microsoft AI. We are ...
Location: United Kingdom, London
Salary: Not provided
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modelling or data engineering work
  • OR equivalent experience
  • Expertise in large scale data engineering ideally applied to AI
  • Expertise in Spark, Kubernetes or similar
Job Responsibility
  • Design and develop data pipelines that ingest enormous amounts of multi-modal training data (text, audio, images, video)
  • Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models
  • Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation
  • Collaborate with the product team and other engineers and researchers across Microsoft AI to identify gaps in the current generation of models
  • Embody our culture and values
  • Fulltime

Member of Technical Staff - ML Research Engineer, Data

Our Data team powers Liquid Foundation Models across pre-training, vision, audio...
Location: United States, San Francisco; Boston
Salary: Not provided
Liquid AI (liquid.ai)
Expiration Date: Until further notice
Requirements
  • Strong Python skills with the ability to quickly comprehend problems and translate them into clean, working code
  • Solid ML fundamentals: experience training, evaluating, and iterating on models (PyTorch preferred)
  • Track record of learning new technical domains quickly
  • 3+ years relevant experience with an M.S., or 1+ year with a Ph.D. (5+ years with a B.S.)
Job Responsibility
  • Build and maintain data processing, filtering, and selection pipelines at scale
  • Create pipelines for pretraining, midtraining, SFT, and preference optimization datasets
  • Design synthetic data generation systems using LLMs, structured prompting, and domain-specific generators
  • Design and run evaluations and ablations to measure dataset's impact on model performance
  • Monitor public datasets across text, vision, and audio domains
  • Collaborate with pre-training, vision, and audio teams on modality-specific data needs
What we offer
  • Competitive base salary with equity in a unicorn-stage company
  • We pay 100% of medical, dental, and vision premiums for employees and dependents
  • 401(k) matching up to 4% of base pay
  • Unlimited PTO plus company-wide Refill Days throughout the year
  • Fulltime

Management Accountant – Fibre

We’re looking for an experienced Management Accountant to provide financial supp...
Location: New Zealand, Whangārei
Salary: Not provided
Northpower Ltd (northpower.nz)
Expiration Date: June 19, 2026
Requirements
  • CAANZ (or equivalent) qualification, or currently working towards it
  • 1–2 years’ post-qualification experience in a medium to large commercial organisation or CA firm
  • Strong Excel skills (intermediate to advanced)
  • Proven ability to meet deadlines with a high level of accuracy
  • Great problem-solving and analytical skills
  • Strong interpersonal and relationship-building skills
  • Experience with the following is advantageous: Commerce Commission / regulatory reporting; JDE (JD Edwards)
Job Responsibility
  • Deliver accurate and timely month-end and year-end processing and reporting, including meaningful budget variance analysis
  • Assist with annual regulatory Information Disclosure and support statutory and regulatory audit processes
  • Support budgeting and quarterly forecasting activities
  • Provide in-depth financial analysis and develop reporting tools to support managerial planning and commercial decisions
  • Partner with the Fibre leadership and operational teams to improve financial capability and outcomes
  • Contribute to internal audits and identify opportunities for process and system improvements
  • Role model organisational behaviours and contribute positively to team culture and brand reputation
  • Ensure compliance with health, safety, quality and environmental (HSQE) requirements
What we offer
  • Competitive remuneration
  • An in-house well-being programme, a peer support network (Kaitiaki) and EAP services
  • Life Insurance and group discounted medical insurance
  • Commitment to professional growth and development
  • A friendly workplace where people are valued and appreciated
  • Family-friendly events and discounted gym membership along with various retail discounts
  • Fulltime

Senior People and Capability Partner

Location: New Zealand, Auckland
Salary: Not provided
Northpower Ltd (northpower.nz)
Expiration Date: June 20, 2026
Requirements
  • A genuine passion for people
  • Strong written and verbal communication skills
  • Able to thrive in a fast-paced environment, prioritise effectively, and stay resilient
  • Confident communicating with people at all levels of the business, including frontline teams
  • Proactive, adaptable, and collaborative
  • Proficient in Microsoft Office, including Excel, Word, and PowerPoint
  • Aligned with our values of care, passion, integrity, and achieving together
  • Essential experience in a generalist HR or recruitment role
  • A tertiary qualification in HR is desirable
Job Responsibility
  • Contribute to the delivery of P&C continuous improvement initiatives
  • Increase and enhance P&C's contribution to, and presence in, the wider business
  • Support P&C Delivery team and managers on solving ER issues
  • Aid Recruitment partner regarding the end-to-end recruitment process
  • Coordinate and produce HR reports and metrics on behalf of the HR team
  • Assist in the delivery of communications to the Auckland Energy Services Team and wider business
What we offer
  • A people‑first culture built on safety, inclusion, and deep care, where contributions are recognised and wellbeing is supported through EAP and our award‑winning peer support network (Kaitiaki)
  • Fair and market competitive pay including tool of trade vehicle, with the flexibility to tailor through additional leave and increased employer KiwiSaver contributions
  • Life insurance and preventative health initiatives (including annual flu immunisations and Mole Maps)
  • Discounted group medical cover and health and fitness programmes (including Fitness Passport)
  • Discounted fuel rates plus a range of retail discounts (including Boost, PB Tech and Specsavers)
  • Ongoing learning and development, with clear career pathways to help you grow, specialise, or progress
  • A welcoming, family‑friendly workplace where people are encouraged to be themselves and feel valued
  • Fulltime

Full Charge Bookkeeper

We are a growing yacht manufacturer seeking a hands-on Full Charge Bookkeeper to...
Location: United States, Dania Beach
Salary: Not provided
Robert Half (https://www.roberthalf.com)
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Accounting or related field (or equivalent experience)
  • 5–10 years of progressive bookkeeping/accounting experience, ideally in a small to mid-sized business
  • Proven experience operating in a full charge or highly autonomous bookkeeping role
  • Strong working knowledge of accounting systems and bookkeeping software
  • High attention to detail with strong organizational and communication skills
  • Comfortable wearing multiple hats in a hands-on, team-oriented environment
Job Responsibility
  • Manage full-cycle bookkeeping including AR, AP, GL, and all cash, bank, and credit card transactions
  • Perform monthly reconciliations (bank accounts, credit cards, purchasing, and inventory)
  • Maintain accurate and timely accounts payable processing and reporting
  • Ensure proper recordkeeping and organization of accounting documentation
  • Coordinate closely with purchasing and service teams on transactions and inventory-related accounting
  • Handle sales tax filings, government/statistical reporting, and licensing/inspection requirements
  • Support general administrative functions, including mail handling and office coordination
  • Assist with boat show preparation and participation as needed (logistics, coordination, light client interaction)
What we offer
  • Competitive salary
  • Medical benefits
  • 401(k)
  • Paid Time Off (PTO)

Change Management Specialist

As a Change Management Specialist, you will ensure the smooth adoption of a new ...
Location: Canada, Toronto
Salary: 67.68 - 74.36 USD / Hour
Randstad (https://www.randstad.com)
Expiration Date: June 29, 2026
Requirements
  • 5+ years of organizational change management experience specifically for technology implementations
  • Proven track record supporting contact centre transformations, SaaS implementations, or cloud migrations (e.g., AWS, Azure, Genesys, Oracle)
  • Prosci Certified Change Practitioner or a similar recognized certification is strongly preferred
  • In-depth knowledge of change management principles, tools, and the practical application of readiness and culture assessments
  • Exceptional written and verbal communication skills, with a strong ability to facilitate discussions and influence stakeholders
  • Understanding of cloud contact centre architecture, CRM integrations, multi-channel routing, and data privacy/compliance
  • Ability to translate complex technical requirements into clear operational impacts for business readiness
Job Responsibility
  • Develop and execute a comprehensive Change Management Plan aligned with the cloud migration roadmap
  • Conduct detailed change impact assessments across agent workflows, leadership, and IT teams
  • Identify and analyze stakeholders to develop targeted engagement strategies
  • Proactively identify and mitigate risks related to people, processes, and organizational impacts
  • Collaborate with leads to design communication strategies and training needs assessments
  • Facilitate training sessions and oversee vendor-led delivery to ensure high-quality learning outcomes
  • Monitor post-migration adoption and develop reinforcement plans for continuous improvement, stabilization, and knowledge transfer
  • Work with Subject Matter Experts (SMEs) to ensure change activities align with infrastructure changes, system integrations, and testing phases
What we offer
  • Drive a large-scale digital transformation in a modernized cloud environment
  • Work closely with diverse teams, from IT leadership to frontline contact centre agents
  • Opportunity to apply and refine Prosci or similar methodologies on a feature-rich SaaS implementation
  • Join a dynamic team located in a central business hub
  • Fulltime