CrawlJobs Logo

Language Engineer

United States, Sunnyvale Employment contract 86500.00 - 151400.00 USD / Year · Job Posted May 20, 2026
Apply Position
Job Link Share

Job Description

The Amazon Artificial General Intelligence (AGI) Data Services organization is responsible for developing diverse datasets to train and evaluate the Amazon AI models. We are looking for Language Engineers to join our science and engineering team to support the development of complex, multimodal datasets, using a range of approaches including synthetic data generation, model-supported data generation, and human-in-the-loop data collections. You will play a critical role in driving innovation and advancing the state-of-the-art in evaluating and training AI models. You will work closely with cross-functional teams, including product managers, engineers, and data scientists to ensure that our AI systems are best in class.

Job Responsibility

  • Design complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches
  • Analyze and extract insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language
  • Use modeling tools to bootstrap or test new AI functionalities
  • Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models

Requirements

  • Experience owning and executing language data collection projects, including guidelines, labelset and annotation workflow development
  • Experience with language annotation and other forms of data markup
  • Experience in one or more scripting languages (e.g., Python, Ruby, Perl)
  • Master's or higher degree in a relevant field (Computational Linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing or AI data creation
  • Experience working with speech, text, and multimodal data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment

Nice to have

  • PhD in Computational Linguistics (or equivalent field with computational emphasis)
  • Expertise in bootstrapping AI data collections for quickly evolving requirements
  • Extensive experience working with speech, text, and multimodal data in multiple languages
  • Experience in data creation for complex agentic workflows
  • Practical experience with Machine Learning and technical concepts such as API
  • Practical knowledge of version control and agile development
  • familiarity with database queries and data analysis processes (SQL, R, Matlab, etc.)

What we offer

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
  • sign-on payments
  • restricted stock units (RSUs)

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Language Engineer

8 matching positions

Language Engineer

The Amazon Artificial General Intelligence (AGI) Data Services organization is l...
Location
Location
United States , Sunnyvale; Boston; Bellevue
Salary
Salary:
Not provided
Amazon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with language annotation and other forms of data markup
  • Experience in one or more scripting languages (e.g., Python, Ruby, Perl)
  • Experience in a fast paced, dynamic organization
  • Masters’s or higher degree in a relevant field (computational linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing
  • Experience working with speech and text language data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
Job Responsibility
Job Responsibility
  • Design data collection/creation tasks in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Analyze and extract language-related insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data authoring, using Python or another scripting language
  • Use modeling tools to bootstrap or test new functionalities
  • Collaborate with scientists and software engineers to evaluate performance of language models
  • Handle competing requests from a range of data customers
What we offer
What we offer
  • Health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • Paid time off
  • Parental leave
  • Sign-on payments
  • Restricted stock units (RSUs)
  • Fulltime
Read More
Arrow Right

Language Engineer

The Amazon Artificial General Intelligence (AGI) Data Services organization is l...
Location
Location
United States , Sunnyvale; Boston; Bellevue
Salary
Salary:
Not provided
Amazon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with language annotation and other forms of data markup
  • Experience with one or more scripting language (e.g., Python, KornShell)
  • Experience working with speech and text language data in multiple languages
  • Experience working in a fast-paced, team environment
  • Masters’s or higher degree in a relevant field (computational linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing
  • Excellent communication, strong organizational skills and very detailed oriented
Job Responsibility
Job Responsibility
  • Design data collection/creation tasks in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Analyze and extract language-related insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data authoring, using Python or another scripting language
  • Use modeling tools to bootstrap or test new functionalities
  • Collaborate with scientists and software engineers to evaluate performance of language models
  • Handle competing requests from a range of data customers
What we offer
What we offer
  • Sign-on payments
  • Restricted stock units (RSUs)
  • Health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • Paid time off
  • Parental leave
  • Fulltime
Read More
Arrow Right

Ai Research Engineer, Language

Meta is seeking talented engineers to join our teams in building cutting-edge pr...
Location
Location
United States , Redmond
Salary
Salary:
181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • 2+ years of programming experience in a relevant language OR a PhD + 9 months programming experience in a relevant language
  • Experience building maintainable and testable codebases, including API design and unit testing techniques
  • Experience effectively utilizing AI technologies and tools (e.g., large language models, agents, etc.) to enhance workflows
Job Responsibility
Job Responsibility
  • Collaborate with cross-functional teams (product, design, operations, infrastructure) to build innovative AI-native application experiences
  • Build and integrate LLM / generative AI capabilities into product surfaces (mobile, web), including prompt engineering, structured prompting, and context management
  • Develop and maintain reusable software components for interfacing with back-end platforms, model serving/inference layers, and AI toolchains
  • Implement retrieval-augmented generation (RAG) patterns (e.g., embeddings + retrieval) and contribute to context-aware and personalized user experiences
  • Design/Contribute to agentic workflows and leverage AI tools and agents (including human-in-the-loop / expert-in-the-loop designs) to automate tasks and scale impact
  • Analyze, debug, and optimize code and systems for quality, efficiency, performance, reliability, and cost
  • Establish effective quality practices for AI features, including evaluation/QA for AI outputs, monitoring, and iterative improvement via feedback loops
  • Architect efficient and scalable systems that power complex applications and AI-enabled features, identify and resolve performance and scalability issues
  • Drive end-to-end execution of medium-to-large features with increasing independence, contribute to technical direction within the team
  • Establish ownership of components, features, or systems with comprehensive end-to-end understanding
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

AI Language Engineer, Artificial General Intelligence - Data Services

The Amazon Artificial General Intelligence (AGI) Data Services organization is r...
Location
Location
Netherlands , Den Haag
Salary
Salary:
Not provided
Amazon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced degree in computer science, mathematics, statistics, machine learning or equivalent quantitative field
  • Experience with language annotation and other forms of data markup
  • Knowledge of one or more scripting languages (e.g., Python, Ruby, Perl)
  • 2+ years experience in computational linguistics or language data processing or AI data creation
  • Experience working with speech, text, and multimodal data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment
Job Responsibility
Job Responsibility
  • Design complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches
  • Analyze and extract insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language
  • Use modeling tools to bootstrap or test new AI functionalities
  • Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models
Read More
Arrow Right

Research Engineer, Language

Reality Labs is seeking a Research Engineer to join our Large Language Model (LL...
Location
Location
United States
Salary
Salary:
154003.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Research experience in machine learning, deep learning, and/or natural language processing
  • Experience with developing machine learning models at scale from inception to business impact
  • Programming experience in Python and hands-on experience with frameworks such as PyTorch
  • Exposure to architectural patterns of large scale software applications
Job Responsibility
Job Responsibility
  • Design methods, tools, and infrastructure to push forward the state of the art in large language models
  • Define research goals informed by practical engineering concerns
  • Contribute to experiments, including designing experimental details, writing reusable code, running evaluations, and organizing results
  • Adapt standard machine learning methods to best exploit modern parallel environments (e.g. distributed clusters, multicore SMP, and GPU)
  • Work with a large and globally distributed team
  • Contribute to publications and open-sourcing efforts
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Language Engineer, Artificial General Intelligence - Data Services

The Amazon Artificial General Intelligence (AGI) Data Services organization is r...
Location
Location
United States , Sunnyvale; Boston; Bellevue
Salary
Salary:
75200.00 - 151400.00 USD / Year
amazon.de Logo
Amazon Pforzheim GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience owning and executing language data collection projects, including guidelines, labelset and annotation workflow development
  • Master's or higher degree in a relevant field (Computational Linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing or AI data creation
  • Experience with language data annotation systems and other forms of data markup
  • Proficient with scripting languages, such as Python
  • Experience working with speech, text, and multimodal data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment
Job Responsibility
Job Responsibility
  • Design complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches
  • Analyze and extract insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language
  • Use modeling tools to bootstrap or test new AI functionalities
  • Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models
What we offer
What we offer
  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
  • sign-on payments
  • restricted stock units (RSUs)
  • Fulltime
Read More
Arrow Right

Language Engineer, Artificial General Intelligence - Data Services

The Amazon Artificial General Intelligence (AGI) Data Services organization is l...
Location
Location
United States , Sunnyvale; Boston; Bellevue
Salary
Salary:
65200.00 - 131100.00 USD / Year
amazon.de Logo
Amazon Pforzheim GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Masters’s or higher degree in a relevant field (computational linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing
  • Experience with language annotation and other forms of data markup
  • Experience with scripting languages, such as Python
  • Experience working with speech and text language data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment
Job Responsibility
Job Responsibility
  • Design data collection/creation tasks in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Analyze and extract language-related insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data authoring, using Python or another scripting language
  • Use modeling tools to bootstrap or test new functionalities
  • Collaborate with scientists and software engineers to evaluate performance of language models
  • Handle competing requests from a range of data customers
What we offer
What we offer
  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
  • sign-on payments
  • restricted stock units (RSUs)
Read More
Arrow Right

Language Engineer, Artificial General Intelligence - Data Services

The Amazon Artificial General Intelligence (AGI) Data Services organization is r...
Location
Location
United States , Sunnyvale; Boston; Bellevue
Salary
Salary:
75200.00 - 151400.00 USD / Year
amazon.de Logo
Amazon Pforzheim GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's or higher degree in a relevant field (Computational Linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing or AI data creation
  • Experience with language data annotation systems and other forms of data markup
  • Proficient with scripting languages, such as Python
  • Experience working with speech, text, and multimodal data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment
Job Responsibility
Job Responsibility
  • Design complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches
  • Analyze and extract insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language
  • Use modeling tools to bootstrap or test new AI functionalities
  • Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models
What we offer
What we offer
  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
Read More
Arrow Right