CrawlJobs Logo

Language Engineer

United States, Sunnyvale · Job Posted May 20, 2026
Apply Position
Job Link Share

Job Description

The Amazon Artificial General Intelligence (AGI) Data Services organization is looking for a Language Engineer with experience in dataset construction, linguistic annotation, dialog/semantic schemas, and automatic processing of large datasets. You will play a critical role in driving innovation and advancing the state-of-the-art in natural language processing and machine learning. You will work closely with cross-functional teams, including product managers, engineers, and data scientists to ensure that our AI systems are aligned with human policies and preferences.

Job Responsibility

  • Design data collection/creation tasks in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Analyze and extract language-related insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data authoring, using Python or another scripting language
  • Use modeling tools to bootstrap or test new functionalities
  • Collaborate with scientists and software engineers to evaluate performance of language models
  • Handle competing requests from a range of data customers

Requirements

  • Experience with language annotation and other forms of data markup
  • Experience with one or more scripting language (e.g., Python, KornShell)
  • Experience working with speech and text language data in multiple languages
  • Experience working in a fast-paced, team environment
  • Masters’s or higher degree in a relevant field (computational linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing
  • Excellent communication, strong organizational skills and very detailed oriented

Nice to have

  • Experience in writing grammars and building FSTs
  • Experience with statistical language modeling
  • PhD in Computational Linguistics (or equivalent field with computational emphasis)
  • Expertise in bootstrapping language data collections in a quickly changing environment
  • Practical knowledge of version control and agile development
  • Familiarity with database queries and data analysis processes (SQL, R, Matlab, etc.)
  • Willingness to support several projects at one time, and to accept reprioritization as necessary
  • Able to think creatively and possess strong analytical and problem solving skills

What we offer

  • Sign-on payments
  • Restricted stock units (RSUs)
  • Health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • Paid time off
  • Parental leave

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Language Engineer

8 matching positions

Language Engineer

The Amazon Artificial General Intelligence (AGI) Data Services organization is r...
Location
Location
United States , Sunnyvale; Boston; Bellevue
Salary
Salary:
86500.00 - 151400.00 USD / Year
Amazon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience owning and executing language data collection projects, including guidelines, labelset and annotation workflow development
  • Experience with language annotation and other forms of data markup
  • Experience in one or more scripting languages (e.g., Python, Ruby, Perl)
  • Master's or higher degree in a relevant field (Computational Linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing or AI data creation
  • Experience working with speech, text, and multimodal data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment
Job Responsibility
Job Responsibility
  • Design complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches
  • Analyze and extract insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language
  • Use modeling tools to bootstrap or test new AI functionalities
  • Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models
What we offer
What we offer
  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
  • sign-on payments
  • restricted stock units (RSUs)
  • Fulltime
Read More
Arrow Right

Language Engineer

The Amazon Artificial General Intelligence (AGI) Data Services organization is l...
Location
Location
United States , Sunnyvale; Boston; Bellevue
Salary
Salary:
Not provided
Amazon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with language annotation and other forms of data markup
  • Experience in one or more scripting languages (e.g., Python, Ruby, Perl)
  • Experience in a fast paced, dynamic organization
  • Masters’s or higher degree in a relevant field (computational linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing
  • Experience working with speech and text language data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
Job Responsibility
Job Responsibility
  • Design data collection/creation tasks in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Analyze and extract language-related insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data authoring, using Python or another scripting language
  • Use modeling tools to bootstrap or test new functionalities
  • Collaborate with scientists and software engineers to evaluate performance of language models
  • Handle competing requests from a range of data customers
What we offer
What we offer
  • Health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • Paid time off
  • Parental leave
  • Sign-on payments
  • Restricted stock units (RSUs)
  • Fulltime
Read More
Arrow Right

Ai Research Engineer, Language

Meta is seeking talented engineers to join our teams in building cutting-edge pr...
Location
Location
United States , Redmond
Salary
Salary:
181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • 2+ years of programming experience in a relevant language OR a PhD + 9 months programming experience in a relevant language
  • Experience building maintainable and testable codebases, including API design and unit testing techniques
  • Experience effectively utilizing AI technologies and tools (e.g., large language models, agents, etc.) to enhance workflows
Job Responsibility
Job Responsibility
  • Collaborate with cross-functional teams (product, design, operations, infrastructure) to build innovative AI-native application experiences
  • Build and integrate LLM / generative AI capabilities into product surfaces (mobile, web), including prompt engineering, structured prompting, and context management
  • Develop and maintain reusable software components for interfacing with back-end platforms, model serving/inference layers, and AI toolchains
  • Implement retrieval-augmented generation (RAG) patterns (e.g., embeddings + retrieval) and contribute to context-aware and personalized user experiences
  • Design/Contribute to agentic workflows and leverage AI tools and agents (including human-in-the-loop / expert-in-the-loop designs) to automate tasks and scale impact
  • Analyze, debug, and optimize code and systems for quality, efficiency, performance, reliability, and cost
  • Establish effective quality practices for AI features, including evaluation/QA for AI outputs, monitoring, and iterative improvement via feedback loops
  • Architect efficient and scalable systems that power complex applications and AI-enabled features, identify and resolve performance and scalability issues
  • Drive end-to-end execution of medium-to-large features with increasing independence, contribute to technical direction within the team
  • Establish ownership of components, features, or systems with comprehensive end-to-end understanding
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

AI Language Engineer, Artificial General Intelligence - Data Services

The Amazon Artificial General Intelligence (AGI) Data Services organization is r...
Location
Location
Netherlands , Den Haag
Salary
Salary:
Not provided
Amazon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced degree in computer science, mathematics, statistics, machine learning or equivalent quantitative field
  • Experience with language annotation and other forms of data markup
  • Knowledge of one or more scripting languages (e.g., Python, Ruby, Perl)
  • 2+ years experience in computational linguistics or language data processing or AI data creation
  • Experience working with speech, text, and multimodal data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment
Job Responsibility
Job Responsibility
  • Design complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches
  • Analyze and extract insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language
  • Use modeling tools to bootstrap or test new AI functionalities
  • Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models
Read More
Arrow Right

Research Engineer, Language

Reality Labs is seeking a Research Engineer to join our Large Language Model (LL...
Location
Location
United States
Salary
Salary:
154003.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Research experience in machine learning, deep learning, and/or natural language processing
  • Experience with developing machine learning models at scale from inception to business impact
  • Programming experience in Python and hands-on experience with frameworks such as PyTorch
  • Exposure to architectural patterns of large scale software applications
Job Responsibility
Job Responsibility
  • Design methods, tools, and infrastructure to push forward the state of the art in large language models
  • Define research goals informed by practical engineering concerns
  • Contribute to experiments, including designing experimental details, writing reusable code, running evaluations, and organizing results
  • Adapt standard machine learning methods to best exploit modern parallel environments (e.g. distributed clusters, multicore SMP, and GPU)
  • Work with a large and globally distributed team
  • Contribute to publications and open-sourcing efforts
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Language Engineer, Artificial General Intelligence - Data Services

The Amazon Artificial General Intelligence (AGI) Data Services organization is r...
Location
Location
United States , Sunnyvale; Boston; Bellevue
Salary
Salary:
75200.00 - 151400.00 USD / Year
amazon.de Logo
Amazon Pforzheim GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience owning and executing language data collection projects, including guidelines, labelset and annotation workflow development
  • Master's or higher degree in a relevant field (Computational Linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing or AI data creation
  • Experience with language data annotation systems and other forms of data markup
  • Proficient with scripting languages, such as Python
  • Experience working with speech, text, and multimodal data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment
Job Responsibility
Job Responsibility
  • Design complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches
  • Analyze and extract insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language
  • Use modeling tools to bootstrap or test new AI functionalities
  • Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models
What we offer
What we offer
  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
  • sign-on payments
  • restricted stock units (RSUs)
  • Fulltime
Read More
Arrow Right

Language Engineer, Artificial General Intelligence - Data Services

The Amazon Artificial General Intelligence (AGI) Data Services organization is l...
Location
Location
United States , Sunnyvale; Boston; Bellevue
Salary
Salary:
65200.00 - 131100.00 USD / Year
amazon.de Logo
Amazon Pforzheim GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Masters’s or higher degree in a relevant field (computational linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing
  • Experience with language annotation and other forms of data markup
  • Experience with scripting languages, such as Python
  • Experience working with speech and text language data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment
Job Responsibility
Job Responsibility
  • Design data collection/creation tasks in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Analyze and extract language-related insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data authoring, using Python or another scripting language
  • Use modeling tools to bootstrap or test new functionalities
  • Collaborate with scientists and software engineers to evaluate performance of language models
  • Handle competing requests from a range of data customers
What we offer
What we offer
  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
  • sign-on payments
  • restricted stock units (RSUs)
Read More
Arrow Right

Language Engineer, Artificial General Intelligence - Data Services

The Amazon Artificial General Intelligence (AGI) Data Services organization is r...
Location
Location
United States , Sunnyvale; Boston; Bellevue
Salary
Salary:
75200.00 - 151400.00 USD / Year
amazon.de Logo
Amazon Pforzheim GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's or higher degree in a relevant field (Computational Linguistics or equivalent field with computational analysis)
  • 2+ years experience in computational linguistics or language data processing or AI data creation
  • Experience with language data annotation systems and other forms of data markup
  • Proficient with scripting languages, such as Python
  • Experience working with speech, text, and multimodal data in multiple languages
  • Excellent communication, strong organizational skills and very detailed oriented
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment
Job Responsibility
Job Responsibility
  • Design complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
  • Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches
  • Analyze and extract insights from large amounts of data
  • Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language
  • Use modeling tools to bootstrap or test new AI functionalities
  • Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models
What we offer
What we offer
  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
Read More
Arrow Right