Natural language annotator

One Forma

Location:

Contract Type:
Not provided

Salary:
Not provided

Job Description:

Natural Language (NL) Annotators will analyze and annotate user utterances by converting them into structured representations of intent. These annotations enable AI systems to accurately understand and respond to user requests, ultimately improving the system’s performance and reliability.

Job Responsibility:

Analyze and annotate user utterances by converting them into structured representations of intent

Requirements:

  • Must be a native speaker of the target language
  • Commitment to complete the onboarding and training process
  • Availability to contribute 30–50 hours per month
  • Consistent delivery of work that meets predefined quality standards, metrics, and turnaround times
  • Ability to identify, analyze, and escalate anomalies, trends, or recurring issues
  • Proactive and clear communication with the Project Manager
  • Strict adherence to all project guidelines, specifications, and procedures
  • Access to a macOS system
  • Academic background in Linguistics, Philology, Translation, or related field
  • At least 1 year of experience working on annotation or data labeling projects
  • Strong technical curiosity and experience working across different technical tools or environments
  • Proven ability to identify, troubleshoot, and resolve issues of varying complexity

Additional Information:

Job Posted:
February 16, 2026

Similar Jobs for Natural language annotator

Mint

Participants will be requested to review and annotate bilingual text across the ...
Location:
Salary:
Not provided
One Forma
Expiration Date
Until further notice
Requirements:
  • Native speaker of, or highly proficient in, both source and target languages
Job Responsibility:
  • Review and annotate bilingual text across the language pair assigned, evaluating both source and target text quality
  • Evaluate the source text itself for naturalness and language correctness
  • Annotations will be done following an annotation schema provided by our client
  • Tasks will be done on our online platform
  • Review AI-generated translations for accuracy, fluency, and idiomatic usage
  • Ensure translations follow the grammar and syntax rules of the target language

Lead Applied Scientist Security Models Training Team

The Security Models Training team is expanding to drive the development of a new...
Location:
Israel, Tel Aviv, Herzliya
Salary:
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements:
  • M.Sc. / Ph.D. in Computer Science, Information Systems, Electrical or Computer Engineering or Data Science (Ph.D. strongly preferred)
  • Candidates with M.Sc. / Ph.D. in related fields with proven industry experience or a strong publication record in the areas of LLM, Information Retrieval, Machine Learning, Natural Language Processing, Time Series Forecasting and Deep Learning are considered as well
  • Proven hands-on experience of at least 8 years (including post-grad work) in building and deploying Machine Learning products
  • Key areas of expertise include Natural Language Processing and Large Language Models, along with an understanding of concepts such as Privacy and Responsible AI
  • Candidates are expected to demonstrate a strong history of successfully translating applied research into production-ready solutions, along with a proven track record of delivering projects within large-scale production environments
  • Demonstrated ability to set long‑term technical strategy, align multiple teams, and serve as a technical decision‑maker for high‑risk, high‑impact investments
  • Proven expertise in the LLM and/or time-series forecasting domain, demonstrating comprehensive knowledge of relevant concepts in the domain
  • Ideal applicants should be proficient in areas such as LLM pre- and post-training, including CPT, SFT and RL, LLM benchmarking, agentic flows, and model alignment
  • Hands-on experience in building neural model architectures at the 100M+ scale and the proficiency to adapt them at all abstraction levels down to the individual block (e.g. changing the inner workings of an attention block, introducing new blocks, or changing the routings)
  • Demonstrated proficiency in problem-solving and data analysis, with substantial expertise in evaluating the performance of large language models (LLMs) and/or time-series forecasting models, developing benchmarks tailored to practical scenarios
Job Responsibility:
  • Technical Leadership & Ownership: set technical direction for major security domain initiatives and align roadmaps across multiple teams
  • lead security model programs spanning pre‑training, task tuning, reinforcement learning, and evaluation
  • translate cutting‑edge research into production‑ready capabilities
  • This role influences portfolio‑level technical tradeoffs, investment prioritization, and long‑term architecture decisions for security models
  • Advanced Model Design – Building and customizing deep learning model architectures (e.g., modifying transformer blocks, attention/memory modules, etc.) at the SLM/LLM scale
  • making principled architectural tradeoffs to improve reliability, robustness, and security‑specific behavior
  • Advanced Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and other modalities, including time-series
  • Design & Evaluate Datasets – Build high-quality datasets and benchmarks
  • define objective evaluation frameworks and quality gates
  • run ablation studies to measure impact and optimize data and training effectiveness to support confident product decisions
  • Full-time

Data Entry Clerk

AI is the most transformational technology of our time, capable of tackling some...
Location:
Netherlands, Den Haag
Salary:
Not provided
Amazon
Expiration Date
Until further notice
Requirements:
  • Experience in natural language data labeling, data annotation, linguistic annotation or other forms of data markup
  • Speak, write, and read fluently in German
Job Responsibility:
  • Maintain and follow strict confidentiality as customer privacy is our most important tenet
  • Work with a range of different types of data including, but not limited to: text, speech, audio, image, and video
  • Deliver high-quality labelled data, using guidelines provided to meet our KPIs and using in-house tools and software, as part of Amazon's commitment to developing and deploying AI responsibly
  • Demonstrate proficiency in generating high-quality human insight data across a range of modalities, including text, image, video, and audio
  • Capable of making sound judgments and logical decisions when faced with ambiguous or incomplete information while performing tasks
  • Eye for detail and ability to pivot from one category of requirement to another instantaneously
  • Demonstrate support on daily operational deliverables for multiple task types assigned to you and the team
  • Analyze root causes, identify error patterns, and propose solutions to enhance the quality of labeling tasks and their outputs
  • Identify day-to-day process and operational issues in Standard Operating Procedures and tools, and suggest changes to unblock operations
  • Demonstrate ownership in floor support by clarifying internal queries during execution as needed

Linguist

Join our team as a passionate Linguist and help shape compelling language soluti...
Location:
India, Noida
Salary:
Not provided
AquSag Technologies
Expiration Date
Until further notice
Requirements:
  • Demonstrated passion for linguistics, language structure, and communication
  • Exceptional written and verbal communication skills in both English and target language(s)
  • Educational background in Linguistics, Applied Linguistics, or a related field
  • Attention to detail with a commitment to linguistic accuracy
  • Proven ability to work independently and manage priorities in a remote setting
  • Strong teamwork skills and willingness to collaborate closely with peers and project leads
  • Open mindset, eager to learn, and comfortable with feedback
Job Responsibility:
  • Analyze, review, and edit written materials for linguistic accuracy, style, and clarity
  • Contribute to the development and maintenance of linguistic resources such as glossaries, style guides, and corpora
  • Support language data annotation and quality assurance processes
  • Collaborate with cross-functional teams to ensure the linguistic integrity of diverse projects
  • Participate in linguistic research to identify trends and inform project strategy
  • Provide feedback to improve natural language processing models and language tools
  • Communicate findings and recommendations clearly to both technical and non-technical stakeholders
  • Full-time

AI/ML & Data Engineer

Location:
India, Chennai
Salary:
Not provided
Congruent
Expiration Date
Until further notice
Requirements:
  • 4–6 years in ML/NLP, preferably in document-heavy domains (finance, legal, policy)
  • Languages: Python, SQL
  • AI/ML/NLP: Hugging Face Transformers, OpenAI API, spaCy, scikit-learn, LangChain, RAG, LLM prompt-tuning, LLM fine-tuning
  • Vector Search: Pinecone, Weaviate, FAISS
  • Data Engineering: Airflow, Kafka, OCR (Tesseract, pdfminer)
  • MLOps: MLflow, Docker
Job Responsibility:
  • Data Ingestion and Preprocessing: Ability to build and maintain data pipelines to ingest unstructured data from PDFs, gazettes, HTML circulars etc. and process data extraction, parsing, and normalization
  • NLP & LLM Modeling: Ability to fine-tune or prompt-tune LLMs for summarization, classification, and change detection in regulations. Ability to develop embeddings for semantic similarity.
  • Knowledge Graph Engineering: Ability to design entity relationships (regulation, control, policy) and implement retrieval over Neo4j or similar graph DBs.
  • Information Retrieval (RAG): Ability to build RAG pipelines for natural language querying of regulations.
  • Annotation and Validation: Ability to annotate training data by collaborating with SMEs and validate model outputs
  • MLOps: Ability to build CI/CD for model retraining, versioning, and evaluation (precision, recall, BLEU, etc.)
  • API and Integration: Ability to expose ML models as REST APIs (FastAPI) for integration with product frontend.

Principal/Senior Applied Scientist Security Models Training Team - Next-Gen frontier research

The Security Models Training team is expanding to drive the development of a new...
Location:
Israel, Tel Aviv, Herzliya
Salary:
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements:
  • M.Sc. / Ph.D. in Computer Science, Information Systems, Electrical or Computer Engineering or Data Science (Ph.D. strongly preferred)
  • Candidates with M.Sc. / Ph.D. in related fields with proven industry experience or a strong publication record in the areas of LLM, Information Retrieval, Machine Learning, Natural Language Processing, Time Series Forecasting and Deep Learning are considered as well
  • Proven hands-on experience of at least 5 years (including post-grad work) in building and deploying Machine Learning products
  • Key areas of expertise include Natural Language Processing and Large Language Models, along with an understanding of concepts such as Privacy and Responsible AI
  • Candidates are expected to demonstrate a strong history of successfully translating applied research into production-ready solutions, along with a proven track record of delivering projects within large-scale production environments
  • Proven expertise in the LLM and/or time-series forecasting domain, demonstrating comprehensive knowledge of relevant concepts in the domain
  • Ideal applicants should be proficient in areas such as LLM pre- and post-training, including CPT, SFT and RL, LLM benchmarking, agentic flows, and model alignment
  • Hands-on experience in building neural model architectures at the 100M+ scale and the proficiency to adapt them at all abstraction levels down to the individual block (e.g. changing the inner workings of an attention block, introducing new blocks, or changing the routings)
  • Demonstrated proficiency in problem-solving and data analysis, with substantial expertise in evaluating the performance of large language models (LLMs) and/or time-series forecasting models, developing benchmarks tailored to practical scenarios
Job Responsibility:
  • Technical Leadership & Ownership: set technical direction for major security domain initiatives
  • lead security model programs spanning pre‑training, task tuning, reinforcement learning, and evaluation
  • translate cutting‑edge research into production‑ready capabilities
  • Advanced Model Design – Building and customizing deep learning model architectures (e.g., modifying transformer blocks, attention/memory modules, etc.) at the SLM/LLM scale
  • making principled architectural tradeoffs to improve reliability, robustness, and security‑specific behavior
  • Advanced Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and other modalities, including time-series
  • Design & Evaluate Datasets – Build high-quality datasets and benchmarks
  • define objective evaluation frameworks and quality gates
  • run ablation studies to measure impact and optimize data and training effectiveness to support confident product decisions
  • Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets, with attention to privacy, governance, and long‑term reuse across security scenarios
  • Full-time

Japanese audio collection projects

The Voice Command Audio Collection Project focuses on collecting natural speech ...
Location:
Japan, Tokyo
Salary:
Not provided
Sigma AI
Expiration Date
Until further notice
Requirements:
  • Fluent in the language you are applying for
  • Knowledge of English is optional
  • Mobile phone with Android OS
  • Minimum 4GB RAM
  • Microphone and webcam
  • Operating system: Windows 10 or higher, macOS 13 Ventura or higher
  • All OS updates installed and supported by the vendor
  • Stable internet connection
  • Headphones
  • Secure internet location, protected by a strong password
Job Responsibility:
  • Produce a total of 500 short utterances, completed across two recording sessions
  • Deliver recordings in a natural, conversational tone, with expressive and realistic intonation to reflect everyday voice interactions

Italian audio collection projects

The Voice Command Audio Collection Project focuses on collecting natural speech ...
Location:
Italy
Salary:
Not provided
Sigma AI
Expiration Date
Until further notice
Requirements:
  • Fluent in the language you are applying for
  • Knowledge of English is optional
  • Mobile phone with Android OS (Tablets and iOS devices are not supported)
  • Minimum 4GB RAM
  • Microphone and webcam
  • Operating system: Windows 10 or higher or macOS 13 Ventura or higher
  • All OS updates installed and supported by the vendor
  • Stable internet connection
  • Headphones
  • Secure internet location, protected by a strong password
Job Responsibility:
  • Produce a total of 500 short utterances, completed across two recording sessions
  • Deliver recordings in a natural, conversational tone, with expressive and realistic intonation to reflect everyday voice interactions