CrawlJobs Logo

Ai engineer (speech)

https://www.randstad.com Logo

Randstad

Location Icon

Location:
Japan , Tokyo

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

8000000.00 - 12000000.00 JPY / Year

Job Responsibility:

  • Conduct research and development on speech language models, including spoken dialogue, speech recognition, synthesis, and translation
  • Generate and validate large-scale speech datasets, and explore effective preprocessing methods

Requirements:

  • Strong initiative and execution skills to drive large-scale training
  • Experience in developing models in one or more of the following areas: speech recognition, speech synthesis, speech translation, or spoken dialogue systems
  • Proficiency in Python and hands-on experience with deep learning frameworks such as PyTorch
What we offer:
  • 健康保険
  • 厚生年金保険
  • 雇用保険
  • 賞与

Additional Information:

Job Posted:
February 16, 2026

Expiration:
January 11, 2028

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Ai engineer (speech)

Multimodal Speech Engineer, AI Companion

As a Multimodal Speech Engineer on the AI Companion Team, you will lead the deve...
Location
Location
United States , Palo Alto
Salary
Salary:
150000.00 - 250000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience in speech and audio modeling domains
  • Experience with multi-modal conversational models (language, audio, vision)
  • Ability to take open-ended problems in conversation modeling, develop creative solutions, build proof-of-concepts, and scale them to production
Job Responsibility
Job Responsibility
  • Design and implement data pipelines for large-scale speech interactions using internal and external datasets
  • Train speech-to-speech models that incorporate awareness of NEO’s physical form
  • Create dynamic responses for a wide range of user queries
  • Synchronize NEO’s speech with physical gestures and body language
  • Customize NEO’s speech behavior to reflect different personalities
What we offer
What we offer
  • Equity
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Fulltime
Read More
Arrow Right

Staff Backend Engineer, Speech AI

Our intelligent runtime must seamlessly connect to foundational models to power ...
Location
Location
United States , Mountain View
Salary
Salary:
200000.00 - 300000.00 USD / Year
inworld.ai Logo
Inworld AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A BA/BS degree in Computer Science or a related technical field, or equivalent practical experience
  • 5+ years of professional experience in software development, with a proven track record of shipping high-quality, user-facing products
  • Strong product sense and an ability to think critically about user experience and business impact
  • Demonstrated experience in building and scaling production-grade backend APIs and distributed systems
  • Strong proficiency in Python and professional experience with one or more of the following: Java/Kotlin, or Go
  • Hands-on experience with containerization (Docker) and deploying services on orchestration platforms like Kubernetes
  • A solid foundation in data structures, algorithms, and system design
Job Responsibility
Job Responsibility
  • Own features end-to-end, from collaborating on the initial concept with product managers to shipping and monitoring the final product
  • Translate product requirements and user needs into robust, scalable, and maintainable backend services and APIs
  • Design, build, and launch user-facing APIs and backend systems in Python, Java/Kotlin, and Go that deliver seamless voice experiences
  • Partner closely with Product Managers and ML engineers to define scope, identify technical trade-offs, and drive the product roadmap forward
  • Write high-quality, production-grade code that powers real-time audio processing, model inference, and complex data pipelines
  • Champion engineering and product excellence, with a focus on delivering tangible value to our users quickly and iteratively
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • relocation assistance
  • Fulltime
Read More
Arrow Right

Multimodal Speech Engineer

The AI Companion team creates the speech interface for NEO, as well as the physi...
Location
Location
United States , Palo Alto
Salary
Salary:
150000.00 - 250000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience in speech and audio modeling domains
  • Experience in multi-modal conversational models (language, audio, vision) is a strong plus
  • Ability to take open-ended problems in conversation models, come up with creative solutions, implement proof-of-concepts, and translate those to production.
Job Responsibility
Job Responsibility
  • Design and implement data pipelines for large scale speech interactions from NEO data and external datasets
  • Train speech2speech models to be aware of NEO’s embodiment
  • Design appropriate responses for a variety of user queries
  • Synchronize speech with body language
  • Customize NEO with different personalities
What we offer
What we offer
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Fulltime
Read More
Arrow Right

Senior AI Engineer

We are seeking an experienced Senior Python Software Engineer (Senior AI Develop...
Location
Location
Poland , Warsaw
Salary
Salary:
Not provided
https://www.inetum.com Logo
Inetum
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Degree in Computer Science, Data Science, Artificial Intelligence, or a related field, or equivalent practical experience
  • Several years of experience in AI and Machine Learning development, ideally within Customer Care solutions
  • Strong proficiency in Python and NLP frameworks
  • Hands-on experience with Azure AI services (e.g., Azure Machine Learning, Cognitive Services, Bot Services)
  • Solid understanding of cloud architectures and microservices on Azure
  • Experience with CI/CD pipelines and MLOps
  • Analytical mindset and strong problem-solving capabilities
  • Polish & English speaker
Job Responsibility
Job Responsibility
  • Design, develop, and integrate AI/ML solutions, with a particular focus on Generative AI (GenAI), LLMs, and multi-modal (chat, voice) interfaces
  • Architect and deliver customer-facing AI agents that provide real-time, intelligent automation for support, marketing, or transactional use cases
  • Build and maintain multi-model pipelines for inference, fine-tuning, chunking, and embedding-based retrieval (RAG) systems
  • Deploy, monitor, and optimize AI models in production-grade environments using Kubernetes and Azure-native services
  • Integrate GenAI agents with cross-company APIs, backend services, and partner systems through MCP for dynamic tool use and data enrichment
  • Collaborate closely with DevOps engineers to implement scalable CI/CD pipelines, infrastructure-as-code, and secure AI workload automation
  • Evaluate and integrate open-source and proprietary LLMs, embeddings, and vector databases
  • Optimize prompt engineering strategies and implement orchestration tools (e.g., LangChain, MCP) to enable complex task execution
  • Build robust model evaluation frameworks, A/B testing environments, and experiment tracking for iterative development
  • Design privacy-first AI workflows that comply with GDPR, anonymization, and auditability (e.g., PII scrubbing, user consent)
What we offer
What we offer
  • Flexible working hours
  • Hybrid work model, allowing employees to divide their time between home and modern offices in key Polish cities
  • A cafeteria system that allows employees to personalize benefits by choosing from a variety of options
  • Generous referral bonuses, offering up to PLN6,000 for referring specialists
  • Additional revenue sharing opportunities for initiating partnerships with new clients
  • Ongoing guidance from a dedicated Team Manager for each employee
  • Tailored technical mentoring from an assigned technical leader, depending on individual expertise and project needs
  • Dedicated team-building budget for online and on-site team events
  • Opportunities to participate in charitable initiatives and local sports programs
  • A supportive and inclusive work culture with an emphasis on diversity and mutual respect
  • Fulltime
Read More
Arrow Right

Full-Stack Engineer, AI Companion

The AI Companion team creates the speech interface to NEO, as well as the physic...
Location
Location
United States , Palo Alto
Salary
Salary:
150000.00 - 250000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of experience with C++
  • 4+ years of experience with Python
  • 4+ years of experience with Bazel
  • 4+ years of experience with PyTorch
  • Experience with real‑time or streaming model architectures or systems
  • Product obsession with quality, performance, and design taste
  • Ability to take research ideas into production systems that work reliably
  • Good product taste as pertaining to human‑robot interaction, non‑verbal communication, and speech UX
Job Responsibility
Job Responsibility
  • Design the software architecture for real-time multimodal I/O
  • Design application flows like scheduling chores and triggering autonomous tasks from the voice interface
  • Optimize the companion stack for enabling seamless interactions with NEO
  • Make the Companion scalable and reliable while serving models from remote machines
What we offer
What we offer
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Fulltime
Read More
Arrow Right

Body Language Engineer, AI Companion

As a Body Language Engineer on the AI Companion team at 1X, you will build and r...
Location
Location
United States , Palo Alto
Salary
Salary:
150000.00 - 250000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience training multi‑modal models combining language, audio, and vision
  • Experience in robotics — such as path planning for collision‑free trajectories and kinematic retargeting for humanoid robots
  • Strong ability to tackle open‑ended problems in conversation and behavior modeling: come up with creative solutions, build proofs-of-concept, and drive them through to production
Job Responsibility
Job Responsibility
  • Scale up data collection efforts for nonverbal communication behaviors
  • Extend real‑time speech‑to‑speech models to also include real‑time emotive body language
  • Design and implement video understanding models to interpret user body language and intent
  • Devise methods to blend functional visuomotor tasks (e.g. object manipulation) with communicative behaviors (gesticulation, expressive motion)
What we offer
What we offer
  • Equity
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Fulltime
Read More
Arrow Right

Research Engineer

We are looking for a Research Engineer to join the research team at ElevenLabs. ...
Location
Location
Poland
Salary
Salary:
Not provided
elevenlabs.io Logo
ElevenLabs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years industry experience as a Machine Learning Engineer, with a key emphasis on constructing data pipelines, as well as developing and implementing machine learning models
  • Demonstrating the capacity to autonomously evaluate novel concepts or enhance current machine learning projects, with the potential outcome of contributing to published works
  • Extensive background in conducting exploratory research to enhance the excellence of gathered data, particularly within the realm of audio and text-to-speech domains
Job Responsibility
Job Responsibility
  • Creating and upholding a reliable and expandable data management system specialized for text-to-speech projects. This includes establishing guidelines for versioning and ensuring data quality
  • Establishing a streamlined process for autonomously training, assessing, and launching text-to-speech models. This encompasses implementing procedures for dynamic learning, as well as routines for fine-tuning and refreshing validation data
  • Investigating cutting-edge approaches and strategies in machine learning, deep learning, and algorithms pertaining to text-to-speech technology
What we offer
What we offer
  • Innovative culture
  • Growth paths
  • Learning & development: ElevenLabs proactively supports professional development through an annual discretionary stipend
  • Social travel: We also provide an annual discretionary stipend to meet up with colleagues each year, however you choose
  • Annual company offsite
  • Co-working: If you’re not located near one of our main hubs, we offer a monthly co-working stipend
  • Fulltime
Read More
Arrow Right

ML Engineer

The IT company Andersen invites an experienced ML Engineer for a large-scale pro...
Location
Location
Salary
Salary:
Not provided
andersenlab.com Logo
Andersen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience as a ML Engineer for 3+ years
  • Strong proficiency in Python, with deep knowledge of software development principles, architecture patterns, and ML model integration
  • Hands-on experience with TTS systems (e.g., Tacotron, FastSpeech, VITS) and an understanding of SST pipelines
  • Familiarity with real-time AI systems, including LLM integration and latency-sensitive applications
  • Experience tuning and maintaining ML models for performance, scalability, and quality in production
  • Level of English – from Intermediate+
Job Responsibility
Job Responsibility
  • Designing, integrating, and optimizing Text-to-Speech (TTS) systems within real-time conversational AI pipelines
  • Fine-tuning models based on user feedback, improving clarity, naturalness, and emotional expression in voice output
  • Contributing to customer-specific deployments with high adaptability and quick turnaround requirements
  • Collaborating with ML, product, and engineering teams to ensure seamless voice experiences across our platform
What we offer
What we offer
  • Experience in teamwork with leaders in FinTech, Healthcare, Retail, Telecom, and others
  • The opportunity to change the project and/or develop expertise in an interesting business domain
  • Guarantee of professional, financial, and career growth
  • The opportunity to earn up to an additional 1,000 EUR per month, depending on the level of expertise, which will be included in the annual bonus, by participating in the company's activities
  • Access to the corporate training portal
  • Bright corporate life (parties / pizza days / PlayStation / fruits / coffee / snacks / movies)
  • Certification compensation (AWS, PMP, etc)
  • Referral program
  • English courses
  • Private health insurance and compensation for sports activities
Read More
Arrow Right