Ai engineer (speech) Job at Randstad (Tokyo)

Multimodal Speech Engineer, AI Companion

As a Multimodal Speech Engineer on the AI Companion Team, you will lead the deve...

Location

United States , Palo Alto

Salary:

150000.00 - 250000.00 USD / Year

1X Technologies

Expiration Date

Until further notice

Requirements

3+ years of experience in speech and audio modeling domains
Experience with multi-modal conversational models (language, audio, vision)
Ability to take open-ended problems in conversation modeling, develop creative solutions, build proof-of-concepts, and scale them to production

Job Responsibility

Design and implement data pipelines for large-scale speech interactions using internal and external datasets
Train speech-to-speech models that incorporate awareness of NEO’s physical form
Create dynamic responses for a wide range of user queries
Synchronize NEO’s speech with physical gestures and body language
Customize NEO’s speech behavior to reflect different personalities

What we offer

Equity
Health, dental, and vision insurance
401(k) with company match
Paid time off and holidays

Fulltime

Staff Backend Engineer, Speech AI

Our intelligent runtime must seamlessly connect to foundational models to power ...

Location

United States , Mountain View

Salary:

200000.00 - 300000.00 USD / Year

Inworld AI

Expiration Date

Until further notice

Requirements

A BA/BS degree in Computer Science or a related technical field, or equivalent practical experience
5+ years of professional experience in software development, with a proven track record of shipping high-quality, user-facing products
Strong product sense and an ability to think critically about user experience and business impact
Demonstrated experience in building and scaling production-grade backend APIs and distributed systems
Strong proficiency in Python and professional experience with one or more of the following: Java/Kotlin, or Go
Hands-on experience with containerization (Docker) and deploying services on orchestration platforms like Kubernetes
A solid foundation in data structures, algorithms, and system design

Job Responsibility

Own features end-to-end, from collaborating on the initial concept with product managers to shipping and monitoring the final product
Translate product requirements and user needs into robust, scalable, and maintainable backend services and APIs
Design, build, and launch user-facing APIs and backend systems in Python, Java/Kotlin, and Go that deliver seamless voice experiences
Partner closely with Product Managers and ML engineers to define scope, identify technical trade-offs, and drive the product roadmap forward
Write high-quality, production-grade code that powers real-time audio processing, model inference, and complex data pipelines
Champion engineering and product excellence, with a focus on delivering tangible value to our users quickly and iteratively

What we offer

bonus
equity
benefits
relocation assistance

Fulltime

Multimodal Speech Engineer

The AI Companion team creates the speech interface for NEO, as well as the physi...

Location

United States , Palo Alto

Salary:

150000.00 - 250000.00 USD / Year

1X Technologies

Expiration Date

Until further notice

Requirements

3+ years of experience in speech and audio modeling domains
Experience in multi-modal conversational models (language, audio, vision) is a strong plus
Ability to take open-ended problems in conversation models, come up with creative solutions, implement proof-of-concepts, and translate those to production.

Job Responsibility

Design and implement data pipelines for large scale speech interactions from NEO data and external datasets
Train speech2speech models to be aware of NEO’s embodiment
Design appropriate responses for a variety of user queries
Synchronize speech with body language
Customize NEO with different personalities

What we offer

Health, dental, and vision insurance
401(k) with company match
Paid time off and holidays

Fulltime

Senior AI Engineer

We are seeking an experienced Senior Python Software Engineer (Senior AI Develop...

Location

Poland , Warsaw

Salary:

Not provided

Inetum

Expiration Date

Until further notice

Requirements

Degree in Computer Science, Data Science, Artificial Intelligence, or a related field, or equivalent practical experience
Several years of experience in AI and Machine Learning development, ideally within Customer Care solutions
Strong proficiency in Python and NLP frameworks
Hands-on experience with Azure AI services (e.g., Azure Machine Learning, Cognitive Services, Bot Services)
Solid understanding of cloud architectures and microservices on Azure
Experience with CI/CD pipelines and MLOps
Analytical mindset and strong problem-solving capabilities
Polish & English speaker

Job Responsibility

Design, develop, and integrate AI/ML solutions, with a particular focus on Generative AI (GenAI), LLMs, and multi-modal (chat, voice) interfaces
Architect and deliver customer-facing AI agents that provide real-time, intelligent automation for support, marketing, or transactional use cases
Build and maintain multi-model pipelines for inference, fine-tuning, chunking, and embedding-based retrieval (RAG) systems
Deploy, monitor, and optimize AI models in production-grade environments using Kubernetes and Azure-native services
Integrate GenAI agents with cross-company APIs, backend services, and partner systems through MCP for dynamic tool use and data enrichment
Collaborate closely with DevOps engineers to implement scalable CI/CD pipelines, infrastructure-as-code, and secure AI workload automation
Evaluate and integrate open-source and proprietary LLMs, embeddings, and vector databases
Optimize prompt engineering strategies and implement orchestration tools (e.g., LangChain, MCP) to enable complex task execution
Build robust model evaluation frameworks, A/B testing environments, and experiment tracking for iterative development
Design privacy-first AI workflows that comply with GDPR, anonymization, and auditability (e.g., PII scrubbing, user consent)

What we offer

Flexible working hours
Hybrid work model, allowing employees to divide their time between home and modern offices in key Polish cities
A cafeteria system that allows employees to personalize benefits by choosing from a variety of options
Generous referral bonuses, offering up to PLN6,000 for referring specialists
Additional revenue sharing opportunities for initiating partnerships with new clients
Ongoing guidance from a dedicated Team Manager for each employee
Tailored technical mentoring from an assigned technical leader, depending on individual expertise and project needs
Dedicated team-building budget for online and on-site team events
Opportunities to participate in charitable initiatives and local sports programs
A supportive and inclusive work culture with an emphasis on diversity and mutual respect

Fulltime

Full-Stack Engineer, AI Companion

The AI Companion team creates the speech interface to NEO, as well as the physic...

Location

United States , Palo Alto

Salary:

150000.00 - 250000.00 USD / Year

1X Technologies

Expiration Date

Until further notice

Requirements

4+ years of experience with C++
4+ years of experience with Python
4+ years of experience with Bazel
4+ years of experience with PyTorch
Experience with real‑time or streaming model architectures or systems
Product obsession with quality, performance, and design taste
Ability to take research ideas into production systems that work reliably
Good product taste as pertaining to human‑robot interaction, non‑verbal communication, and speech UX

Job Responsibility

Design the software architecture for real-time multimodal I/O
Design application flows like scheduling chores and triggering autonomous tasks from the voice interface
Optimize the companion stack for enabling seamless interactions with NEO
Make the Companion scalable and reliable while serving models from remote machines

What we offer

Health, dental, and vision insurance
401(k) with company match
Paid time off and holidays

Fulltime

Body Language Engineer, AI Companion

As a Body Language Engineer on the AI Companion team at 1X, you will build and r...

Location

United States , Palo Alto

Salary:

150000.00 - 250000.00 USD / Year

1X Technologies

Expiration Date

Until further notice

Requirements

3+ years of experience training multi‑modal models combining language, audio, and vision
Experience in robotics — such as path planning for collision‑free trajectories and kinematic retargeting for humanoid robots
Strong ability to tackle open‑ended problems in conversation and behavior modeling: come up with creative solutions, build proofs-of-concept, and drive them through to production

Job Responsibility

Scale up data collection efforts for nonverbal communication behaviors
Extend real‑time speech‑to‑speech models to also include real‑time emotive body language
Design and implement video understanding models to interpret user body language and intent
Devise methods to blend functional visuomotor tasks (e.g. object manipulation) with communicative behaviors (gesticulation, expressive motion)

What we offer

Equity
Health, dental, and vision insurance
401(k) with company match
Paid time off and holidays

Fulltime

Research Engineer

We are looking for a Research Engineer to join the research team at ElevenLabs. ...

Location

Poland

Salary:

Not provided

ElevenLabs

Expiration Date

Until further notice

Requirements

3+ years industry experience as a Machine Learning Engineer, with a key emphasis on constructing data pipelines, as well as developing and implementing machine learning models
Demonstrating the capacity to autonomously evaluate novel concepts or enhance current machine learning projects, with the potential outcome of contributing to published works
Extensive background in conducting exploratory research to enhance the excellence of gathered data, particularly within the realm of audio and text-to-speech domains

Job Responsibility

Creating and upholding a reliable and expandable data management system specialized for text-to-speech projects. This includes establishing guidelines for versioning and ensuring data quality
Establishing a streamlined process for autonomously training, assessing, and launching text-to-speech models. This encompasses implementing procedures for dynamic learning, as well as routines for fine-tuning and refreshing validation data
Investigating cutting-edge approaches and strategies in machine learning, deep learning, and algorithms pertaining to text-to-speech technology

What we offer

Innovative culture
Growth paths
Learning & development: ElevenLabs proactively supports professional development through an annual discretionary stipend
Social travel: We also provide an annual discretionary stipend to meet up with colleagues each year, however you choose
Annual company offsite
Co-working: If you’re not located near one of our main hubs, we offer a monthly co-working stipend

Fulltime

ML Engineer

The IT company Andersen invites an experienced ML Engineer for a large-scale pro...

Location

Salary:

Not provided

Andersen

Expiration Date

Until further notice

Requirements

Experience as a ML Engineer for 3+ years
Strong proficiency in Python, with deep knowledge of software development principles, architecture patterns, and ML model integration
Hands-on experience with TTS systems (e.g., Tacotron, FastSpeech, VITS) and an understanding of SST pipelines
Familiarity with real-time AI systems, including LLM integration and latency-sensitive applications
Experience tuning and maintaining ML models for performance, scalability, and quality in production
Level of English – from Intermediate+

Job Responsibility

Designing, integrating, and optimizing Text-to-Speech (TTS) systems within real-time conversational AI pipelines
Fine-tuning models based on user feedback, improving clarity, naturalness, and emotional expression in voice output
Contributing to customer-specific deployments with high adaptability and quick turnaround requirements
Collaborating with ML, product, and engineering teams to ensure seamless voice experiences across our platform

What we offer

Experience in teamwork with leaders in FinTech, Healthcare, Retail, Telecom, and others
The opportunity to change the project and/or develop expertise in an interesting business domain
Guarantee of professional, financial, and career growth
The opportunity to earn up to an additional 1,000 EUR per month, depending on the level of expertise, which will be included in the annual bonus, by participating in the company's activities
Access to the corporate training portal
Bright corporate life (parties / pizza days / PlayStation / fruits / coffee / snacks / movies)
Certification compensation (AWS, PMP, etc)
Referral program
English courses
Private health insurance and compensation for sports activities

Select Country

Ai engineer (speech)

Randstad

Location:
Japan , Tokyo

Category:
IT - Software Development

Contract Type:
Not provided

Salary:

Job Responsibility:

Requirements:

Additional Information:

Job Posted:
February 16, 2026

Expiration:
January 11, 2028

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Ai engineer (speech)

Multimodal Speech Engineer, AI Companion

Staff Backend Engineer, Speech AI

Multimodal Speech Engineer

Senior AI Engineer

Full-Stack Engineer, AI Companion

Body Language Engineer, AI Companion

Research Engineer

ML Engineer

Our AI answers in your language

Ai engineer (speech)

Randstad

Location:Japan , Tokyo

Category:IT - Software Development

Contract Type:Not provided

Salary:

Job Responsibility:

Requirements:

Additional Information:

Job Posted:February 16, 2026

Expiration:January 11, 2028

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Ai engineer (speech)

Multimodal Speech Engineer, AI Companion

Staff Backend Engineer, Speech AI

Multimodal Speech Engineer

Senior AI Engineer

Full-Stack Engineer, AI Companion

Body Language Engineer, AI Companion

Research Engineer

ML Engineer

Location:
Japan , Tokyo

Category:
IT - Software Development

Contract Type:
Not provided

Job Posted:
February 16, 2026

Expiration:
January 11, 2028