Software Engineer, Inference - Multi Modal

OpenAI

Location:
United States, San Francisco

Contract Type:
Not provided

Salary:
295000.00 - 555000.00 USD / Year

Job Description:

OpenAI’s Inference team powers the deployment of our most advanced models - including our GPT models, 4o Image Generation, and Whisper - across a variety of platforms. Our work ensures these models are available, performant, and scalable in production, and we partner closely with Research to bring the next generation of models into the world. We’re a small, fast-moving team of engineers focused on delivering a world-class developer experience while pushing the boundaries of what AI can do.

We’re expanding into multimodal inference, building the infrastructure needed to serve models that handle image, audio, and other non-text modalities. These workloads are inherently more heterogeneous and experimental, involving diverse model sizes and interactions, more complex input/output formats, and tighter coordination with product and research.

We’re looking for a software engineer to help us serve OpenAI’s multimodal models at scale. You’ll be part of a small team responsible for building reliable, high-performance infrastructure for serving real-time audio, image, and other multimodal workloads in production. This work is inherently cross-functional: you’ll collaborate directly with researchers training these models and with product teams defining new modalities of interaction. You’ll build and optimize the systems that let users generate speech, understand images, and interact with models in ways far beyond text.

Job Responsibilities:

  • Design and implement inference infrastructure for large-scale multimodal models
  • Optimize systems for high-throughput, low-latency delivery of image and audio inputs and outputs
  • Enable experimental research workflows to transition into reliable production services
  • Collaborate closely with researchers, infra teams, and product engineers to deploy state-of-the-art capabilities
  • Contribute to system-level improvements including GPU utilization, tensor parallelism, and hardware abstraction layers

Requirements:

  • Experience building and scaling inference systems for LLMs or multimodal models
  • Experience working with GPU-based ML workloads and an understanding of the performance dynamics of large models, especially with complex data like images or audio
  • Enjoy experimental, fast-evolving work and collaborating closely with research
  • Comfortable dealing with systems that span networking, distributed compute, and high-throughput data handling
  • Familiarity with inference tooling like vLLM, TensorRT-LLM, or custom model parallel systems
  • Own problems end-to-end and are excited to operate in ambiguous, fast-moving spaces

Nice to have:

  • Experience working with image generation or audio synthesis models in production
  • Exposure to distributed ML training or system-efficient model design

What we offer:

  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Relocation support for eligible employees
  • Additional taxable fringe benefits, such as charitable donation matching and wellness stipends

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Full-time
Work Type:
On-site work

Similar Jobs for Software Engineer, Inference - Multi Modal

Member of Technical Staff, Multimodal Infrastructure

Microsoft AI is looking for a Member of Technical Staff, Multimodal Infrastructu...
Location:
United States, Mountain View
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience in multi-modal data processing: Strong proficiency in distributed data processing infra (resource utilization management, fault tolerance, Ray and Spark) and CPU/GPU batch processing optimizations
  • Experience with state-of-the-art model inference and serving frameworks
  • Experience with image/video/audio data processing
  • Experience with common data formats for efficient I/O
  • Experience in multi-modal pretraining and post-training: Strong proficiency in deep learning frameworks such as PyTorch, Megatron, and DeepSpeed
  • Knowledge of auto-regressive and diffusion transformer models
  • Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism
  • Proven experience in at least one of the following areas: image/video generation and editing; efficient architectures (e.g., MoE, window attention)
Job Responsibilities:
  • Design, develop and maintain large-scale multimodal data processing pipelines
  • Design, develop and maintain large-scale multimodal model pretraining and post-training frameworks
  • Design, develop and maintain large-scale multimodal model inference and serving frameworks
  • Work with research scientists and product engineers to solve infra-related problems
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
Employment Type:
Full-time

Senior Software Engineer - Wayve Foundation Model

This is a rare opportunity to join the small but high-leverage engineering team ...
Location:
Canada, Vancouver
Salary:
Not provided
Wayve
Expiration Date:
Until further notice
Requirements:
  • Strong software engineering skills with experience building and maintaining distributed systems, data pipelines, or backend platforms at scale
  • Experience developing infrastructure that supports machine learning workflows—such as training orchestration, evaluation tooling, or inference systems
  • Comfort working closely with research or ML teams to understand their iteration needs and build systems that accelerate them
  • Familiarity with technologies like Flyte, Ray, Spark, Airflow, or Kubernetes, and an understanding of how to use them to scale data and compute
  • Ownership mindset with the ability to identify bottlenecks, operate across team boundaries, and “get stuff done” in ambiguous, fast-moving environments
Job Responsibilities:
  • Design and scale infrastructure for data ingestion, filtering, and curation of multi-modal embodied data
  • Build robust, efficient training, evaluation, and inference pipelines to support foundation model development
  • Partner closely with scientists and MLEs to accelerate experimentation and unblock research
  • Improve ML systems performance, scalability, and automation across the stack
  • Act as a cross-functional force multiplier—connecting Science, Software, and Data teams through well-designed tooling and systems
Employment Type:
Full-time

Engineering Director, AI Solutions and Automation (ASA)-AI Product Acceleration

We are seeking a highly accomplished Engineering Director with extensive technic...
Location:
United States, Bellevue, WA
Salary:
271000.00 - 347000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • 15+ years of experience growing and leading successful Engineering teams, with a proven ability to recruit, land, and grow both engineering technical managers and individual contributors
  • Extensive expertise (15+ years) in Machine Learning (ML) and Artificial Intelligence (AI), with a history of functioning as a technical leader or lead architect on production systems
  • Extensive experience building and deploying complex, large-scale, distributed AI/ML software systems from the ground up
  • Experience as a great collaborator, building models and processes for aligning work across large, multi-disciplinary teams (Engineering, Data Science, Product Management)
  • Hands-on technical experience in relevant ML/AI languages (e.g., Python, C++) and applying data-driven methodologies to define and manage large software projects
  • Demonstrated ability to drive technical strategy and execution in cutting-edge AI domains like multi-modal processing, model evaluation, or RL-based post-training
Job Responsibilities:
  • Lead and manage teams of AI applied researchers and engineers, providing extensive technical guidance, mentorship, and support to ensure the successful end-to-end delivery of high-quality, scalable AI/ML systems
  • Serve as the technical authority, driving the design, development, and deployment of complex AI solutions, including LLM post-training techniques (like Reinforcement Learning and Fine-Tuning), Multi-modal Content Understanding, and Agentic AI platforms
  • Define and lead the long-term technical strategy and roadmap for large, enterprise-wide AI efforts, ensuring alignment with the ASA mission to deliver cost-efficient and performant AI models
  • Foster an environment of innovation, rapid prototyping, and technical excellence, encouraging experimentation and continuous improvement in the pursuit of SoTA performance
  • Identify new, high-leverage opportunities for LLM-based automation across Meta's product portfolio and influence cross-functional partners for appropriate staffing and prioritization
  • Supervise the development of AI-centric platforms, such as the AI Evaluation and scalable inference and serving infrastructure for 1P, 2P, and 3P models
What we offer:
  • bonus
  • equity
  • benefits
Employment Type:
Full-time

Applied Scientist

We are reimagining Windows in the era of AI. As an Applied Scientist you would pl...
Location:
India, Hyderabad
Salary:
Not provided
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java or Python, OR equivalent experience
  • 4+ years of overall experience in end-to-end shipping of commercial software, with at least 3 years of experience in AI/ML, predictive analytics, or research, and exposure to generative AI/LLM/SLM algorithms
  • A customer-focused innovation mindset
  • Passionate about craftsmanship in engineering
  • Experience building AI/ML solutions is good to have
  • Aptitude to learn and adapt with intensity and agility
Job Responsibilities:
  • Design, implement, and experiment with end-to-end AI-powered user experiences
  • Build scalable fullstack solutions that integrate AI models (LLMs, vision, speech) via SDKs, APIs, and custom pipelines
  • Collaborate with other engineers to optimize model selection, inference performance, and user interaction loops
  • Partner with PMs, designers, and researchers to prototype and validate new interaction paradigms
  • Contribute to the architecture and infrastructure for AI-first features, ensuring reliability, privacy, and compliance
  • Drive engineering excellence through code reviews, testing, telemetry, and continuous improvement
  • Mentor junior engineers and contribute to a culture of innovation and inclusion
  • Be a Subject Matter Expert in a specific domain or tech
  • Be customer and telemetry focussed and reduce mean time to market and mean time to recover through Engineering Excellence
  • Research and implement state-of-the-art approaches using foundation models, prompt engineering, RAG, graphs, multi-agent architectures, as well as classical machine learning techniques
Employment Type:
Full-time

Dietary Assistant Manager

Rosewood Rehabilitation and Nursing Center is currently seeking a dedicated and ...
Location:
United States, Schuylkill Haven
Salary:
Not provided
Rosewood Rehabilitation & Nursing Center
Expiration Date:
Until further notice
Requirements:
  • Minimum Years of Experience: 2 years
  • Minimum Level of Education: Certificate
  • Previous experience in food service, healthcare dietary services, or kitchen supervision preferred
  • Knowledge of food safety and sanitation standards
  • Ability to work in a fast-paced environment
  • Strong communication and leadership skills
  • Ability to work well as part of a team
  • Reliable and dependable
Job Responsibilities:
  • Assist in supervising dietary staff and daily kitchen operations
  • Ensure meals are prepared and served according to resident dietary needs and physician orders
  • Maintain compliance with state, federal, and facility sanitation and food safety standards
  • Help coordinate meal preparation, service schedules, and staff assignments
  • Monitor food quality, portion control, and presentation
  • Assist with inventory, ordering supplies, and maintaining stock levels
  • Support training and guidance of dietary staff as needed
  • Ensure a clean, safe, and organized kitchen environment
What we offer:
  • Competitive pay
  • Supportive team environment
  • Opportunities for growth within the facility
Employment Type:
Full-time

Branch Manager

The Branch Manager leads in the management and direction of the branch to ensure...
Location:
Canada, Nanaimo
Salary:
88000.00 - 90000.00 CAD / Year
Randstad
Expiration Date:
May 19, 2026
Requirements:
  • Good judgment and strong decision-making skills
  • Excellent interpersonal and communication skills
  • Demonstrated problem solving and negotiation skills
  • A strong team player, experience with high performance teams a plus
  • Commitment to company values a must
  • Experience with pump integrity and testing
  • Advanced computer skills required. (Microsoft Word, Excel, Powerpoint, Outlook)
  • ERP/Power BI experience a plus
  • Ability to independently solve problems at both a strategic and functional level
  • Leadership abilities - demonstrated ability to lead people and get results through others
Job Responsibilities:
  • Manage daily operations of the branch to achieve budget goals
  • Manage and maintain full P&L for the branch level
  • Participate in budgetary and forecasting planning
  • Review daily and weekly reports as needed to maintain optimal business results
  • Develop and maintain relationships with customers
  • Oversee and maintain appropriate staffing levels of the branch, as well as initiating the onboarding process for any employees hired
  • Oversee branch employees' training and coaching needs
  • Complete and return requisition requests for any vacant branch level positions
  • Provide feedback to HR on all candidate resumes received
  • Conduct 60 and 90 day reviews with new hires
What we offer:
  • Vacation package
  • Pension
  • Benefits
  • Monday to Friday
Employment Type:
Full-time

Lineman Foreman

Seasonal Contract position supporting the National Science Foundation managed Un...
Location:
Antarctica, McMurdo Station
Salary:
Not provided
Amentum
Expiration Date:
Until further notice
Requirements:
  • High School diploma or GED
  • Current journeyman lineman license, completion of an apprenticeship program, or proof of holding a journeyman lineman union card
  • Minimum six years of experience in all phases of the lineman trade including rough wiring, power distribution wiring, installation and maintenance of power distribution systems, installation of conduit, and wiring of electrical equipment
  • Must have been in a lead, Foreman, or supervisory position for a minimum of two of the six years of experience
  • Experience working with 4160 volt systems or higher
  • Thorough knowledge of the National Electrical Safety Code (NESC) and the principles of electricity
  • Understanding of National Electrical Code Grounding and Bonding requirements
  • Willingness and ability to deploy to Antarctica for extended periods
  • Successful completion of Medical and Dental examinations required by the NSF for deployment to Antarctica
  • Successful completion of drug screening and background check required by employer
Job Responsibilities:
  • Supervise the activities of Facilities Line Shop and Linemen
  • Supervise and perform hands-on installation of branch circuits, power distribution wiring, electrical equipment, fixtures, and conduit in compliance with the National Electrical Safety Code (NESC) and National Electrical Code (NEC)
  • Supervise and perform hands-on installation and maintenance of overhead electrical lines in accordance with applicable codes, working with voltages up to 4160 volts
  • Supervise and perform hands-on diagnosis, repair, and scheduled Preventative Maintenance on existing systems, equipment, fixtures, and infrastructure
  • Follow ASC Electrical Safety Program and promote electrical safety throughout the United States Antarctic Program
  • Timely response to emergency service calls during and outside normal working hours
  • Maintain effective communication between departments and represent department as needed at operational meetings
  • Assist with ensuring accurate material and labor charge code use, and accurate timecard submittal
  • Interact with the National Science Foundation, military and other agency officials, both over the telephone and in person
  • Accurate and thorough documentation of hours worked, tasking performed, materials used, and processes involved
Employment Type:
Full-time