CrawlJobs Logo

Machine Learning Platform Engineer

together.ai Logo

Together AI

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

160000.00 - 250000.00 USD / Year

Job Description:

Our team focuses on enabling custom models and dedicated inference on Together. We are responsible for building a container platform, optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience with great tooling. We often focus on video or audio generation across the stack: CUDA kernels, pytorch optimization, inference engines, container orchestration, queueing theory, etc. An ideal candidate will be great at profiling/optimization but know the word kubernetes, or be intimately familiar with multi-cluster scheduling and have some sense of ML bottlenecks.

Job Responsibility:

  • New hires may work on multi-cluster orchestration, portfolio optimization, predictive autoscaling, control panes, model bring-up, model optimization, APIs for managing deployments, inference worker SDKs, and CLI tools
  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs
  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems
  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance

Requirements:

  • 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems
  • Experience running serverless inference platforms, doing model bring-up on short notice, being on call, or running a cloud provider is a very big plus
  • Good taste and ability to thoughtfully discuss how what you’ve built has failed over time
  • Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources
  • Excellent understanding of low level operating systems concepts including concurrency, networking and storage, performance and scale
  • Expert-level programmer in one or more of Python, Golang, Rust, C++, or Haskell
  • Proficiency in writing and maintaining Infrastructure as Code (IaC) using tools like Terraform
  • Experience with Kubernetes internals or other container orchestration systems
  • Sound judgement for when to use and when to not use LLMs for code
  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
  • Writing-heavy roles or companies are a plus
What we offer:
  • competitive compensation
  • startup equity
  • health insurance
  • other competitive benefits

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Machine Learning Platform Engineer

Senior Machine Learning Engineer

We are looking for a Senior ML engineer to join the Teamwork Graph team and a gr...
Location
Location
United States , Mountain View; Seattle; San Francisco
Salary
Salary:
165500.00 - 265800.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience building and scaling business applications and systems using large amounts of data
  • Built generalizable platforms with a keen eye on empowering applications in product
  • Experience building and scaling business applications and systems using large amounts of data
  • Agile development mindset, appreciating the benefit of iteration and improvement
  • Experience working with LLMs and RAG a bonus
Job Responsibility
Job Responsibility
  • Process and structure large amounts of graph data to power applications across Atlassian
  • Build great APIs
  • Creative use of generative AI and ML to process, structure, and reason over large amounts of data to power intelligent products
  • Mentor and coach your team members on best practices, code quality, design patterns, testing, debugging, and documentation
  • Communicate and explain data science concepts to diverse audiences, craft a compelling story
  • Communicate effectively with internal and external partners, present technical concepts and results clearly and concisely, and solicit feedback and input from various stakeholders
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Principle Machine Learning Engineer

We are looking for a Principle ML engineer to join the Teamwork Graph team and a...
Location
Location
United States , Mountain View; Seattle; San Francisco
Salary
Salary:
190300.00 - 305600.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience building and scaling business applications and systems using large amounts of data
  • Built generalizable platforms with a keen eye on empowering applications in product
  • Agile development mindset, appreciating the benefit of iteration and improvement
  • Experience working with LLMs and RAG a bonus
Job Responsibility
Job Responsibility
  • Process and structure large amounts of graph data to power applications across Atlassian
  • Build great APIs
  • Creative use of generative AI and ML to process, structure, and reason over large amounts of data to power intelligent products
  • Mentor and coach your team members on best practices, code quality, design patterns, testing, debugging, and documentation
  • Communicate effectively with internal and external partners, present technical concepts and results clearly and concisely, and solicit feedback and input from various stakeholders
What we offer
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
  • Fulltime
Read More
Arrow Right

Senior Machine Learning Engineer

We are looking for a Senior ML engineer to join the Teamwork Graph team and a gr...
Location
Location
United States , Mountain View; Seattle; San Francisco
Salary
Salary:
165500.00 - 265800.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience building and scaling business applications and systems using large amounts of data
  • Built generalizable platforms with a keen eye on empowering applications in product
  • Experience building and scaling business applications and systems using large amounts of data
  • Agile development mindset, appreciating the benefit of iteration and improvement
  • Experience working with LLMs and RAG a bonus
Job Responsibility
Job Responsibility
  • Process and structure large amounts of graph data to power applications across Atlassian
  • Build great APIs
  • Creative use of generative AI and ML to process, structure, and reason over large amounts of data to power intelligent products
  • Mentor and coach your team members on best practices, code quality, design patterns, testing, debugging, and documentation.
  • Communicate and explain data science concepts to diverse audiences, craft a compelling story
  • Communicate effectively with internal and external partners, present technical concepts and results clearly and concisely, and solicit feedback and input from various stakeholders.
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Staff Machine Learning Engineer

Join PagerDuty as a Staff Machine Learning Engineer to tackle complex problems, ...
Location
Location
Canada , Toronto
Salary
Salary:
156000.00 - 232000.00 CAD / Year
https://www.pagerduty.com Logo
PagerDuty
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience building, designing, and evolving data architecture for large-scale systems
  • Excellent communication skills
  • Experience working with Product teams, ensuring and driving a timely delivery
  • Have a deep understanding of the trade-offs to be considered when designing and delivering machine learning solutions to production
  • Experience leading cross-team architecture discussions, building technical prototypes, and driving the adoption of best practices across diverse teams
  • Demonstrated experience with data engineering processes, working with unstructured data and cloud-based data infrastructures
  • Passionate about ML engineering and interested in driving discussions with stakeholders and executives
Job Responsibility
Job Responsibility
  • Build and improve the capabilities of the data platform that enable and accelerate the production of ML/AI-based solutions
  • Drive and define standards for AI/ML across the organization
  • Provide guidance, technical leadership, and mentoring to other members of the team
  • Mentor junior members and participate in scaling up the existing team
  • Proactively recommend improvements and new approaches addressing potential systemic pain points and technical debt
  • Anticipate technical demands on the data platform based on the organization’s roadmap and systematically drive the evolution of the architecture toward those ends
  • Develop a long-term plan for ML/AI investments
What we offer
What we offer
  • Competitive salary
  • Comprehensive benefits package from day one
  • Flexible work arrangements
  • Company equity
  • ESPP (Employee Stock Purchase Program)
  • Retirement or pension plan
  • Generous paid vacation time
  • Paid holidays and sick leave
  • Dutonian Wellness Days & HibernationDuty - companywide paid days off in addition to PTO
  • Paid parental leave: 22 weeks for pregnant parent, 12 weeks for non-pregnant parent
  • Fulltime
Read More
Arrow Right

Senior Machine Learning Engineering Manager

We are looking for a senior ML engineering manager to join the Teamwork Graph te...
Location
Location
United States , Mountain View; San Francisco; Seattle
Salary
Salary:
190300.00 - 305700.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience managing engineering teams running core AI/ML services at scale
  • Experience building and scaling business applications and systems using large amounts of data
  • Ability to communicate and explain complex technical concepts to diverse audiences, craft a compelling story
  • Build generalizable platforms with a keen eye on empowering applications in product
  • Solid understanding of foundational ML concepts and Graphs
  • Experience working with LLM based applications or other ML powered products
Job Responsibility
Job Responsibility
  • Process and structure large amounts of graph data to power applications across Atlassian
  • Mentor and coach your team members on best practices, code quality, design patterns, testing, debugging, and documentation
  • Hire, onboard, and retain top talent for your team and foster a culture of innovation, collaboration, and excellence
  • Communicate effectively with internal and external partners, present technical concepts and results clearly and concisely, and solicit feedback and input from various stakeholders
  • Creative use of generative AI and ML to process, structure, and reason over large amounts of data to power intelligent products
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Data / Machine Learning Engineer

Inetum is in the midst of a strategic project to develop our competences in the ...
Location
Location
Poland , Warsaw
Salary
Salary:
Not provided
https://www.inetum.com Logo
Inetum
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum of 2 years' experience as a Machine Learning Engineer or related position
  • Proficiency in Python and experience with ML frameworks (e.g. TensorFlow, PyTorch)
  • Knowledge of mathematics and statistics, including linear algebra, differential calculus and probability
  • Experience of working with large datasets and their processing and analysis
  • Familiarity with cloud tools and platforms, such as Azure Machine Learning or Google Vertex AI
  • Ability to work in a team and communicate with technical and non-technical stakeholders
Job Responsibility
Job Responsibility
  • Design, development and implementation of machine learning models and AI systems in cloud environments (Azure, Google Cloud)
  • Creating and maintaining data pipelines: from data extraction and cleaning to feature engineering and preparing data for modelling
  • Selection of appropriate ML algorithms and their implementation and optimisation in the context of specific business problems
  • Collaborating with Data Science, Data Engineering and Software Development teams to integrate models into production systems
  • Monitoring model performance, conducting A/B testing and updating models in response to changing data and requirements
  • Keeping abreast of the latest ML trends and technologies and proposing innovative solutions
What we offer
What we offer
  • Flexible working hours
  • Hybrid work model
  • A cafeteria system that allows employees to personalize benefits by choosing from a variety of options
  • Generous referral bonuses, offering up to PLN6,000 for referring specialists
  • Fulltime
Read More
Arrow Right

Senior Machine Learning Engineer

As Babylist’s Senior Machine Learning Engineer, you’ll be the founding expert dr...
Location
Location
United States; Canada
Salary
Salary:
189900.00 - 237400.00 USD / Year
babylist.com Logo
Babylist
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of experience delivering end to end solutions that drive business growth
  • Deep expertise in the Python ML ecosystem (pandas, sklearn, xgboost, PyTorch)
  • Strong expertise creating custom embeddings from raw data sources
  • Experience going beyond off-the-shelf image/text embeddings to develop domain-specific representations for ML models
  • Proven track record building and delivering production-grade ML solutions, especially recommender systems or personalization features
  • Experience with the entire ML lifecycle, including workflow orchestration (Airflow) and model monitoring
  • Strong communication skills with technical and non-technical stakeholders
  • Background in consumer-facing products preferred
  • Expertise with e-commerce, dating apps, or complex user journey platforms is highly relevant
  • Demonstrate autonomy: able to define problem space, architect solutions from scratch (zero to one), and operate with ownership
Job Responsibility
Job Responsibility
  • Pioneer ML at Babylist: shape the roadmap, practices, and culture for machine learning and personalization at scale
  • Own high-impact work end-to-end: build and launch high-leverage personalization features from the ground up
  • Lead both technically and strategically: be the technical pioneer and strategic leader for ML at Babylist
  • Collaborate across a strong data organization: partner with data scientists, data engineers, and analytics engineers
What we offer
What we offer
  • Competitive salary with equity and bonus opportunities
  • Company-paid medical, dental, and vision insurance
  • Retirement savings plan with company matching and flexible spending accounts
  • Generous paid parental leave and PTO
  • Remote work stipend to set up your office
  • Perks for physical, mental, and emotional health, parenting, childcare, and financial planning
  • Fulltime
Read More
Arrow Right

Machine Learning Engineer - Computer Vision

We are seeking a highly skilled and motivated Machine Learning Engineer speciali...
Location
Location
United States , Arlington
Salary
Salary:
Not provided
caseguard.com Logo
CaseGuard
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master’s degree in Computer Science, Artificial Intelligence, Data Science, or a related field
  • Experience in deep learning models, their training, and hyperparameter tuning using libraries such as TensorFlow, PyTorch, and Transformers or other Huggingface tools
  • Experience with data manipulation tools such as Pandas, NumPy, and SQL
  • Strong programming skills in Python and C++
  • Experience in MLOps principles and model deployment and instrumentation on cloud platforms such as AWS, Azure, or Google Cloud for model deployment and knowledge with efficient serving tools such as ONNX, triton, and vllm
  • Proficiency in working with image and video data, including preprocessing and augmentation techniques
  • Strong understanding of machine learning algorithms, including supervised and unsupervised learning and deep learning
  • Strong communication skills and the ability to work collaboratively in a team environment
Job Responsibility
Job Responsibility
  • Design, develop, and deploy computer vision models for tasks such as object detection, object tracking, video segmentation, and facial recognition
  • Optimize and fine-tune deep learning algorithms for real-time performance
  • Work closely with the software engineers and product teams to identify opportunities for leveraging data
  • Collect, clean, and preprocess large datasets to prepare for model training and evaluation
  • Evaluate and optimize machine learning models for accuracy, performance, and scalability
  • Deploy models into production environments and monitor their performance to ensure reliability
  • Stay up-to-date with the latest advancements in computer vision and artificial intelligence
  • Collaborate with cross-functional teams to integrate machine learning solutions into business processes
  • Document processes, models, and implementations to ensure reproducibility and scalability
What we offer
What we offer
  • Competitive salary
  • Comprehensive health and wellness benefits
  • Professional development opportunities and continuous learning programs
  • Collaborative and inclusive work environment
Read More
Arrow Right