Software Engineer: ML Optimization

Generalist AI

Location:
United States, San Mateo

Contract Type:
Not provided

Salary:

200000.00 - 350000.00 USD / Year

Job Description:

We internally call this team MBMB (More Big More Better). You will own optimizations on both the training and on-robot inference stacks. We are still in a regime of step-function, not incremental, gains.

Job Responsibility:

  • Making GPUs go brrrrr
  • Implementing ML, hardware, and software changes that lead to step-function gains
  • Optimizing both the inference and training stacks

Requirements:

  • Proficient in, and current with, the latest ML techniques for training and inference optimization in transformer- and diffusion-based architectures
  • Will chase ML optimizations anywhere: from CUDA kernels to ML architecture, frontend or backend network bottlenecks, CPU bottlenecks, NVLink and comms, and torch, numpy, and Python inefficiencies

What we offer:

Offers Equity

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
On-site work

Similar Jobs for Software Engineer: ML Optimization

Senior Software Engineer - Network Enablement (Applied ML)

We build simple yet innovative consumer products and developer APIs that shape h...
Location:
United States, San Francisco
Salary:
180000.00 - 270000.00 USD / Year
Plaid
Expiration Date:
Until further notice
Requirements:
  • Strong software engineering skills including systems design, APIs, and building reliable backend services (Go or Python preferred)
  • Production experience with batch and streaming data pipelines and orchestration tools such as Airflow or Spark
  • Experience building or operating real-time scoring and online feature-serving systems, including feature stores and low-latency model inference
  • Experience integrating model outputs into product flows (APIs, feature flags) and measuring impact through experiments and product metrics
  • Experience with model lifecycle and operations: model registries, CI/CD for models, reproducible training, offline & online parity, monitoring and incident response
Job Responsibility:
  • Embed model inference into Network Enablement product flows and decision logic (APIs, feature flags, backend flows)
  • Define and instrument product + ML success metrics (fraud reduction, retention lift, false positives, downstream impact)
  • Design and run experiments and rollout plans (backtesting, shadow scoring, A/B tests, feature-flagged releases) to validate product hypotheses
  • Build and operate offline training pipelines and production batch scoring for bank intelligence products
  • Ship and maintain online feature serving and low-latency model inference endpoints for real-time partner/bank scoring
  • Implement model CI/CD, model/version registry, and safe rollout/rollback strategies
  • Monitor model/data health: drift/regression detection, model-quality dashboards, alerts, and SLOs targeted to partner product needs
  • Ensure offline and online parity, data lineage, and automated validation / data contracts to reduce regressions
  • Optimize inference performance and cost for real-time scoring (batching, caching, runtime selection)
  • Ensure fairness, explainability and PII-aware handling for partner-facing ML features
What we offer:
  • medical
  • dental
  • vision
  • 401(k)
  • equity
  • commission
Employment Type:
Fulltime

Senior Software Engineer – ML Model Compliance & Automation

We are seeking a highly skilled and motivated Senior Software Engineer to lead t...
Location:
India, Jaipur
Salary:
Not provided
InfoObjects
Expiration Date:
Until further notice
Requirements:
  • Experience Required: 3 - 7 yrs
  • GoLang (preferred)
  • Python (preferred)
  • Bash
  • MLOps Tools: KitOps, MLModelCI, MLflow, ONNX, TensorFlow, PyTorch, Docker
  • SBOM & Security: Syft, Grype, Trivy, CycloneDX, SPDX
  • CI/CD: GitHub Actions, GitLab CI, Jenkins, ArgoCD
  • Infra: Kubernetes, Docker, Helm, Terraform
  • Cloud: AWS, GCP, Azure (EKS/GKE/ECS preferred)
  • Version Control: Git, GitOps
Job Responsibility:
  • Model Packaging & Artifact Management: Design and implement workflows for packaging ML models using KitOps, ONNX, MLflow, or TensorFlow SavedModel
  • Manage model artifact versioning, registries, and reproducibility
  • Ensure artifact integrity, consistency, and traceability across CI/CD pipelines
  • Model Profiling & Optimization: Automate model profiling (latency, size, ops) using MLModelCI, TorchServe, or ONNX Runtime
  • Apply quantization, pruning, and format conversions (e.g., FP32→INT8) for optimization
  • Embed profiling and optimization checks into CI/CD pipelines to assess deployment readiness
  • Compliance & SBOM Generation: Develop pipelines to generate and validate SBOMs for ML models
  • Implement compliance checks for licensing, vulnerabilities, and security using CycloneDX, SPDX, Syft, or Trivy
  • Validate schema, dependencies, and runtime environments for production readiness
  • Cloud Integration & Deployment: Automate model registration, endpoint creation, and monitoring setup in AWS/GCP/Azure
Employment Type:
Fulltime

Software Engineer (Data Engineering)

We are seeking a Software Engineer (Data Engineering) who can seamlessly integra...
Location:
India, Hyderabad
Salary:
Not provided
NStarX
Expiration Date:
Until further notice
Requirements:
  • 4+ years in Data Engineering and AI/ML roles
  • Bachelor’s or Master’s degree in Computer Science, Data Science, or a related field
  • Python, SQL, Bash, PySpark, Spark SQL, boto3, pandas
  • Apache Spark on EMR (driver/executor model, sizing, dynamic allocation)
  • Amazon S3 (Parquet) with lifecycle management to Glacier
  • AWS Glue Catalog and Crawlers
  • AWS Step Functions, AWS Lambda, Amazon EventBridge
  • CloudWatch Logs and Metrics, Kinesis Data Firehose (or Kafka/MSK)
  • Amazon Redshift and Redshift Spectrum
  • IAM (least privilege), Secrets Manager, SSM
Job Responsibility:
  • Design, build, and maintain scalable ETL and ELT pipelines for large-scale data processing
  • Develop and optimize data architectures supporting analytics and ML workflows
  • Ensure data integrity, security, and compliance with organizational and industry standards
  • Collaborate with DevOps teams to deploy and monitor data pipelines in production environments
  • Build predictive and prescriptive models leveraging AI and ML techniques
  • Develop and deploy machine learning and deep learning models using TensorFlow, PyTorch, or Scikit-learn
  • Perform feature engineering, statistical analysis, and data preprocessing
  • Continuously monitor and optimize models for accuracy and scalability
  • Integrate AI-driven insights into business processes and strategies
  • Serve as the technical liaison between NStarX and client teams
What we offer:
  • Competitive salary and performance-based incentives
  • Opportunity to work on cutting-edge AI and ML projects
  • Exposure to global clients and international project delivery
  • Continuous learning and professional development opportunities
  • Competitive base + commission
  • Fast growth into leadership roles
Employment Type:
Fulltime

ML Engineer

The IT company Andersen invites an experienced ML Engineer for a large-scale pro...
Location:
Salary:
Not provided
Andersen
Expiration Date:
Until further notice
Requirements:
  • 3+ years of experience as an ML Engineer
  • Strong proficiency in Python, with deep knowledge of software development principles, architecture patterns, and ML model integration
  • Hands-on experience with TTS systems (e.g., Tacotron, FastSpeech, VITS) and an understanding of STT (speech-to-text) pipelines
  • Familiarity with real-time AI systems, including LLM integration and latency-sensitive applications
  • Experience tuning and maintaining ML models for performance, scalability, and quality in production
  • English level: Intermediate+ or higher
Job Responsibility:
  • Designing, integrating, and optimizing Text-to-Speech (TTS) systems within real-time conversational AI pipelines
  • Fine-tuning models based on user feedback, improving clarity, naturalness, and emotional expression in voice output
  • Contributing to customer-specific deployments with high adaptability and quick turnaround requirements
  • Collaborating with ML, product, and engineering teams to ensure seamless voice experiences across our platform
What we offer:
  • Experience working with industry leaders in FinTech, Healthcare, Retail, Telecom, and other sectors
  • The opportunity to switch projects and/or develop expertise in an interesting business domain
  • Guarantee of professional, financial, and career growth
  • The opportunity to earn up to an additional 1,000 EUR per month, depending on level of expertise, by participating in the company's activities (paid as part of the annual bonus)
  • Access to the corporate training portal
  • Bright corporate life (parties / pizza days / PlayStation / fruits / coffee / snacks / movies)
  • Certification compensation (AWS, PMP, etc.)
  • Referral program
  • English courses
  • Private health insurance and compensation for sports activities

Software Engineer

Coralogix AI is hiring a Software Engineer to help revolutionize the world of ob...
Location:
Israel, Ramat Gan
Salary:
Not provided
Coralogix
Expiration Date:
Until further notice
Requirements:
  • Minimum of 4 years of experience in software development within a cloud environment
  • Strong proficiency in designing and developing scalable, distributed systems
  • Experience with Kubernetes (K8s), cloud infrastructure (AWS, GCP, or Azure), and cloud-native development practices
  • Solid understanding of performance optimization, troubleshooting, and functional/non-functional testing
  • Proficiency in Python, Go, or other backend programming languages
  • Experience with CI/CD pipelines and infrastructure as code (Terraform / Pulumi, Helm, ArgoCD) is an advantage
  • Hands-on experience with AI/ML development, including monitoring ML models and evaluating their performance
Job Responsibility:
  • End-to-end development and ownership of products and features, from design to scalable and predictable production behavior
  • Solve diverse and complex problems in a high-scale, cloud-native environment
  • Collaborate with other engineers and product managers to improve product functionality, scalability, and performance
  • Design, develop, and maintain robust, secure, and efficient software solutions
  • Ensure high system reliability by implementing best practices in monitoring, observability, and automation
  • Review code, architecture, and data to identify and troubleshoot technical and performance issues
  • Work with AI/ML teams to integrate AI capabilities, including model monitoring, evaluation, and fine-tuning
Employment Type:
Fulltime

Staff Software Engineer, Backend

The Staff Engineer will work closely with AI/ML engineers, product managers, app...
Location:
United States, NYC
Salary:
160000.00 - 190000.00 USD / Year
Conductor
Expiration Date:
Until further notice
Requirements:
  • Completed studies in Computer Science, Mathematics, engineering or a related field or equivalent professional experience
  • 8+ years of experience in software development, with experience in product-driven companies
  • Strong expertise in system design, distributed computing, and scalable architecture patterns for handling large datasets and high-throughput applications
  • Proficiency in multiple programming languages with strong Python coding skills. Experience with Java is highly valued
  • Strong database experience including both SQL and NoSQL systems, with knowledge of data modeling and optimization techniques
  • Experience with AI/ML technologies including LLMs, vector databases (e.g., Milvus), embeddings, and ML frameworks
  • Knowledge of MLOps practices, model deployment, and AI system integration in production environments
  • Experience working across the full software development lifecycle including CI/CD, monitoring, testing, and production deployment
  • Proven track record of technical leadership, mentoring engineers, and driving engineering excellence within teams
  • Up-to-date with rapidly-evolving technologies and demonstrated ability to evaluate and adopt new tools and frameworks
Job Responsibility:
  • Lead the technical architecture, design, and implementation of large-scale distributed systems and data platforms to support customer needs and business growth
  • Oversee the planning, execution, and successful delivery of complex engineering projects, ensuring adherence to engineering best practices and quality standards
  • Design and build scalable, high-performance backend systems and APIs that handle millions of requests and large datasets efficiently
  • Architect robust data processing pipelines and ETL workflows using modern cloud technologies and distributed computing frameworks
  • Drive technical decision-making across the engineering organization, evaluating trade-offs and establishing engineering standards and practices
  • Lead cross-functional collaboration with product, AI/ML engineering, data engineering, and infrastructure teams to deliver comprehensive solutions
  • Build and maintain CI/CD pipelines, monitoring systems, and deployment automation to ensure reliable software delivery
  • Implement AI/ML capabilities including LLM integration, vector databases, and intelligent content processing workflows
  • Mentor senior and junior engineers, fostering technical excellence and knowledge sharing within the engineering organization
What we offer:
  • 100% covered employee medical plan
  • dental and vision plans
  • 401(k) with employer contribution
  • an unlimited vacation policy
  • 10 sick days
  • short-term disability
  • long-term disability
  • generous paid parental leave
  • employee assistance program
  • flexible savings accounts
Employment Type:
Fulltime

Middle/Senior AI, ML Engineer

Join us at Provectus to be a part of a team that is dedicated to building cuttin...
Location:
Salary:
Not provided
Provectus
Expiration Date:
Until further notice
Requirements:
  • Comfortable with standard ML algorithms and underlying math
  • Strong hands-on experience with LLMs in production, RAG architecture, and agentic systems
  • AWS Bedrock experience strongly preferred
  • Practical experience with solving classification and regression tasks in general, feature engineering
  • Practical experience with ML models in production
  • Practical experience with one or more use cases from the following: NLP, LLMs, and Recommendation engines
  • Solid software engineering skills (i.e., ability to produce well-structured modules, not only notebook scripts)
  • Python expertise, Docker
  • English level - strong Intermediate
  • Excellent communication and problem-solving skills
Job Responsibility:
  • Create ML models from scratch or improve existing models
  • Collaborate with the engineering team, data scientists, and product managers on production models
  • Develop experimentation roadmap
  • Set up a reproducible experimentation environment and maintain experimentation pipelines
  • Monitor and maintain ML models in production to ensure optimal performance
  • Write clear and comprehensive documentation for ML models, processes, and pipelines
  • Stay updated with the latest developments in ML and AI and propose innovative solutions
