LLM - Senior Staff Engineer - Python + Machine Learning

AquSag Technologies

Location:

Contract Type:
Not provided

Salary:

40.00 - 60.00 USD / Hour

Job Description:

AquSag is seeking a hands-on Machine Learning Senior Staff Engineer to lead cross-functional teams building and deploying cutting-edge LLM and ML systems. In this role, you’ll drive the full lifecycle of AI development, from research and large-scale model training to production deployment, while mentoring top engineers and collaborating closely with research and infrastructure leaders. You’ll combine technical depth in deep learning and MLOps with leadership in execution and strategy, ensuring that our AI initiatives deliver reliable, high-performance systems that translate research breakthroughs into measurable business impact. This position is ideal for leaders who are still comfortable coding, optimizing large-scale training pipelines, building Colab notebooks that break the models, and navigating the intersection of research, engineering, and product delivery.

Job Responsibility:

  • Lead and mentor a cross-functional team of ML engineers, data scientists, and MLOps professionals
  • Oversee the full lifecycle of LLM and ML projects — from data collection to training, evaluation, and deployment
  • Collaborate with Research, Product, and Infrastructure teams to define goals, milestones, and success metrics
  • Provide technical direction on large-scale model training, fine-tuning, and distributed systems design
  • Implement best practices in MLOps, model governance, experiment tracking, and CI/CD for ML
  • Manage compute resources and budgets, and ensure compliance with data security and responsible AI standards
  • Communicate progress, risks, and results to stakeholders and executives effectively

Requirements:

  • 9+ years of experience with a strong background in Machine Learning, NLP, and modern deep learning architectures (Transformers, LLMs)
  • Hands-on experience with frameworks such as PyTorch, TensorFlow, Hugging Face, or DeepSpeed
  • Hands-on experience with Docker for production deployment
  • Proven experience managing teams delivering ML/LLM models in production environments
  • Knowledge of distributed training, GPU/TPU optimization, and cloud platforms (AWS, GCP, Azure)
  • Familiarity with MLOps tools like MLflow, Kubeflow, or Vertex AI for scalable ML pipelines
  • Excellent leadership, communication, and cross-functional collaboration skills
  • Bachelor’s or Master’s in Computer Science, Engineering, or related field (PhD preferred)
  • Commitment required: 8 hours per day, with a mandatory 6-hour overlap with the PST time zone

Nice to have:

  • Experience building Agentic applications
  • Experience training or fine-tuning foundation models
  • Contributions to open-source ML or LLM frameworks
  • Understanding of Responsible AI, bias mitigation, and model interpretability

Additional Information:

Job Posted:
December 19, 2025

Employment Type:
Fulltime
Work Type:
Remote work

Similar Jobs for LLM - Senior Staff Engineer - Python + Machine Learning

Senior Staff Machine Learning Engineer

Help design our AI platform and develop our next generation of machine learning ...
Location:
United States, San Francisco
Salary:
216500.00 - 324500.00 USD / Year
GoFundMe
Expiration Date:
Until further notice
Requirements:
  • 9+ years of hands-on experience in machine learning engineering, AI development, software engineering, or related fields
  • Experience emphasizing secure, large-scale, distributed system design, AI/ML pipeline development, and implementation
  • Extensive experience designing, developing, and operating scalable backend systems
  • Experience applying software engineering best practices such as domain-driven design, event-driven architectures, and microservices
  • Deep expertise in agentic workflows, AI evaluation solutions, prompt management, and secure AI development and testing practices
  • Strong knowledge of relational and document-based databases, data storage paradigms, and efficient RESTful API design
  • Experience establishing robust CI/CD pipelines, automated testing (unit and integration), and deployment practices
  • Strong leadership skills, including effective planning and management of complex projects, mentoring of team members, and fostering a collaborative, high-performing engineering culture
  • Excellent communicator, able to articulate complex technical concepts clearly to both technical and non-technical stakeholders
  • Bachelor's degree in Computer Science, Software Engineering, or a related technical field (preferred)
Job Responsibility:
  • Design and implement AI platforms to enable scalable and secure access to LLMs from multiple model providers for diverse use cases
  • Design and implement agentic workflows, agentic tool ecosystems, and LLM prompt management solutions
  • Design, build, and optimize scalable model training, fine tuning, and inference pipelines, ensuring robust integration with production systems
  • Influence technical strategy and approach to developing embedding stores, vector databases, and other reusable assets
  • Lead initiatives to streamline ML and AI workflows, improve operational efficiency, and establish standardized procedures to achieve consistent, high-quality results across our AI systems
  • Design and develop backend services and RESTful APIs using Python and FastAPI, integrating seamlessly with ML pipelines and services
  • Take operational responsibility for team-owned services, including performance monitoring, optimization, troubleshooting, and participation in an on-call rotation
  • Collaborate with both technical and non-technical colleagues, including data and applied scientists, software engineers, product managers, and business stakeholders, to deliver reliable and scalable ML-driven products
  • Coach and mentor fellow ML engineers, promoting a culture of collaboration, continuous improvement, and engineering excellence within the team
  • Employ a diverse set of tools and platforms including Python, AWS, Databricks, Docker, Kubernetes, FastAPI, Terraform, Snowflake, Coralogix, and GitHub to build, deploy, and maintain scalable, highly available machine learning infrastructure
What we offer:
  • Competitive pay
  • Comprehensive healthcare benefits
  • Financial assistance for things like hybrid work and family planning
  • Generous parental leave
  • Flexible time-off policies
  • Mental health and wellness resources
  • Learning, development, and recognition programs
Employment Type:
Fulltime

Senior Staff Engineer, Applied AI

GEICO is seeking a Senior Staff Engineer, Applied AI to provide technical archit...
Location:
United States, Chevy Chase, MD; Palo Alto, CA
Salary:
130000.00 - 260000.00 USD / Year
Geico
Expiration Date:
Until further notice
Requirements:
  • 8 or more years of professional software engineering or applied machine learning experience
  • 2 or more years working with Generative AI or LLM-based systems in production
  • Proven track record of architecting and delivering complex AI/ML capabilities that span multiple teams and have measurable business impact
  • Deep hands-on expertise with Python and modern AI frameworks including LangChain, LangGraph, LangSmith, LlamaIndex, Hugging Face, OpenAI/Anthropic APIs, and emerging agentic frameworks
  • Demonstrated experience building and deploying production RAG (Retrieval-Augmented Generation) systems including document ingestion, chunking strategies, vector search, and context retrieval
  • Demonstrated experience designing and operating production AI systems including multi-agent architectures, intelligent automation, and workflow orchestration
  • Strong understanding of agent architectures, workflow orchestration, retrieval-augmented generation (RAG), vector databases, knowledge graphs, and semantic reasoning
  • Familiarity with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) for building interoperable AI systems
  • Experience ensuring platform scalability, cross-domain coherence, and alignment with AI platform capabilities and strategy
  • Strong expertise in distributed systems, microservices architecture, service design, performance optimization, and reliability engineering
Job Responsibility:
  • Specify architectures and system decompositions for AI/ML capabilities that involve significant integrations and cross-team collaboration across multiple product areas
  • Provide technical architecture and leadership for medium to large, complex, cross-functional AI initiatives with visibility at the tech VP level
  • Architect and lead implementation of advanced Generative AI solutions including agent-based systems, intelligent automation, document intelligence, and decision support systems that span multiple business domains
  • Design and implement sophisticated agentic workflows that orchestrate multiple AI agents, tools, APIs, reasoning steps, and business logic to automate complex enterprise processes at scale
  • Question status quo with an eye for simpler designs and more secure approaches, influencing tech VPs to set direction for multiple teams
  • Build systems and platforms that meet the highest standards for scalability, resilience, performance, availability, security, and compliance
  • Identify and scope opportunities for automating business processes using AI across multiple product areas and business domains
  • Advance the state-of-the-art in applied AI by integrating knowledge graphs, vector reasoning, retrieval architectures, and multi-agent systems to solve complex business problems
  • Drive innovation by exploring new models, frameworks, reasoning techniques, and AI architectures and applying them strategically to high-impact business challenges
  • Run rigorous experimentation programs including hypothesis definition, A/B testing, measurement frameworks, and iterative improvement across production AI systems
What we offer:
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation, a 401K savings plan vested from day one that offers a 6% match, performance and recognition-based incentives, and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Flexibility: we provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment Type:
Fulltime

Senior Staff Machine Learning Engineer – Agent Engineering

GEICO is seeking an experienced Sr Staff Machine Learning Engineer – Agent Engin...
Location:
United States, New York City; Palo Alto; Chevy Chase
Salary:
130000.00 - 300000.00 USD / Year
Geico
Expiration Date:
Until further notice
Requirements:
  • 10+ years of professional software development experience with at least two general-purpose programming languages such as Java, C++, Python, TypeScript, etc.
  • 7+ years of experience architecting, building & deploying end-to-end AI solutions utilizing open-source/cloud-agnostic components such as search engine (e.g. Elasticsearch, Qdrant), data warehouse (e.g. Snowflake), streaming platform (e.g. Kafka), relational database (e.g. PostgreSQL), NoSQL (e.g. Cassandra), distributed processing (e.g. Spark, Ray), workflow orchestration (e.g. Airflow, Temporal), etc.
  • 5+ years’ experience managing the end-to-end solution development life cycle, especially measurement and monitoring of operations metrics, analytical insights, and business outcomes via dashboards and other tools
  • Bachelor’s degree or above in Computer Science, Engineering, Statistics or a related field
Job Responsibility:
  • Own design, development and maintenance of high-performance AI solutions that utilize agentic workflows to deliver concrete business value for internal stakeholders
  • Collaborate with cross-functional teams, including data scientists, ML engineers, software engineers, product managers, and designers, to gather requirements, define project scope, and prioritize feature backlogs
  • Contribute to the selection, evaluation, and implementation of software technologies, tools, and frameworks
  • Take ownership in project planning and stakeholder management
  • Mentor and guide junior engineers via code reviews and design sessions
What we offer:
  • Comprehensive Total Rewards program
  • 401K savings plan with 6% match
  • performance and recognition-based incentives
  • tuition assistance
  • mental healthcare
  • fertility and adoption assistance
  • workplace flexibility
  • GEICO Flex program (work from anywhere in the US for up to four weeks per year)
Employment Type:
Fulltime

Senior Staff Machine Learning Engineer - Clinical - AI Teams

We are looking for a Senior Staff Machine Learning Engineer to join the Clinical...
Location:
France, Paris
Salary:
Not provided
Doctolib
Expiration Date:
Until further notice
Requirements:
  • 10+ years in ML/AI with 3+ years at Staff+ or Principal level leading complex, multi-team technical initiatives
  • PhD in Computer Science, AI, Statistics, or related field (or equivalent research experience)
  • Deep expertise in at least two of: clinical NLP, LLM fine-tuning and evaluation, automatic speech recognition, RAG systems, or reinforcement learning
  • Expert in Python, PyTorch/Transformers for training, vLLM/similar for inference with a track record deploying ML systems in production (AWS/GCP)
  • Exceptional communication skills, with the ability to align diverse stakeholders and explain complex technical decisions
Job Responsibility:
  • Own the long-term technical roadmap for clinical ML systems, from model architecture selection to production deployment patterns
  • Drive strategic build vs. buy decisions, balancing custom development with foundation model APIs
  • Define standards for safe rollout in healthcare contexts: shadow testing, staged deployment, and human-in-the-loop workflows
  • Lead design and implementation of LLM-powered clinical agents (consultation summarization, clinical coding, evidence-grounded recommendations)
  • Establish rigorous evaluation frameworks using real-world clinical datasets, ensuring outputs meet clinical accuracy and safety standards
  • Build production infrastructure: model and prompt versioning, guardrails, uncertainty quantification, cost optimization, and observability
  • Define a bold strategy for online and offline performance objectives to reach new levels of healthcare professionals' satisfaction
  • Mentor Staff or Senior ML Engineers and Applied Scientists, elevating technical standards across the organization
  • Lead cross-functional initiatives spanning Product, Medical Affairs, Legal, and Compliance teams
  • Translate clinical needs into research questions and production systems that measurably improve patient outcomes
What we offer:
  • Free comprehensive health insurance for you and your children
  • Parent Care Program: receive one additional month of leave on top of the legal parental leave
  • Free mental health and coaching services through our partner Moka.care
  • For caregivers and workers with disabilities, a package including an adaptation of the remote policy, extra days off for medical reasons, and psychological support
  • Work from abroad for up to 10 days per year thanks to our flexibility days policy
  • Work council subsidy to refund part of a sports club membership or a creative class
  • Up to 14 days of RTT
  • Lunch voucher with Swile card
Employment Type:
Fulltime

Senior LLM / Machine Learning Engineer – Clinical Platforms

Design, build, and deploy LLM pipelines that operate on real-world clinical data...
Location:
United States, Boston
Salary:
78540.80 - 125652.80 USD / Year
Boston Children's Hospital
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in a STEM field
  • PhD, MD, MPH or MS preferred
  • 2-3 years of experience in a professional work environment outside of academic setting
  • Demonstrated proficiency in Python and SQL and/or numpy libraries for natural language processing and validation
  • Strong foundation in statistics and applied quantitative methods
  • Excellent communication, teamwork, and problem-solving skills, with a collaborative spirit and scientific curiosity
Job Responsibility:
  • Develop repeatable pipelines in Python using pandas, scikit-learn, and other statistical tools for data structuring, extraction and validation
  • Develop, analyze, and interpret large clinical text datasets using the latest natural language processing (NLP/LLM) methods to extract and validate insights to support clinical research and predictive modeling
  • Query and manage health datasets using SQL on AWS cloud
  • Produce innovative solutions driven by exploratory data analysis from complex and high-dimensional datasets
  • Complete assignments in the required timeframe with minimal supervision and direction
  • Routinely lead, co-lead, or participate in biomedical informatics projects with other members of the BCH research community and external collaborators
  • Train staff and researchers
  • Create or contribute to a range of compelling communications
  • Present at project meetings
What we offer:
  • flexible schedules
  • affordable health, vision and dental insurance
  • child care and student loan subsidies
  • generous levels of time off
  • 403(b) Retirement Savings plan
  • Pension
  • Tuition and certain License and Certification Reimbursement
  • cell phone plan discounts
  • discounted rates on T-passes
Employment Type:
Fulltime

Sr. Staff Engineer (Conversational/Voice AI)

Uber’s Customer Obsession team builds the platform and AI that powers world‑clas...
Location:
United States, Sunnyvale, California; San Francisco, California
Salary:
267000.00 - 297000.00 USD / Year
Uber
Expiration Date:
Until further notice
Requirements:
  • 10+ years building production ML/AI systems
  • 4+ years leading complex ML initiatives end‑to‑end
  • Deep expertise in LLM‑driven systems (inference optimization, prompt/program design, fine‑tuning, distillation/LoRA, safety/guardrails, evals)
  • Strong software engineering in Python plus one of Go/Java/C++; hands‑on with microservices, gRPC/HTTP, cloud infra, containers, CI/CD, and real‑time telemetry/observability
  • Demonstrated ownership of high‑availability services (SLO/SLA design, incident response, on‑call leadership, postmortems)
  • Track record of shipping customer‑facing intelligent experiences with measurable impact (A/B testing, metrics literacy)
Job Responsibility:
  • Own the end‑to‑end agent architecture: agentic planning and execution loops, long-term memory, persona/voice, knowledge routing, and policy enforcement for compliant, on‑brand conversations
  • Ship production systems that handle millions of conversations with rigorous SLOs, fallbacks, and canaries; design graceful degradation (e.g., human handoff) and safety guardrails (prompt‑injection, jailbreak, PII redaction)
  • Lead voice agent initiatives: Drive the development of Uber’s voice support agent—covering real-time speech recognition (ASR), text-to-speech, natural turn-taking (barge-in and endpointing), and reliable telephony/WebRTC integration
  • Advance retrieval & reasoning: Build next-generation retrieval and reasoning pipelines, where the agent can search across different knowledge sources, apply policy-driven tools, and call structured workflows and ensure that responses are consistently grounded
  • Establish evals that matter: offline rubrics, simulated scenarios, safety tests, cost/latency tradeoff suites, and LLM‑as‑judge (with calibrated human review) wired into CI/CD and experiment platforms
  • Drive automation at scale: partner with Product/Design/Operations on coverage, policy alignment, localization, and rollout strategy to better customer experience and reduce cost per contact
  • Mentor/principal‑lead multiple pods; set technical strategy and quality bars; coach senior engineers on agentic patterns, reliability, and experiment velocity
What we offer:
  • Eligible to participate in Uber's bonus program; may be offered an equity award and other types of compensation; eligible for various benefits (details at https://www.uber.com/careers/benefits)
Employment Type:
Fulltime

Senior Staff Software Engineer - AI

GEICO is seeking an experienced Engineer with a passion for building high-perfor...
Location:
United States, Seattle, WA; Austin, TX; Palo Alto, CA; Chicago, IL; Dallas, TX
Salary:
110000.00 - 230000.00 USD / Year
Geico
Expiration Date:
Until further notice
Requirements:
  • Experience building and deploying ML systems in production with cross-functional engineering teams
  • Fluency in at least two modern languages such as Python, Go, Java, C++, or C# including object-oriented design
  • Experience architecting multi-component ML platforms using open-source/cloud-agnostic components. Datastores: PostgreSQL, NoSQL (MongoDB, Cassandra, CosmosDB); Streaming: Kafka, Flink, or Spark Streaming
  • Experience with end-to-end ML lifecycle: version control, CI/CD, Kubernetes, testing, monitoring, and production support
  • Experience with cloud providers (Azure, AWS or GCP) in production ML environments
  • Experience with observability tools and distributed systems monitoring, logging, tracing, and root cause analysis
  • Experience building multi-agent systems using LLMs and agentic frameworks (e.g., LangChain, LangGraph, AutoGen, Semantic Kernel, CrewAI)
  • Hands-on experience with RAG, semantic search, and vector databases (e.g., Milvus, pgvector, Qdrant, ElasticSearch)
  • Experience designing human-in-the-loop workflows and safety controls for autonomous systems
  • Strong architecture and design skills with ability to influence technical direction and roadmap
Job Responsibility:
  • Design and build a multi-agent AI platform where specialized agents autonomously detect, diagnose, and resolve issues through agent-to-agent (A2A) collaboration
  • Develop intelligent agents using LLMs and agentic frameworks that coordinate detection, diagnostic, remediation, and knowledge tasks with minimal human intervention
  • Define agent interaction protocols, A2A communication standards, and evaluation frameworks for agent decision quality and autonomous action safety
  • Architect vector database solutions (Milvus, pgvector, Qdrant) for semantic search and RAG to enable context-aware agent decision-making
  • Build end-to-end ML pipelines for severity classification, anomaly detection, failure pattern recognition, and impact forecasting using observability data
  • Establish scalable orchestration infrastructure for multi-agent workflows with CI/CD, automated evaluation, canary releases, and rollback strategies
  • Implement monitoring for agent interactions, A2A communication patterns, decision quality, data drift, and system reliability
  • Lead technical architecture ensuring scalability, observability, and integration with existing alerting, logging, and monitoring systems
  • Define standards for agent safety, explainability, governance, and human-in-the-loop controls for high-impact automated actions
  • Partner with SRE, Product, and Engineering teams to translate reliability goals into measurable ML objectives and maintain pragmatic technical roadmaps
What we offer:
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation, a 401K savings plan vested from day one that offers a 6% match, performance and recognition-based incentives, and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Flexibility: we provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment Type:
Fulltime

Staff Applied Research Scientist - Martech AI

We are seeking a highly experienced and strategic Staff Applied Research Scienti...
Location:
United States, New York City; Palo Alto; Chevy Chase; Dallas; Seattle; Chicago; Austin
Salary:
130000.00 - 260000.00 USD / Year
Geico
Expiration Date:
Until further notice
Requirements:
  • BS in Computer Science, Machine Learning, Statistics, Mathematics, or a related quantitative field; PhD preferred
  • 6+ years of professional experience delivering applied AI/ML solutions in production, including leading cross-functional initiatives with enterprise impact
  • Demonstrated ability to leverage LLMs and agentic AI systems to develop and deploy personalized marketing solutions, including individualized content generation, campaign targeting, and optimization of customer engagement/retention/conversion strategies
  • Strong proficiency in Python and SQL; deep experience with ML frameworks such as PyTorch, TensorFlow, and scikit-learn
  • Demonstrated experience establishing KPI frameworks, experimentation, and causal analysis to quantify model impact and inform prioritization
  • Excellent stakeholder management skills with a track record of driving alignment and adoption across product, engineering, and business teams
  • Strong written and verbal communication skills; ability to set the right context and explain complex technical topics to varied audiences, including executives
Job Responsibility:
  • Identify High-Impact Opportunities: Proactively surface and shape high-value AI/ML initiatives by engaging with product, engineering, and operations to align technical roadmaps with strategic business goals
  • Architecture & Technical Direction: Provide architectural leadership for AI/ML solutions impacting multiple stakeholders. Establish standards for scalability, reliability, observability, compliance, and cost efficiency across online and batch systems
  • Development & Productionization: Lead end-to-end delivery of AI/ML solutions, including model design, data pipelines, feature stores, evaluation, deployment, A/B testing, and monitoring in real-time and batch environments. Ensure clear plans, milestones, and on-time delivery
  • ROI Measurement & Experimentation: Establish robust mechanisms to quantify business impact, including KPI definition, experimentation frameworks, and causal inference approaches to guide decision-making and prioritize investments
  • Innovation & Research Integration: Stay current with cutting-edge research in ML, GenAI, and optimization. Prototype and harden novel techniques that push the boundaries of innovation within GEICO’s insurance ecosystem
  • Set technical direction for multi-quarter research initiatives; build evaluation frameworks, ensure reproducibility/responsible AI, and drive cross-functional adoption; shepherd patents
  • Cross-Functional Collaboration: Champion collaboration across Product, Engineering, Data Platform, Governance, Legal, and Operations to ensure responsible, compliant, and effective adoption of AI systems
  • Mentorship & Capability Building: Mentor junior and senior scientists, elevate technical standards (coding, testing, documentation, reproducibility), and foster a culture of scientific rigor and engineering excellence
What we offer:
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation, a 401K savings plan vested from day one that offers a 6% match, performance and recognition-based incentives, and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Flexibility: we provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment Type:
Fulltime