LLM - Senior Staff Engineer - Python + Machine Learning Job at AquSag Technologies

Senior Staff Machine Learning Engineer

Help design our AI platform and develop our next generation of machine learning ...

Location

United States, San Francisco

Salary:

216500.00 - 324500.00 USD / Year

GoFundMe

Expiration Date

Until further notice

Requirements

9+ years of hands-on experience in machine learning engineering, AI development, software engineering, or related fields
Experience emphasizing secure, large-scale, distributed system design, AI/ML pipeline development, and implementation
Extensive experience designing, developing, and operating scalable backend systems
Experience applying software engineering best practices such as domain-driven design, event-driven architectures, and microservices
Deep expertise in agentic workflows, AI evaluation solutions, prompt management, and secure AI development and testing practices
Strong knowledge of relational and document-based databases, data storage paradigms, and efficient RESTful API design
Experience establishing robust CI/CD pipelines, automated testing (unit and integration), and deployment practices
Strong leadership skills, including effective planning and management of complex projects, mentoring of team members, and fostering a collaborative, high-performing engineering culture
Excellent communicator, able to articulate complex technical concepts clearly to both technical and non-technical stakeholders
Bachelor's degree in Computer Science, Software Engineering, or a related technical field (preferred)

Job Responsibility

Design and implement AI platforms to enable scalable and secure access to LLMs from multiple model providers for diverse use cases
Design and implement agentic workflows, agentic tool ecosystems, and LLM prompt management solutions
Design, build, and optimize scalable model training, fine-tuning, and inference pipelines, ensuring robust integration with production systems
Influence technical strategy and approach to developing embedding stores, vector databases, and other reusable assets
Lead initiatives to streamline ML and AI workflows, improve operational efficiency, and establish standardized procedures to achieve consistent, high-quality results across our AI systems
Design and develop backend services and RESTful APIs using Python and FastAPI, integrating seamlessly with ML pipelines and services
Take operational responsibility for team-owned services, including performance monitoring, optimization, troubleshooting, and participation in an on-call rotation
Collaborate with both technical and non-technical colleagues, including data and applied scientists, software engineers, product managers, and business stakeholders, to deliver reliable and scalable ML-driven products
Coach and mentor fellow ML engineers, promoting a culture of collaboration, continuous improvement, and engineering excellence within the team
Employ a diverse set of tools and platforms including Python, AWS, Databricks, Docker, Kubernetes, FastAPI, Terraform, Snowflake, Coralogix, and GitHub to build, deploy, and maintain scalable, highly available machine learning infrastructure

What we offer

Competitive pay
Comprehensive healthcare benefits
Financial assistance for things like hybrid work and family planning
Generous parental leave
Flexible time-off policies
Mental health and wellness resources
Learning, development, and recognition programs

Fulltime

Senior Staff Engineer, Applied AI

GEICO is seeking a Senior Staff Engineer, Applied AI to provide technical archit...

Location

United States, Chevy Chase, MD; Palo Alto, CA

Salary:

130000.00 - 260000.00 USD / Year

Geico

Expiration Date

Until further notice

Requirements

8 or more years of professional software engineering or applied machine learning experience
2 or more years working with Generative AI or LLM-based systems in production
Proven track record of architecting and delivering complex AI/ML capabilities that span multiple teams and have measurable business impact
Deep hands-on expertise with Python and modern AI frameworks including LangChain, LangGraph, LangSmith, LlamaIndex, Hugging Face, OpenAI/Anthropic APIs, and emerging agentic frameworks
Demonstrated experience building and deploying production RAG (Retrieval-Augmented Generation) systems including document ingestion, chunking strategies, vector search, and context retrieval
Demonstrated experience designing and operating production AI systems including multi-agent architectures, intelligent automation, and workflow orchestration
Strong understanding of agent architectures, workflow orchestration, retrieval-augmented generation (RAG), vector databases, knowledge graphs, and semantic reasoning
Familiarity with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) for building interoperable AI systems
Experience ensuring platform scalability, cross-domain coherence, and alignment with AI platform capabilities and strategy
Strong expertise in distributed systems, microservices architecture, service design, performance optimization, and reliability engineering

Job Responsibility

Specify architectures and system decompositions for AI/ML capabilities that involve significant integrations and cross-team collaboration across multiple product areas
Provide technical architecture and leadership for medium to large, complex, cross-functional AI initiatives with visibility at the tech VP level
Architect and lead implementation of advanced Generative AI solutions including agent-based systems, intelligent automation, document intelligence, and decision support systems that span multiple business domains
Design and implement sophisticated agentic workflows that orchestrate multiple AI agents, tools, APIs, reasoning steps, and business logic to automate complex enterprise processes at scale
Question the status quo with an eye for simpler designs and more secure approaches, influencing tech VPs to set direction for multiple teams
Build systems and platforms that meet the highest standards for scalability, resilience, performance, availability, security, and compliance
Identify and scope opportunities for automating business processes using AI across multiple product areas and business domains
Advance the state-of-the-art in applied AI by integrating knowledge graphs, vector reasoning, retrieval architectures, and multi-agent systems to solve complex business problems
Drive innovation by exploring new models, frameworks, reasoning techniques, and AI architectures and applying them strategically to high-impact business challenges
Run rigorous experimentation programs including hypothesis definition, A/B testing, measurement frameworks, and iterative improvement across production AI systems

What we offer

Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance
Access to additional benefits like mental healthcare as well as fertility and adoption assistance
Supports flexibility: we provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year

Fulltime

Staff II Software Engineer AI/ML Ops

We're looking for a Lead Data Engineer to design, build, and optimize data pipel...

Location

United States, Pleasanton

Salary:

245000.00 - 307000.00 USD / Year

BlackLine

Expiration Date

Until further notice

Requirements

Strong programming skills in languages such as Python, Java, or Scala
Expertise in ML frameworks (TensorFlow, PyTorch, scikit-learn) and orchestration tools (Airflow, Kubeflow, Vertex AI, MLflow)
Proven experience operating production pipelines for ML and LLM-based systems across cloud ecosystems (GCP, AWS, Azure)
Deep familiarity with LangChain, LangGraph, ADK or similar agentic system runtime management
Strong competencies in CI/CD, IaC, and DevSecOps pipelines integrating testing, compliance, and deployment automation
Hands-on experience with observability stacks (Prometheus, Grafana, New Relic) for model and agent performance tracking
Understanding of governance frameworks for Responsible AI, auditability, and cost metering across training and inference workloads
Proficiency in containerization technologies (e.g., Docker, Kubernetes)
Proficient in scripting languages (e.g., Bash, Python) for automation
Experience with workflow orchestration tools (e.g., Apache Airflow)

Job Responsibility

Lead data pipeline development: Build and maintain PySpark ETL pipelines with high data quality and performance
Manage integrations: Establish robust connections to client data sources via APIs and tools like FiveTran, Plaid, and BlackLine's own internal connector ecosystem
Ensure reliability: Monitor pipeline performance, automate testing, and validate data accuracy
Optimize for scale: Implement performance improvements (e.g., CDC mechanisms, indexing strategies) for large-scale datasets
Collaborate & innovate: Work with business stakeholders to refine data requirements and integrate cutting-edge AI and big data technologies
Partner with data science, security, and product teams to set evaluation and governance standards (Guardrails, Bias, Drift, Latency SLAs)
Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments
Lead incident response and reliability strategies for ML/AI systems
Collaborate with development teams to integrate AI solutions into existing workflows and applications
Ensure seamless integration with different platforms and technologies

What we offer

Short-term and long-term incentive programs
Robust offering of benefit and wellness plans

Fulltime

Member of Technical Staff, Principal Engineering Manager

As Microsoft continues to push the boundaries of AI, we are on the lookout for s...

Location

United States, Redmond

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Demonstrated track record of building and scaling engineering organizations (hiring teams from scratch, structuring orgs, growing managers)
Experience delivering large-scale software systems in AI, machine learning, or related fields
Experience managing organizations of 30+ engineers across multiple teams and workstreams
Deep expertise in LLM evaluation, AI quality measurement, or ML infrastructure at scale
Track record of partnering with senior leadership (VP/CVP level) to set strategy and drive cross-organizational programs
Experience recruiting and developing senior engineering talent (principal engineers, engineering managers) in a competitive market
Proven ability to operate effectively in fast-paced, ambiguous environments — comfortable making decisions with incomplete information and course-correcting quickly
Strong technical judgment: ability to evaluate architectural tradeoffs, assess technical risk, and guide teams toward sound engineering decisions without needing to write the code yourself
Experience leading distributed or multi-site engineering teams.

Job Responsibility

Build and lead a multi-team engineering organization (30+ engineers across multiple teams), including hiring and developing engineering managers who lead their own teams
Set the technical and organizational strategy for Copilot AI Evaluation and response quality, aligning with MAI's broader product and engineering vision
Partner with senior Eng and Product leadership (Partner+ level) to define priorities, influence roadmaps, and drive cross-organizational initiatives
Own end-to-end delivery of evaluation platforms, novel evaluation techniques, and agentic solutions for measuring and improving Copilot quality at scale
Recruit, develop, and retain world-class engineering talent — building a culture of technical excellence, accountability, and continuous learning
Drive operational rigor: establish engineering processes, quality bars, and delivery cadences that enable predictable, high-quality execution across multiple concurrent workstreams
Navigate ambiguity and make high-judgment tradeoff decisions on technology, staffing, and investment priorities in a fast-moving AI landscape
Foster a diverse, inclusive team culture where engineers at all levels can do their best work and grow their careers
Embody our Culture and Values.

Fulltime

AI-First Core IT Software Engineering: Software, ML & Data

This is a Unified Application for our AI-First IT Transformation portfolio. We r...

Location

United States, Santa Clara

Salary:

Not provided

Palo Alto Networks

Expiration Date

Until further notice

Requirements

Experience in Software Engineering, Data Science, or Machine Learning: 3-5+ years (Staff level), 6-8+ years (Senior Staff level), or 8-12+ years (Principal level)
Expert-level server-side development (Python, Java, Go) OR deep expertise in statistical modeling, ML algorithms, and LLM fine-tuning
Direct experience with RAG architectures, LLM APIs, and Vector Databases (e.g., Pinecone, Milvus)
Hands-on experience with Kubernetes, CI/CD, and distributed systems for large-scale AI deployment

Job Responsibility

Lead the hands-on development of core Enterprise IT Business software leveraging AI components and LLM infrastructure with both traditional and Generative AI model deployment
Build and industrialize agentic AI systems and multi-agent frameworks, ensuring secure and effective use of GenAI technologies at the platform level
Design and implement robust foundational data pipelines, perform advanced statistical analysis, and develop new ML models to drive autonomous system behavior
Design large-scale, distributed AI/ML systems optimized for low latency, high throughput, and developer-friendliness (Inference optimization)
Establish evaluation frameworks to measure AI quality (accuracy, hallucination rates) and overall system reliability across the Enterprise AI Factory

Fulltime

Senior Staff Machine Learning Engineer – Agent Engineering

GEICO is seeking an experienced Sr Staff Machine Learning Engineer – Agent Engin...

Location

United States, New York City; Palo Alto; Chevy Chase

Salary:

130000.00 - 300000.00 USD / Year

Geico

Expiration Date

Until further notice

Requirements

10+ years of professional software development experience with at least two general-purpose programming languages such as Java, C++, Python, TypeScript, etc.
7+ years of experience architecting, building & deploying end-to-end AI solutions utilizing open-source/cloud-agnostic components such as search engines (e.g. Elasticsearch, Qdrant), data warehouses (e.g. Snowflake), streaming platforms (e.g. Kafka), relational databases (e.g. PostgreSQL), NoSQL databases (e.g. Cassandra), distributed processing (e.g. Spark, Ray), workflow orchestration (e.g. Airflow, Temporal), etc.
5+ years’ experience managing the end-to-end solution development life cycle, especially measurement and monitoring of operations metrics, analytical insights, and business outcomes via dashboards and other tools
Bachelor’s degree or above in Computer Science, Engineering, Statistics or a related field

Job Responsibility

Own design, development and maintenance of high-performance AI solutions that utilize agentic workflows to deliver concrete business value for internal stakeholders
Collaborate with cross-functional teams, including data scientists, ML engineers, software engineers, product managers, and designers, to gather requirements, define project scope, and prioritize feature backlogs
Contribute to the selection, evaluation, and implementation of software technologies, tools, and frameworks
Take ownership in project planning and stakeholder management
Mentor and guide junior engineers via code reviews and design sessions

What we offer

Comprehensive Total Rewards program
401K savings plan with 6% match
performance and recognition-based incentives
tuition assistance
mental healthcare
fertility and adoption assistance
workplace flexibility
GEICO Flex program (work from anywhere in the US for up to four weeks per year)

Fulltime

Senior Staff Machine Learning Engineer - Clinical - AI Teams

We are looking for a Senior Staff Machine Learning Engineer to join the Clinical...

Location

France, Paris

Salary:

Not provided

Doctolib

Expiration Date

Until further notice

Requirements

10+ years in ML/AI with 3+ years at Staff+ or Principal level leading complex, multi-team technical initiatives
PhD in Computer Science, AI, Statistics, or related field (or equivalent research experience)
Deep expertise in at least two of: clinical NLP, LLM fine-tuning and evaluation, automatic speech recognition, RAG systems, or reinforcement learning
Expert in Python, PyTorch/Transformers for training, and vLLM or similar for inference, with a track record of deploying ML systems in production (AWS/GCP)
Exceptional communication skills, with the ability to align diverse stakeholders and explain complex technical decisions

Job Responsibility

Own the long-term technical roadmap for clinical ML systems, from model architecture selection to production deployment patterns
Drive strategic build vs. buy decisions, balancing custom development with foundation model APIs
Define standards for safe rollout in healthcare contexts: shadow testing, staged deployment, and human-in-the-loop workflows
Lead design and implementation of LLM-powered clinical agents (consultation summarization, clinical coding, evidence-grounded recommendations)
Establish rigorous evaluation frameworks using real-world clinical datasets, ensuring outputs meet clinical accuracy and safety standards
Build production infrastructure: model and prompt versioning, guardrails, uncertainty quantification, cost optimization, and observability
Define a bold strategy for online and offline performance objectives to reach new levels of satisfaction among healthcare professionals
Mentor Staff or Senior ML Engineers and Applied Scientists, elevating technical standards across the organization
Lead cross-functional initiatives spanning Product, Medical Affairs, Legal, and Compliance teams
Translate clinical needs into research questions and production systems that measurably improve patient outcomes

What we offer

Free comprehensive health insurance for you and your children
Parent Care Program: receive one additional month of leave on top of the legal parental leave
Free mental health and coaching services through our partner Moka.care
For caregivers and workers with disabilities, a package including an adaptation of the remote policy, extra days off for medical reasons, and psychological support
Work from abroad for up to 10 days per year thanks to our flexibility days policy
Up to 14 days of RTT
A subsidy from the work council to refund part of the membership to a sport club or a creative class
Lunch voucher with Swile card

Fulltime

Senior Staff Research Scientist - Clinical AI Lab

At Doctolib, we're revolutionizing healthcare delivery through advanced AI syste...

Location

France, Paris

Salary:

Not provided

Doctolib

Expiration Date

Until further notice

Requirements

PhD or Master’s degree (plus significant experience) in Computer Science, Machine Learning, Mathematics, or a related field
Publications as first author in top-tier AI/ML conferences such as NeurIPS, ICLR, ICML, ACL, EMNLP, or relevant medical informatics venues are a plus
Deep understanding of machine learning and deep learning concepts, with hands-on experience in NLP, LLMs, or generative AI
Strong programming skills in Python (and ideally experience with ML frameworks such as PyTorch or JAX)
Familiarity with modern data tools and scalable training on large datasets
Track record of independently driving research and translating it into real-world applications
Experience designing experiments, evaluating intrinsic and extrinsic metrics, and iterating quickly
Curiosity and drive to learn, research, and apply new machine learning techniques
Comfortable working in highly cross-functional teams, with the ability to clearly communicate complex concepts to diverse audiences
Passionate about sharing knowledge and contributing to a culture of learning and innovation

Job Responsibility

Solve Real-World Healthcare Challenges: Design and build advanced machine learning models, especially in NLP and large language models, to improve healthcare workflows, enhance patient experiences, and power intelligent health solutions at scale
Innovate & Experiment: Research and prototype novel algorithms and architectures, staying at the forefront of AI/ML developments. Bring new ideas from concept to working prototypes, leveraging the latest advances in deep learning and generative AI
Collaborate Across Teams: Work closely with product, engineering, medical experts, and other stakeholders to translate business needs into impactful research projects. Communicate technical concepts and results clearly to both technical and non-technical audiences
Drive Impact: Analyze large, complex data sets to extract actionable insights, define key metrics, and rigorously evaluate model performance
Share & Learn: Advance Doctolib’s AI expertise by sharing findings within the team and with the wider ML community through publications, talks, and workshops. Mentor and support other scientists and engineers in adopting best practices

What we offer

Free comprehensive health insurance for you and your children
25 days of paid vacation per year, plus up to 14 days of RTT
Free mental health and coaching services through our partner Moka.care
Work from abroad for up to 10 days per year thanks to our flexibility days policy
Lunch vouchers (Swile card) worth €8.50 per working day, with €4.50 covered by Doctolib
A subsidy from the work council to refund part of the membership to a sport club or a creative class
50% reimbursement of your public transport subscription
Parent Care Program: receive one additional month of leave on top of the legal parental leave
For caregivers and workers with disabilities, a package including an adaptation of the remote policy, extra days off for medical reasons, and psychological support
Relocation support in case of international mobility

Fulltime
