CrawlJobs Logo

Staff Software Engineer, AI & Automation

United States 138000.00 - 217000.00 USD / Year · Job Posted January 08, 2026
Apply Position
Job Link Share

Job Description

Mozilla is seeking a Staff Software Engineer to lead the next evolution of Service Desk and digital workplace operations. This role goes beyond traditional IT support: it is centered on AI-driven service enablement, automation, and platform ownership. You will be a single-threaded owner for key internal platforms and Enterprise AI-powered services, ensuring they deliver maximum value to the business. You’ll architect intelligent workflows, integrate SaaS and AI systems, and drive adoption across the organization — all while staying connected to the user experience through escalations, feedback loops, and advanced troubleshooting. This is a lead-level role for someone who thrives at the intersection of technology, AI, security, and human experience — with deep experience in high-end Python scripting to build, scale, and maintain automation frameworks and integrations across Mozilla’s internal tools ecosystem.

Job Responsibility

  • Roll out new collaboration and Enterprise GenAI capabilities: run pilots, manage change/comms, create guides, and deliver training that drives adoption and safety
  • Design and ship automations (scripts, workflows, chatbots, AI agents) to streamline provisioning/deprovisioning, entitlements, group lifecycle, and tier-1 deflection
  • Build monitoring and alerting for SaaS health and key workflows
  • lead incident, problem, and change practices to reduce MTTR and prevent recurrences
  • Define SLAs/SLOs, instrument CSAT and operational metrics, and run regular service reviews that turn data into improvements
  • Serve as technical escalation for complex tickets: drive resolution, perform root-cause analysis, and capture knowledge to uplevel the Service Desk
  • Maintain clean, current documentation: runbooks, KB articles, architecture diagrams, and admin playbooks
  • Champion a 'shift-left' model: expand self-service portals, guided flows, and AI virtual agents to improve velocity and user experience
  • Enterprise AI Security & Governance: implement guardrails for AI/automation usage, including access controls, data handling policies, compliance alignment, and ongoing risk monitoring
  • Cross-Functional Partnership: collaborate with People, Security, Finance, Legal, and Procurement to ensure platforms align with organizational goals
  • Vendor & Contract Management: lead relationships with SaaS and AI providers, negotiate terms, and ensure strong ROI
  • Mentor & Lead: coach P3 engineers in automation, AI prompt engineering, and scalable integration design

Requirements

  • 7+ years in IT engineering, enterprise applications, or service delivery
  • ITIL 4 Foundations certification (required)
  • Comfortable leading incident, problem, and change management (ITIL), measuring SLAs/SLOs, and using data to drive continuous service improvements
  • Demonstrated success owning platforms end-to-end, including lifecycle management, integrations, and adoption
  • Advanced proficiency in Python scripting — with proven experience developing, optimizing, and maintaining automation frameworks, APIs/webhooks, and integration logic across enterprise tools
  • Strong automation & scripting skills in one or more of: Python
  • with experience building APIs/webhooks, workflow engines, and chatbots
  • Advanced experience in AI & automation platforms (GenAI copilots, RPA, intelligent workflows), including security and governance guardrails for safe adoption
  • A builder’s mindset: bias for automation and documentation, empathy for end users, and a willingness to mentor Service Desk colleagues
  • Strong program/project management skills with proven ability to independently deliver initiatives
  • Ability to translate ambiguous business needs into structured, scalable solutions
  • Leadership presence and credibility with executives and peers
  • High tolerance for ambiguity with skill in bringing clarity and structure

What we offer

  • Generous performance-based bonus plans
  • Rich medical, dental, and vision coverage
  • Generous retirement contributions with 100% immediate vesting
  • Quarterly all-company wellness days
  • Country specific holidays plus a day off for your birthday
  • One-time home office stipend
  • Annual professional development budget
  • Quarterly well-being stipend
  • Considerable paid parental leave
  • Employee referral bonus program
  • Other benefits (life/AD&D, disability, EAP, etc. - varies by country)
  • Flexible work environment
  • Industry-leading paid parental leave (up to 26 weeks for childbearing parents, up to 12 weeks for non-childbearing parents)
  • Reimbursement for professional development (up to $3,000/year)
  • A work setup including the latest hardware and software of your choice

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Staff Software Engineer, AI & Automation

8 matching positions

Senior Staff Software Engineer - AI

GEICO is seeking an experienced Engineer with a passion for building high-perfor...
Location
Location
United States , Seattle, WA; Austin, TX; Palo Alto, CA; Chicago, IL; Dallas, TX
Salary
Salary:
110000.00 - 230000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience building and deploying ML systems in production with cross-functional engineering teams
  • Fluency in at least two modern languages such as Python, Go, Java, C++, or C# including object-oriented design
  • Experience architecting multi-component ML platforms using open-source/cloud-agnostic components: Datastores: PostgreSQL, NoSQL (MongoDB, Cassandra, CosmosDB) Streaming: Kafka, Flink, or Spark Streaming
  • Experience with end-to-end ML lifecycle: version control, CI/CD, Kubernetes, testing, monitoring, and production support
  • Experience with cloud providers (Azure, AWS or GCP) in production ML environments
  • Experience with observability tools and distributed systems monitoring, logging, tracing, and root cause analysis
  • Experience building multi-agent systems using LLMs and agentic frameworks (e.g., LangChain, LangGraph, AutoGen, Semantic Kernel, CrewAI)
  • Hands-on experience with RAG, semantic search, and vector databases (e.g., Milvus, pgvector, Qdrant, ElasticSearch)
  • Experience designing human-in-the-loop workflows and safety controls for autonomous systems
  • Strong architecture and design skills with ability to influence technical direction and roadmap
Job Responsibility
Job Responsibility
  • Design and build a multi-agent AI platform where specialized agents autonomously detect, diagnose, and resolve issues through agent-to-agent (A2A) collaboration
  • Develop intelligent agents using LLMs and agentic frameworks that coordinate detection, diagnostic, remediation, and knowledge tasks with minimal human intervention
  • Define agent interaction protocols, A2A communication standards, and evaluation frameworks for agent decision quality and autonomous action safety
  • Architect vector database solutions (Milvus, pgvector, Qdrant) for semantic search and RAG to enable context-aware agent decision-making
  • Build end-to-end ML pipelines for severity classification, anomaly detection, failure pattern recognition, and impact forecasting using observability data
  • Establish scalable orchestration infrastructure for multi-agent workflows with CI/CD, automated evaluation, canary releases, and rollback strategies
  • Implement monitoring for agent interactions, A2A communication patterns, decision quality, data drift, and system reliability
  • Lead technical architecture ensuring scalability, observability, and integration with existing alerting, logging, and monitoring systems
  • Define standards for agent safety, explainability, governance, and human-in-the-loop controls for high-impact automated actions
  • Partner with SRE, Product, and Engineering teams to translate reliability goals into measurable ML objectives and maintain pragmatic technical roadmaps
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Staff Software Engineer - AI Applications

Vanilla is seeking a Staff Software Engineer - AI Applications with a strong bac...
Location
Location
United States
Salary
Salary:
190000.00 - 210000.00 USD / Year
justvanilla.com Logo
Vanilla Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, a related field, or equivalent practical experience
  • 8+ years relevant work experience
  • Proficiency in modern programming languages such as Python or Javascript
  • Experience with OpenAI, Anthropic, or similar for both chat and API interfaces
  • Deep understanding of machine learning and AI technologies, including the ability to design, train, and implement machine learning models and use natural language processing techniques for automation
  • Production experience with scalability and best-practices of AI infrastructure
  • Must have experience with AI observability, monitoring, and signaling using tools like LangChain or LangGraph
  • Hands-on experience using RAG and chunking to tune LLM performance
  • Experienced with LLM orchestration tooling and decision frameworks
  • Experience or exposure building agentic capabilities and workflows
Job Responsibility
Job Responsibility
  • Machine learning and AI: You are passionate and knowledgeable about the current and future state of AI
  • You will be utilizing existing Large Language Models to build applied AI applications focused on producing high accuracy rates. Your software engineer skills will come into play here as you'll take ownership in constructing services to ingest results
  • You will work with product, and engineering teams and build models/services that can ingest data, extract key information and surface insights
  • You can build tooling to support model training, evaluation, inference serving, monitoring and alerting
  • You want to use the latest ML frameworks and open source tools to develop new model training pipelines
  • Hands On Coding: You have direct experience with software engineering and are familiar with modern languages like Python, Javascript, Go, Rust
  • You have experience building microservices and understand the tradeoffs of the approach
  • Data handling: You can identify, extract, transform, and load data from disparate sources into a centralized system. You are able to normalize, cleanse, and validate this data
  • Database management: You are able to design and implement schemas, optimize queries, and manage database performance
  • Project management: You must be an effective self-organizer: prioritize tasks, manage resources, and communicate effectively with non-technical stakeholders
What we offer
What we offer
  • Flexible paid time off policy and 10 company-wide paid holidays
  • Parental leave, 4 weeks for all full-time employees and up to 12 weeks for birthing parents
  • Medical, dental, and vision benefits coverage for employees and their families
  • 401K eligibility after one month of employment
  • Free estate planning documents
  • Budget for learning & development and home office setup
  • Paid parking or transit for hybrid and in office employees
  • Fulltime
Read More
Arrow Right

Senior Staff Software AI Test Engineer- Prisma SASE

We are seeking Test Engineers with a strong Automation First Mindset as we scale...
Location
Location
United States , Santa Clara
Salary
Salary:
126000.00 - 204500.00 USD / Year
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Automation skills - Python, Playwright
  • Experience with building automation frameworks and leading the automation effort for the team
  • Working Knowledge of CI/CD pipelines
  • Experience with Cloud Technologies such Aws/Azure/GCP
  • Knowledge of common security related protocols and their design (i.e. SSH, IPsec,TCP/IP, DNS, TLS, SSL etc.)
  • Demonstrated ability to learn quickly and to work in a fast paced, innovative environment learning new technologies and multi-tasking
  • 8+ years of experience
  • Bachelor’s Degree OR Master in Computer Science/Engineering/Networking or equivalent military experience required
Job Responsibility
Job Responsibility
  • Develop and execute sophisticated software tests and frameworks to validate Prisma SASE Functionality and Scale, working closely with Development, Product Management, SRE and Technical Marketing teams
  • Provide Thorough Technical Leadership in the areas of Cloud Based Orchestration, Cloud delivered Security, Cloud Networking and Automation Design
  • Participate in system design so that Quality Assurance is considered throughout the entire lifecycle of the Prisma Access Feature Development
  • Develop and/or Enhance Automated test Infrastructure to enable building Scalable & Flexible tests that reflect real world network deployment scenarios
  • Enhance Test strategies, Automation & Build infrastructure with feedback and analysis from real-world Customer deployments
  • Fulltime
Read More
Arrow Right

Staff Software Engineer – Applied AI

Lead the design and delivery of end-to-end AI applications, from discovery and p...
Location
Location
United Arab Emirates , Dubai
Salary
Salary:
Not provided
weareorbis.com Logo
Orbis Consultants
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years’ experience building production-grade software
  • Strong backend capability (Python preferred, but stack-agnostic mindset)
  • Hands-on experience or strong interest in LLMs / GenAI (LangChain, vector DBs, model tooling, eval frameworks etc.)
  • Comfortable owning projects end-to-end and interacting directly with technical stakeholders
  • Startup mentality – high ownership, adaptable, and excited by ambiguity
Job Responsibility
Job Responsibility
  • Architecting and deploying custom AI solutions (automation, agents, evaluation frameworks, internal AI tooling)
  • Working directly with senior stakeholders (including CTO-level) on requirements and trade-offs
  • Leading technical direction across projects
  • Shaping engineering standards and culture as the team scales
What we offer
What we offer
  • Front-row seat to real-world enterprise AI deployment
  • Exposure to a wide range of industries and use cases
  • Senior, high-calibre engineering environment
  • Opportunity to shape a new regional presence in Dubai
  • Fulltime
Read More
Arrow Right

Staff Infrastructure Software Engineer - AI Platform

We are currently seeking a Staff Software Engineer to join the AI Platform team ...
Location
Location
United Kingdom , Edinburgh
Salary
Salary:
Not provided
addepar.com Logo
Addepar
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extensive experience as a Software/Backend Engineer, with a track record of taking on increasing responsibility
  • Experience across the full product lifecycle: designing, implementing, shipping, scaling, operationalizing, and maintaining technology/SaaS products
  • Exceptional Programming skills and fundamentals in Python/Go/Java, with a proven track record of building large scale production systems
  • Proficient experience with diverse compute environments including microservices (K8s), Databricks and serverless architectures (e.g. AWS Lambda)
  • Demonstrable experience leading initiatives with infrastructure-as-code tools such as Terraform in complex, multi-account environments
  • Proficient experience with comprehensive monitoring and alerting stacks (e.g. Prometheus/Grafana/Sentry/cloud-native tools), with a focus on observability strategy
  • Excellent interpersonal and communication skills to effectively collaborate with multi-functional teams, articulate complex technical concepts, and influence outcomes
Job Responsibility
Job Responsibility
  • Design and build the production runtime for LLM-based agents and products, creating the services and infrastructure that serve autonomous agents
  • Develop deep application-level knowledge to proactively inform and influence requirements, constraints and best practices for implementing composable, complex AI systems
  • Lead the design, implementation, and automation of production infrastructure on a variety of cloud environments (Kubernetes/Databricks), to enable us to ship and scale AI features instantly
  • Evangelize and promote disciplined, best engineering practices to enforce strong production hygiene and culture
  • Initiate and lead collaborations with cross-functional teams to identify and resolve complex application or infrastructure issues, serving as a technical subject matter expert
  • Architect, build, and maintain advanced, automated CI/CD pipelines e.g. using Jenkins, ArgoCD, AWS CodeBuild/Pipeline, GitHub Actions, or similar, establishing best practices for deployment strategies (e.g., blue/green, canary)
  • Develop systems and best practices monitoring, alerting, and troubleshooting of our probabilistic and AI-driven systems and broader software stack
Read More
Arrow Right

Staff Software Engineer, Backend (AI Platform)

Cresta is on a mission to turn every customer conversation into a competitive ad...
Location
Location
United States
Salary
Salary:
Not provided
cresta.com Logo
Cresta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years writing production software
  • 2+ years focused on ML platform or infra
  • Expert Python (async, typing, packaging, performance)
  • Working Golang knowledge for systems components
  • Proven experience with one or more serving frameworks (e.g., vLLM, Triton, TorchServe)
  • Kubernetes and cloud-native ops
  • Solid grasp of distributed systems, networking, and container security
  • Culture of rigorous testing, code review, and continuous delivery
Job Responsibility
Job Responsibility
  • Own model serving: Design, build, and maintain low-latency, highly-available serving stacks for in-house ML model serving and integrating with LLM serving partners
  • Automate training pipelines: Orchestrate data prep, training, evaluation, and registry workflows on Kubernetes with solid MLOps practices
  • Optimize at scale: Profile and tune throughput, memory, and cost
  • introduce caching, sharding, batching, and GPU/CPU autoscaling where it pays off
  • Build platform primitives: Create reusable SDKs, templates, and CLI tools that let research and product teams ship models independently and safely
  • Raise the bar: Instrument deep observability (tracing, metrics, alerts), drive blameless post-mortems, and mentor engineers on production ML best practices
What we offer
What we offer
  • Comprehensive medical, dental, and vision coverage with plans to fit you and your family
  • Flexible PTO to take the time you need, when you need it
  • Paid parental leave for all new parents welcoming a new child
  • Retirement savings plan to help you plan for the future
  • Remote work setup budget to help you create a productive home office
  • Monthly wellness and communication stipend to keep you connected and balanced
  • In-office meal program and commuter benefits provided for onsite employees
Read More
Arrow Right

Staff Infrastructure Software Engineer, Enterprise AI

Scale GP is building the next generation of enterprise-grade Generative AI produ...
Location
Location
United States , New York; San Francisco
Salary
Salary:
216200.00 - 270250.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience in a senior role
  • 5+ years of full-time software engineering experience
  • Deep understanding of modern infrastructure practices, including CI/CD, IaC (e.g., Terraform, Helm Charts), container orchestration (e.g., Kubernetes) and observability platforms (e.g., Datadog, Prometheus, Grafana)
  • Extensive experience with at least one major cloud provider (AWS, Azure, or GCP)
  • Strong knowledge of security and compliance in enterprise environments, with a focus on access management, data isolation, and customer-specific VPC setups
  • Proficiency in Python or JavaScript/TypeScript, and SQL
Job Responsibility
Job Responsibility
  • Define the architectural patterns for our multi-cloud infrastructure to support secure, reliable, and scalable Agentic workflows for enterprise customers
  • Lead the infrastructure roadmap with a strong focus on compliance, privacy, and security standards, including designing change management and data isolation strategies
  • Own the development and maintenance of our best-in-class Agentic observability platform (logging, metrics, tracing, and analytics) to proactively ensure system health and enable rapid incident response
  • Drive developer efficiency by building automated tooling and championing Infrastructure-as-Code (IaC) paradigms throughout the engineering organization
  • Solve the toughest engineering problems related to multi-tenancy, data isolation, and high-performance inference at a massive scale, taking end-to-end ownership across the full product lifecycle
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • equity based compensation
  • additional benefits such as a commuter stipend
  • Fulltime
Read More
Arrow Right

Staff/Sr. Staff Ai Engineer - Enterprise Ai Solutions

As a Staff/Sr. Staff AI Engineer for Enterprise AI Solutions, you will be a crit...
Location
Location
United States , Santa Clara
Salary
Salary:
157200.00 - 254100.00 USD / Year
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years in software engineering, with 5+ years in Data science/AI/ML experience
  • Experience deploying enterprise AI/ML systems
  • Understanding of the AI lifecycle
  • Experience with distributed systems and streaming data
  • Hands-on experience with Generative AI and LLMs
  • Proficiency in AI/ML frameworks and cloud platforms
  • Strong programming skills (Python and C++)
  • Excellent communication skills
  • Masters/Bachelors degree in Computer Science or related field
Job Responsibility
Job Responsibility
  • Lead the design and implementation of AI solutions, translating business problems into AI designs, and managing model selection, data requirements, and integration for AI applications
  • Develop and implement core components of the enterprise AI/ML platform, ensuring scalability and security
  • Contribute to the lifecycle of traditional and Generative AI model deployment and real-time inference systems
  • Design and optimize large-scale AI/ML systems for performance, reliability, and developer-friendliness, focusing on low latency and high throughput in real-time AI applications
  • Evaluate and integrate new AI tools, frameworks, and cloud solutions, aligning with architectural guidelines
  • Lead POCs for emerging AI innovations
  • Champion design standards and best practices for AI systems, guiding junior engineers
  • Mentor AI/ML engineers, lead design discussions, and perform code reviews, fostering engineering excellence
  • Partner with Data Scientists, ML Engineers, Product Managers, and IT stakeholders to develop production-grade AI solutions
  • Ensure AI systems comply with responsible AI principles and security policies
What we offer
What we offer
  • Restricted stock units
  • Bonus
  • Fulltime
Read More
Arrow Right