Senior Software Engineer, Cloud Development

Canada · Job Posted June 30, 2026

Job Description

The AI Platform team is responsible for building the foundational infrastructure that powers intelligent experiences across Mozilla products. This includes model training pipelines, high-throughput inference services, GPU orchestration, and secure, privacy-respecting AI systems that operate reliably at global scale. We’re looking for a Senior Software Engineer with a strong platform mindset to help design, build, and operate Mozilla’s AI platform. In this role, you’ll work at the intersection of machine learning, distributed systems, and production infrastructure—ensuring that models can be trained, deployed, and served efficiently, securely, and at scale. You will collaborate closely with product, infrastructure, and security teams to enable fast iteration while meeting strict performance and privacy requirements.

Job Responsibility

  • Design, build, and operate core platform services and APIs used to deploy and serve production workloads at scale
  • Own service reliability end-to-end, driving improvements in availability, scalability, performance, and operational excellence
  • Lead efforts to optimize backend services for throughput, latency, and cost efficiency across distributed infrastructure
  • Design and manage Kubernetes-based workloads, including GitOps deployment pipelines, environment configuration, and resource utilization optimization
  • Own and improve critical parts of the service lifecycle, including packaging, versioning, testing strategies, validation, and deployment automation
  • Implement and evolve observability practices (metrics, logging, tracing, alerting) to improve visibility and operational resilience of backend services and pipelines
  • Partner closely with product, infrastructure, security, and data teams to design scalable platform capabilities that enable new product features
  • Contribute to technical design discussions, propose architectural improvements, and mentor junior engineers through code reviews and knowledge sharing
  • Participate in and help improve operational processes, including incident response, on-call rotations, and post-incident reviews

Requirements

  • Bachelor's degree with 4–6 years of relevant industry experience, or Master's degree with significant hands-on experience building and operating production systems, or equivalent work experience
  • Strong, modern Python skills, with experience writing clean, maintainable code and working with a fast toolchain (dependency management, linting, formatting, type checks, pre-commit), building both libraries and CLIs that output structured data
  • Advanced experience with database deployment and management; bonus points for familiarity with Postgres
  • Proven experience deploying and operating workloads in cloud environments, including production-grade infrastructure on GCP and GKE (artifact registries, managed caches, networking and internal load balancing, VPC, DNS, and separation of nonprod and prod)
  • Hands-on experience with Kubernetes and Helm, writing charts that deploy across environments with per-environment configuration and progressive feature rollout
  • Experience with Terraform for provisioning infrastructure across environments, including schema validation and PR-level plan review
  • Experience designing and running scalable APIs that hold up under load, including health and readiness checks, auth, and clean startup and shutdown
  • Experience with Grafana or similar tools for metrics, dashboards, and reading application and infrastructure health together during rollouts
  • Strong problem-solving skills and the ability to debug performance and reliability issues in distributed systems
  • Clear and effective communication skills, with experience collaborating across engineering, product, and infrastructure teams
  • On-call experience, including participating in incident response and post-incident reviews

Nice to have

  • Experience with Ray or Ray Serve for GPU-backed model serving, including setting resource requests and replica counts aligned with available hardware
  • Experience building stateless ML services such as embedding or similarity models, including multi-model loading, runtime device selection, batch APIs, and handling model-cache and cold-start tradeoffs
  • Experience running a multi-provider LLM gateway, including routing between providers, migrating models, and mixing self-hosted with third-party serving
  • Familiarity with containerization and orchestration systems in production environments beyond core Kubernetes/Helm usage
  • Exposure to privacy-preserving ML techniques, security best practices, or responsible AI system design
  • Contributions to open-source infrastructure projects or leadership in building reusable internal tooling

What we offer

  • Generous performance-based bonus plans for all eligible employees
  • Rich medical, dental, and vision coverage
  • Generous retirement contributions with 100% immediate vesting
  • Quarterly all-company wellness days
  • Country-specific holidays plus a day off for your birthday
  • One-time home office stipend
  • Annual professional development budget
  • Quarterly well-being stipend
  • Considerable paid parental leave
  • Employee referral bonus program
  • Other benefits (life/AD&D, disability, EAP, etc.)

Similar Jobs for Senior Software Engineer, Cloud Development

8 matching positions

Software Engineer / Senior Software Engineer - CoreAI

Azure DevOps is a suite of modern development services that enables software dev...
Location: Czech Republic, Prague
Salary: Not provided
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Solid software development experience
  • Demonstrable experience with C#, C++, Java or any other OOP language
  • Strong analytical skills as well as communication skills both verbal and written
  • Ability to understand unfamiliar code bases, debug client and service side applications (including database stored procedures)
  • Knowledge and experience with Microsoft Azure, AWS or similar cloud computing platforms is preferred
  • Experience with SQL performance tuning (preferably Microsoft SQL Server)
  • Solid understanding of testing principles
  • Ability to prioritize and handle multiple tasks completely and independently and generate clarity in ambiguous situations
  • Troubleshooting skills across network, application, caching, queuing, load-balancing, storage, and distributed services layers
Job Responsibility
  • Design, develop, test, and support features and experiences
  • Collaborate on the design and development of features and solutions, contributing to technical direction across business scenarios
  • Support highly available services used by top companies and millions of developers on a daily basis
  • Troubleshooting of complex issues through the entire tech stack including frontend and database layers
  • Participate in on-call rotations with your team. Triage and respond to issues and advocate for opportunities to improve service health
  • Collaborate through pairing and code reviews and contribute to a culture of learning and growth
  • Fulltime

Backend Software Engineer / Senior Software Engineer - Kusto

Are you excited by the challenge of redefining how people explore and analyze ma...
Location: Israel, Tel Aviv, Herzliya
Salary: Not provided
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • 4+ years of technical engineering experience with coding in languages including, but not limited to, C#, Python or Java
  • 2+ years building and running services in a cloud environment (Azure, AWS, or GCP)
  • Experience in designing and operating large-scale distributed systems with high availability and reliability
Job Responsibility
  • Design, develop, and improve cloud-native services that are scalable, secure, and easy to operate
  • Drive architectural decisions and lead the development of major components in a distributed, high-SLA system
  • Collaborate with cross-functional teams in ILDC and abroad to deliver end-to-end solutions
  • Conduct code and design reviews and mentor junior engineers to grow technical excellence across the team
  • Help shape the future of real-time analytics in Microsoft Fabric RTI, with customer impact as your north star
  • Fulltime

Software Engineer / Senior Software Engineer

ARiA is looking for highly motivated self-starters and low-ego team players to j...
Location: United States, Madison; Alexandria; Seattle
Salary: Not provided
Applied Research in Acoustics (ariacoustics.com)
Expiration Date: Until further notice
Requirements
  • Applicants selected for employment will be subject to a government security investigation and must meet eligibility requirements, including U.S. citizenship, for access to sensitive information
  • Bachelor’s degree or greater in a relevant technical field (Computer Science, Engineering, or equivalent)
  • Expertise designing and developing code using modern programming/scripting languages such as C, C++, Golang, JavaScript (and variants), and Python
  • Expertise developing and deploying software in an agile, continuous-integration (CI) framework across a variety of hardware platforms (desktop, server, cloud) using modern tools including containerization (e.g., Docker, Kubernetes)
  • Exceptional ability and desire to acquire new knowledge and skills to solve challenges
  • Ability to work independently but collaboratively
  • Ability to manage multiple projects in a fast-paced professional office environment
  • Ability to communicate technical solutions to colleagues and customers
  • Superior oral and written communications skills
Job Responsibility
  • Algorithm and software design, development, research, and testing to support prototypes and products
  • Supporting the transition of research algorithms to fielded systems
  • Preparing documentation to summarize design and status of prototypes and products
  • Assisting with in-field integration, testing, and support, with some local travel required
  • Developing an interface between a C++ underwater-acoustics physics engine and a video game for education and training
  • Developing a JavaScript backend for a scenario-design and management tool for players and integration of that system with a learning-management system (LMS)
  • Developing algorithms and software for a cloud-deployed cognitive tool that allows natural-language query of legal documents to answer user questions about government regulations and supporting the DevOps process for deployment of the prototype
  • Fulltime

Software Engineer II & Senior Software Engineer

Attacks begin on the endpoint—it’s where an intruder first lands, runs malware, ...
Location: United States, Redmond
Salary: 102100.00 - 202200.00 USD / Year
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer, and/or government security screening requirements
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years.
Job Responsibility
  • Build and improve detection, OS event monitoring, telemetry, and reporting in the on-device agent
  • Develop native components that detect and respond to real-world endpoint threats
  • Design, deliver, and maintain components with a focus on reliability, performance, and operational health
  • Help ensure the agent remains efficient, safe, and aligned with the macOS security model
  • Contribute to performance, concurrency, and compatibility across macOS versions
  • Produce high-quality telemetry that supports investigation, analytics, and security insights
  • Investigate and resolve issues such as crashes, performance regressions, and compatibility challenges
  • Apply secure coding practices and validate assumptions to build resilient systems
  • Collaborate with engineers and partner teams to deliver integrated security solutions
  • Contribute to design discussions, code reviews, and shared engineering practices
  • Fulltime

Software Engineer 2 / Senior Software Engineer - Azure Data

Microsoft's Azure Data engineering team is leading the transformation of analyti...
Location: India, Bangalore
Salary: Not provided
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience with the Azure stack including Storage, Compute, Networking, Fabric, Purview, Synapse, AKS, DevOps, Data Factory, or Power BI
  • Experience with big data technologies such as Spark, Kafka, Hadoop, or HBase
  • Experience building data lake or data engineering products, tools, or pipelines
  • Familiarity with container-based architectures (Docker, Kubernetes)
  • Ability to debug complex distributed systems on Linux and/or Windows platforms
Job Responsibility
  • Write extensible, maintainable code in C#, Java, Scala, or Python for Fabric Materialized Lake View services and HDInsight components
  • Use AI tools and coding best practices across the development lifecycle
  • Design data refresh, scheduling, and query optimisation features with minimal supervision
  • Review code from teammates for correctness, test coverage, security risks, and adherence to team standards
  • Coach junior engineers through code reviews
  • Debug complex issues in distributed systems running on Azure, Linux, and Windows
  • Run live site operations on a rotational, on-call basis
  • Integrate logging and instrumentation to gather telemetry on system health, performance, reliability, and security
  • Work with product managers, technical leads, and partners across geographies to define customer requirements for Materialized Lake View features
  • Fulltime

Software Engineer II / Senior Software Engineer

We are hiring multiple Software Engineers II and Senior Software Engineers to jo...
Location: Costa Rica, Multiple Locations
Salary: Not provided
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science or Engineering OR equivalent experience
  • At least 3 years of experience in software design and backend service development
  • 3+ years of hands-on experience in an object-oriented language such as C++, C#, Java, Python, or equivalent experience with C
  • Proficiency in English is essential for effective communication
Job Responsibility
  • Drive projects through to delivery, removing obstacles to ensure success
  • Contribute to the design of large-scale distributed software services and solutions
  • Write clean, well-tested code
  • Champion and improve team software engineering practices
  • Mentor junior engineers
  • Collaborate with related teams to achieve goals
  • Promote knowledge sharing and contribute to onboarding documentation
  • Participate in the on-call rotation to monitor service health and manage incidents
  • Fulltime

Software Engineer 2 / Senior Software Engineer

We are looking for experienced Software Engineers for our Bangalore location ...
Location: India, Bengaluru
Salary: Not provided
Komprise, Inc. (komprise.com)
Expiration Date: Until further notice
Requirements
  • Solid grasp of computer science fundamentals, especially data structures, algorithms, and multi-threading
  • Ability to solve difficult problems with simple, elegant solutions
  • A solid object-oriented programming background with impeccable design skills
  • Experience in developing management applications and performance management applications is ideal
  • Experience with object-based file systems and REST interfaces is a plus (e.g. Amazon S3, Azure, Google Cloud Service)
  • A BE or higher in CS, EE, Math, or a related engineering or science field
  • At least 5 years of experience in software deployment
  • Tech Stack: Java, Maven, Virtualisation, SaaS, GitHub, Jira, Slack, Cloud Solutions and Hypervisors
Job Responsibility
  • Design and develop features that power the Komprise data management platform, which manages billions of files and petabytes of data
  • Design major components and systems of the product architecture, ensuring that the Komprise data management platform is highly available and scalable
  • Write performant code, evaluate feasibility, develop for quality, and optimize for maintainability
  • Work in an agile, customer-focused, fast-paced team with direct interaction with customers
  • Analyse customer-escalated issues and provide resolutions in a timely manner
  • Design and implement highly performant, scalable distributed systems

Senior Software Engineer / Principal Software Engineer - Copilot CLI

Within GitHub and Microsoft CoreAI, the Copilot CLI team builds GitHub's coding ...
Location: United States, Redmond
Salary: 119800.00 USD / Year
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years
Job Responsibility
  • Take ownership of critical product and platform areas of the Copilot CLI and shared agent runtime
  • Set a high technical and quality bar for agentic systems and developer-facing tooling
  • Design and ship performant, reliable terminal experiences that developers depend on for daily work
  • Use data, benchmarks, and direct user feedback to guide iteration and investment
  • Collaborate across org boundaries to enable other teams to build agentic products on top of a shared foundation
  • Influence architecture, technical direction, and engineering standards beyond your immediate team
What we offer
  • Certain roles may be eligible for benefits and other compensation
  • Fulltime