Software Engineer, Platform Infrastructure

United States, San Mateo · 120000.00 - 220000.00 USD / Year · Job Posted January 06, 2026

Job Description

As a member of our cloud infrastructure team, you will help design, build and maintain highly scalable and reliable systems powering Verkada’s services. Example projects include scaling microservice clusters, automating serverless deployment, adopting a full service mesh and enhancing observability. You'll have the opportunity to take full ownership of a subdomain and lead cross-team efforts.

Job Responsibility

  • Design, develop, test, and deploy software to manage and automate our cloud infrastructure
  • Collaborate with other engineering teams to understand their infrastructure needs and provide robust solutions
  • Contribute to the architecture and design of new and existing cloud services
  • Implement and maintain monitoring, alerting, and logging systems to ensure high availability and performance

Requirements

  • Must have a BS, MS, or PhD in Computer Science, or similar technical field of study
  • 1+ years of experience with cloud infrastructure or backend systems
  • Strong problem solving and troubleshooting skills
  • Experience in Python and/or Go
  • Experience with one of the major cloud platforms (preferably AWS)
  • Strong written and verbal communication skills

Nice to have

  • Experience with Terraform
  • Experience with Kubernetes
  • Experience with a service mesh such as Istio
  • Experience with observability tools such as Datadog, Prometheus or OpenTelemetry
  • Experience contributing to open source projects

What we offer

  • Healthcare programs that can be tailored to meet your personal health and financial well-being needs. Premiums are 100% covered for the employee under at least one plan and 80% for family premiums under all plans
  • Nationwide medical, vision and dental coverage
  • Health Savings Account (HSA) with annual employer contributions and Flexible Spending Account (FSA) with tax saving options
  • Expanded mental health support
  • Paid parental leave policy & fertility benefits
  • Time off to relax and recharge through our paid holidays, firmwide extended holidays, flexible PTO and personal sick time
  • Professional development stipend
  • Fertility Stipend
  • Wellness/fitness benefits
  • Healthy lunches provided daily
  • Commuter benefits

Similar Jobs for

Software Engineer, Platform Infrastructure

8 matching positions

Software Engineer II and Senior Software Engineer- Microsoft Security - Platform Team

We have multiple positions open for Software Engineers and Senior Software Engin...
Location
Israel, Tel Aviv, Herzliya
Salary
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • B.Sc. or M.Sc. in computer science, software engineering, or equivalent experience
  • 3+ years of professional hands-on software development experience, primarily focused on developing and designing backend services in cloud or on-premises environments
  • Experience working with Kubernetes and Containers
  • Experience in working with cloud infrastructure and services
Job Responsibility
  • Contribute to business-critical initiatives in Microsoft Security
  • Apply deep technical skills and adapt quickly to new areas
  • Improve the end-to-end lifecycle of services
  • Analyze complex system behavior, and apply modern engineering practices to streamline deployments and reduce costs
  • Work on high-end technologies and collaborate across disciplines to deliver impactful features
  • Collaborate with multiple teams across Microsoft to deliver key customer solutions and support technology
  • Fulltime

Software Engineer, Platform (Infrastructure)

We’re looking for an exceptional software engineer with a systems mindset to bui...
Location
Sweden, Stockholm
Salary
Not provided
Lovable
Expiration Date
Until further notice
Requirements
  • Deep experience building and operating production infrastructure as a Software Engineer, Systems Engineer, Site Reliability Engineer or similar
  • You write clean, performant code and understand what happens all the way down - from the API layer to the runtime
  • Familiarity with container orchestration and sandboxing at a level deeper than configuration. You understand how schedulers, runtimes, and isolation boundaries actually work, not just how to deploy on top of them
  • Strong proficiency in at least one systems-oriented language (Go, Rust, C++, or similar)
  • You have a track record of designing services that handle high throughput and unpredictable load at a global tech company or scale-up
  • You’re comfortable navigating ambiguity and solving problems as they arise
  • You care about security, stability, and speed and know when to make trade-offs between them
  • You’re based in Stockholm or ready to relocate - this is an on-site, 5-days-a-week role
Job Responsibility
  • Design, build and own the systems that make AI engineering work for everyone
  • Design, build and maintain the systems that enable our AI product, such as a gVisor-based runtime environment for running agentic workloads and a high-throughput sandbox scheduler across multiple cloud providers, built on a deep understanding of the orchestration layer rather than off-the-shelf abstractions
  • Harden our infrastructure against failures, downtime, and slowdowns
  • Support our growth by making sure that our infrastructure never becomes a bottleneck
  • Plan and implement our network infrastructure and cloud strategy
  • Identify and drive reliability improvement efforts across all engineering teams
  • Fulltime

Staff Software Engineer, Platform Infrastructure

We are seeking an experienced and highly motivated Staff Software Engineer to le...
Location
United States, Pittsburgh
Salary
171000.00 - 273000.00 USD / Year
Aurora Innovation
Expiration Date
Until further notice
Requirements
  • Senior or Staff-level experience (P7 equivalent) as a Software Engineer, ideally in infrastructure, developer tooling, or critical shared services
  • Proven experience leading technical projects and mentoring/directing other engineers
  • Familiarity with distributed compute technologies, cloud services (e.g., AWS), and large-scale workflow management systems
  • Demonstrated ability to triage, debug, and perform on-call and incident management for complex, cross-cutting infrastructure issues
  • Strong communication skills to manage stakeholder alignment and drive cross-team standardization efforts
Job Responsibility
  • Lead the OTI Team: Serve as the technical lead (TL) for the OTI team within PIE-Compute, driving the strategic vision, execution, and long-term stability of the core infrastructure
  • Help Define and Optimize the Testing Ecosystem: Lead the design of the next-generation offline testing architecture to meet diverse team needs, reducing redundancy and siloing across the organization
  • Partner with Test Creation and Test Drive teams to standardize end-to-end test execution and reporting (Creation -> Execution -> Reporting)
  • Refine the full test lifecycle to ensure performance and scalability, and maintain clear attribution of failures to enhance reliability and efficient debugging
  • Own Critical OTI Components and Migrations: Take ownership of the shared OTI components, including maintenance and on-call support
  • Own various offline test modalities, including step code, workflow code, and general health
  • Lead the maintenance and development of common OTI tooling, including launching test evaluations, polling APIs, communicating results, and providing recommended pipeline templates
  • Establish Architecture and Best Practices: Define and enforce data management policies for the testing ecosystem (storage, lifecycling, write strategies, data integrity, and lineage)
  • Define use cases and feature design for new test modalities, including single versus cross-modality testing strategies
  • Manage incidents related to offline tests and maintain Standard Operating Procedures (SOPs) for PRs, local workflows, V&V, and releases
What we offer
  • annual bonus
  • equity compensation
  • benefits
  • Fulltime

Staff Software Engineer, Platform Infrastructure

We are seeking an experienced and highly motivated Staff Software Engineer to le...
Location
United States, Mountain View
Salary
189000.00 - 303000.00 USD / Year
Aurora Innovation
Expiration Date
Until further notice
Requirements
  • Senior or Staff-level experience (P7 equivalent) as a Software Engineer, ideally in infrastructure, developer tooling, or critical shared services
  • Proven experience leading technical projects and mentoring/directing other engineers
  • Familiarity with distributed compute technologies, cloud services (e.g., AWS), and large-scale workflow management systems
  • Demonstrated ability to triage, debug, and perform on-call and incident management for complex, cross-cutting infrastructure issues
  • Strong communication skills to manage stakeholder alignment and drive cross-team standardization efforts
Job Responsibility
  • Lead the OTI Team: Serve as the technical lead (TL) for the OTI team within PIE-Compute, driving the strategic vision, execution, and long-term stability of the core infrastructure
  • Help Define and Optimize the Testing Ecosystem: Lead the design of the next-generation offline testing architecture to meet diverse team needs, reducing redundancy and siloing across the organization
  • Partner with Test Creation and Test Drive teams to standardize end-to-end test execution and reporting (Creation -> Execution -> Reporting)
  • Refine the full test lifecycle to ensure performance and scalability, and maintain clear attribution of failures to enhance reliability and efficient debugging
  • Own Critical OTI Components and Migrations: Take ownership of the shared OTI components, including maintenance and on-call support
  • Own various offline test modalities, including step code, workflow code, and general health
  • Lead the maintenance and development of common OTI tooling, including launching test evaluations, polling APIs, communicating results, and providing recommended pipeline templates
  • Establish Architecture and Best Practices: Define and enforce data management policies for the testing ecosystem (storage, lifecycling, write strategies, data integrity, and lineage)
  • Define use cases and feature design for new test modalities, including single versus cross-modality testing strategies
  • Manage incidents related to offline tests and maintain Standard Operating Procedures (SOPs) for PRs, local workflows, V&V, and releases
What we offer
  • annual bonus
  • equity compensation
  • benefits
  • Fulltime

Staff Systems Software Engineer, Infrastructure Platform

The Infrastructure Engineering organisation at GM is building a cloud-native pla...
Location
United States, Austin; Mountain View; Warren
Salary
Not provided
General Motors
Expiration Date
Until further notice
Requirements
  • Bachelor's degree in Computer Science or related field, or equivalent work experience
  • 8+ years of software engineering experience with a strong track record of building and operating production distributed systems
  • Deep platform or infrastructure engineering experience, with hands-on work building APIs, schedulers, orchestrators, or similar systems at scale
  • Strong proficiency in Go, with ability to write clean, maintainable, and performant production code for backend services
  • Solid understanding of distributed systems fundamentals including consistency models, failure handling, idempotency, retry patterns, and circuit breakers
  • Experience with cloud-native technologies such as Kubernetes, Nomad, Consul, or similar orchestration and service discovery platforms
  • Strong API design skills with understanding of RESTful patterns, authentication and authorisation models (OIDC, RBAC), versioning strategies, and error handling
  • Deep experience with relational databases, particularly PostgreSQL, including schema design, indexing strategies, query optimisation, and migration management
  • Architectural thinking with ability to evaluate trade-offs, balance simplicity with flexibility, design for current requirements and future growth, and document decisions effectively
  • Strong communication skills with ability to explain complex technical concepts to both engineering and business stakeholders
Job Responsibility
  • Design and implement core platform services including the API gateway, scheduler, lifecycle orchestrator, and synchronisation services using Go and cloud-native patterns
  • Build RESTful APIs with authentication (OIDC, RBAC), authorisation, versioning, and observability, architecting the inventory database system using PostgreSQL for resource metadata, capabilities, and state management
  • Develop intelligent scheduling and orchestration logic that matches workload requirements to resource capabilities with support for automated pooling, reservation modes, and hybrid allocation strategies
  • Build developer CLI tooling and integrate with the control plane, enabling developers to discover, allocate, and manage infrastructure resources through intuitive commands
  • Implement provisioning workflows that coordinate firmware flashing, health checks, power cycling, and resource validation across diverse automotive hardware configurations
  • Collaborate with stakeholders across Infrastructure Engineering, Quality Engineering, and Hardware Infrastructure to understand workflows and integrate with existing systems
  • Lead architectural discussions, conduct code reviews, document technical decisions, and mentor team members on distributed systems patterns and Go development
  • Work with tools and technologies including Go, PostgreSQL, Kubernetes, Nomad, Consul, RESTful APIs with OIDC authentication and RBAC authorisation, Datadog, S3-compatible object storage (MinIO), CI/CD pipelines, and Git/GitHub
What we offer
  • From day one, we're looking out for your well-being–at work and at home–so you can focus on realizing your ambitions
  • Fulltime

Senior Software Engineer - Platform Infrastructure

We are seeking a Senior Software Engineer II to architect, build, and operate se...
Location
United States
Salary
192200.00 - 225810.00 USD / Year
Confluent
Expiration Date
Until further notice
Requirements
  • 6+ years of experience in software engineering, SRE, or security engineering roles, with significant experience operating security platform services
  • Strong backend software development experience (Go, Java, Rust, Python)
  • Expertise with distributed systems, cloud infrastructure (AWS, GCP, Azure), Kubernetes, service mesh, and container orchestration
  • Strong understanding of security domains: IAM, OAuth2, OIDC, PKI, secrets management, policy engines, audit pipelines, zero trust architecture
  • Experience building highly reliable, observable, and resilient production systems
  • Operational expertise: SLOs, SLIs, error budgets, on-call leadership, incident management
  • Strong collaboration skills to drive alignment across engineering, security, and compliance stakeholders
  • Excellent communication skills with ability to influence technical and business leaders
  • BS, MS, or PhD in computer science or a related field, or equivalent work experience
Job Responsibility
  • Architect, design, and develop platform services with a strong focus on scalability, security, and developer experience
  • Lead operational design for reliability: build comprehensive observability, monitoring, and incident response automation into security-critical services
  • Build automation and tooling to drive self-healing systems, proactive risk detection, failure recovery, and continuous resilience testing
  • Collaborate with compliance, governance, and risk teams to translate regulatory and policy requirements into scalable technical controls
  • Lead technical design reviews, security architecture reviews, and incident postmortems for platform-level incidents
  • Mentor engineers across multiple disciplines on both security and operational best practices
  • Own end-to-end delivery of services: from initial design and development through deployment, production hardening, and lifecycle maintenance
What we offer
  • Remote-First Work
  • Robust Insurance Benefits
  • Flexible Time Away
  • The Best Teammates
  • Experience Ambassadors
  • Open and Honest Culture
  • Well-Being and Growth
  • Offers Equity
  • Fulltime

Senior-Staff Software Engineer, Platform Infrastructure

As a Senior Software Engineer on this team, you will help architect, design and ...
Location
United States, San Mateo
Salary
130000.00 - 280000.00 USD / Year
Verkada
Expiration Date
Until further notice
Requirements
  • Must have a BS, MS, or PhD in Computer Science, or similar technical field of study
  • Experience and enthusiasm for learning about new infrastructure products, features, and strategies
  • Comfortable with working at the frontier of infrastructure and software development
  • Experience in Python and/or Go
  • Experience with one of the major cloud platforms (preferably AWS)
  • Strong written and verbal communication skills
Job Responsibility
  • Identify and lead critical efforts related to scalability, reliability and efficiency
  • Influence the features and direction of our platform with your own ideas
  • Provide technical support for engineers on the team
  • Align with product and org objectives, and coordinate with cross-functional teams on delivering key results
What we offer
  • Healthcare programs that can be tailored to meet your personal health and financial well-being needs. Premiums are 100% covered for the employee under at least one plan and 80% for family premiums under all plans
  • Nationwide medical, vision and dental coverage
  • Health Savings Account (HSA) with annual employer contributions and Flexible Spending Account (FSA) with tax saving options
  • Expanded mental health support
  • Paid parental leave policy & fertility benefits
  • Time off to relax and recharge through our paid holidays, firmwide extended holidays, flexible PTO and personal sick time
  • Professional development stipend
  • Fertility stipend
  • Wellness/fitness benefits
  • Healthy lunches provided daily
  • Fulltime

Software Engineer - Infrastructure, Foundry Platform

Software Engineers at Palantir build software at scale to transform how organiza...
Location
United States, New York
Salary
135000.00 - 200000.00 USD / Year
Palantir Technologies
Expiration Date
Until further notice
Requirements
  • Engineering background in Computer Science, Mathematics, Software Engineering, Physics or similar field
  • Strong coding skills with demonstrated proficiency in programming languages, such as Java, C++, Python, Rust, or similar languages
  • Familiarity with storage and data processing systems, cloud infrastructure, and other technical tools
  • Strong written and verbal communication skills and ability to iterate quickly with teammates, incorporating feedback and holding a high bar for quality
  • Active US Security clearance, or eligibility and willingness to obtain a US Security clearance is beneficial, but not necessary
Job Responsibility
  • Building a performant search and indexing ecosystem for complex granularly permissioned data
  • Contributing to open-source data processing libraries, integrating the latest innovations to achieve performance gains
  • Building the distributed systems that power large scale compute workloads, orchestrating and efficiently scheduling hundreds of thousands of containers every hour
  • Designing architecture and opinionated APIs to keep application developers on the happy path
  • Tracing and performance observability in high scale distributed microservice architectures
  • Building reliable, performant, and scalable systems for storage, auth, or asset serving to enable other product teams to build robust applications without deep domain expertise in the underlying systems
  • Automating the deployment, management, and operations of complex distributed systems like Cassandra, Elasticsearch, Kafka, and more across different environments
What we offer
  • Employees (and their eligible dependents) can enroll in medical, dental, and vision insurance as well as voluntary life insurance
  • Employees are automatically covered by Palantir’s basic life, AD&D and disability insurance
  • Commuter benefits
  • Relocation assistance
  • Take-what-you-need paid time off, not accrual based
  • 2 weeks paid time off built into the end of each year (subject to team and business needs)
  • 10 paid holidays throughout the calendar year
  • Supportive leave of absence program including time off for military service and medical events
  • Paid leave for new parents and subsidized back-up care for all parents
  • Fertility and family building benefits including but not limited to adoption, surrogacy, and preservation
  • Fulltime