CrawlJobs Logo

Site Reliability Engineer - FedRAMP

confluent.io Logo

Confluent

Location Icon

Location:
United States , Austin

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

137400.00 - 158000.00 USD / Year

Job Description:

We’re not just building better tech. We’re rewriting how data moves and what the world can do with it. With Confluent, data doesn’t sit still. Our platform puts information in motion, streaming in near real-time so companies can react faster, build smarter, and deliver experiences as dynamic as the world around them. It takes a certain kind of person to join this team. Those who ask hard questions, give honest feedback, and show up for each other. No egos, no solo acts. Just smart, curious humans pushing toward something bigger, together. One Confluent. One Team. One Data Streaming Platform. About the Role: Do you have a passion for data that can turn events into outcomes, enabling intelligent, real-time apps, and empowering teams and systems to be able to act on data instantly? Have you ever dreamt about the opportunity to work with key agencies of the public sector? Confluent's team of Federal Site Reliability Engineers, will allow you to do just that by putting you in the driver seat to deliver highly performant, reliable systems that enable prominent public sector agencies to make real time decisions with their data to solve real time problems through Confluent Cloud. Confluent Cloud delivers a complete end-to-end streaming experience as a Software as a Service (SaaS) model. Tech in Texas! Austin is an early career engineer hub for Confluent. Your role will be a hybrid working model that introduces you to Confluent's culture and enables faster learning, onboarding, and coaching. During the first year of employment, all NCGs globally will be required to participate in the New Grad Onboarding program and go into their Confluent office 2 days (days to be determined) per week.

Job Responsibility:

  • Understand and participate in the changing FedRAMP space by quickly ramping up with the 20x controls and building upon these to maintain federal compliance
  • Own and champion high operational standards of Confluent Cloud systems leveraged by federal agencies
  • Deploy production changes to Confluent Cloud systems and infrastructure through established change management processes
  • Assist with process improvements and adoption of change management
  • Own monitoring and incident handling of complex distributed systems, engaging engineering teams when needed through an escort model system.
  • Act as a core member of Confluents Business Continuity Plan and Disaster Recovery team with efforts across 3 large verticals
  • Innovate and design solutions to reduce toil, bolster operational maturity, and make day-to-day worklife easier.
  • Participate in a 24/7 on-call rotation to maintain the integrity of Confluent Cloud for Government systems

Requirements:

  • 0-2 years of relevant SRE experience
  • Experience in Cloud Native technologies with experience operating production services in the cloud
  • Fundamentals of Distributed Systems and their design
  • Knowledge of Kubernetes and containerization
  • Proficiency in infrastructure as code (Terraform preferred)
  • Experience with telemetry tooling to monitor production systems (DataDog, Grafana, Prometheus)
  • Exposure and understanding of BCP/DR and high availability exercises
  • Ability to quickly problem-solve and troubleshoot critical services
  • Proficiency with scripting and automation (e.g Go, Java, Python, Bash)
  • Exceptional teamwork, collaboration skills, and the ability to act critically with minimal supervision at times in a remote first environment
  • Experience with a rotating on-call schedule to provide 24/7 support
  • BS Degree in Computer Science, Engineering, or equivalent experience
What we offer:
  • Remote-First Work
  • Robust Insurance Benefits
  • Flexible Time Away
  • The Best Teammates
  • Experience Ambassadors
  • Open and Honest Culture
  • Well-Being and Growth
  • Offers Equity

Additional Information:

Job Posted:
January 22, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Site Reliability Engineer - FedRAMP

Lead Site Reliability Engineer

As a Lead Site Reliability Engineer (SRE), you will ensure the stability, perfor...
Location
Location
United States
Salary
Salary:
184000.00 - 229000.00 USD / Year
https://corelight.com/ Logo
Corelight
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience building and operating FedRAMP environments or similarly regulated systems
  • Expertise in AWS services (e.g., EC2, S3, RDS, Lambda, ECS/EKS, Glue, EMR, Redshift, OpenSearch, VPC)
  • Deep understanding of the FedRAMP framework, controls, and compliance requirements
  • Proficiency in programming languages such as Python, Go, or Java
  • Experience with big data technologies (Hadoop, Spark, Kafka)
  • Strong skills in Infrastructure as Code (IaC) tools like Terraform, CloudFormation, or Ansible
  • Knowledge of containerization and orchestration tools like Docker and Kubernetes
  • Experience with CI/CD tools such as Jenkins, GitLab CI, or CircleCI
  • Proven track record in building and scaling platforms with high availability, resilience, and strict SLO objectives
  • Strong experience with Unix/Linux systems and cloud providers, ideally AWS
Job Responsibility
Job Responsibility
  • Collaborate with software engineering teams to ensure the reliability, performance, and security of the Federal region’s infrastructure
  • Design, implement, and manage FedRAMP-compliant infrastructure and systems
  • Establish continuous monitoring, logging, and auditing processes to ensure compliance with FedRAMP controls
  • Partner with security teams to conduct security assessments and implement necessary controls
  • Design and implement scalable infrastructure solutions that support multi-region growth
  • Drive automation efforts, enabling infrastructure and platforms to scale efficiently with a focus on compliance
  • Stay up-to-date on best practices, evolving security threats, and FedRAMP guidelines to maintain a strong security posture
  • Deploy and maintain cloud-native services in AWS that are resilient and elastic
  • Participate in 24x7 incident response and on-call rotations
  • Plan for capacity and work with teams to prepare for platform growth
What we offer
What we offer
  • Equity and additional benefits will also be awarded
  • Fulltime
Read More
Arrow Right

Senior Site Reliability Engineer

Are you ready to start a new journey with a team of energized professionals adva...
Location
Location
Australia , North Sydney; Perth; Brisbane
Salary
Salary:
Not provided
bentley.com Logo
Bentley Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Degree in computer science, software engineering or relevant training and/or experience
  • +8 years of experience with Cloud Services development, deployment and/or IT Cloud infrastructure setup and maintenance (Azure Cloud or AWS or GCP)
  • Expertise in containerization and orchestration technologies (Docker, Kubernetes)
  • Experience with Scripting and automation skills using languages like PowerShell, Bash, Ansible, JavaScript or similar
  • Programming experience, preferably in a high-level language like C#, Python, Golang, Ruby, or equivalent
  • Knowledge of AD and DNS, IIS, and networking
  • Experience with FedRamp background screening
  • Experience with Azure DevOps (Pipelines, YAML) or GitHub enterprise (Git, Actions)
  • Good knowledge of Microsoft SQL Server/Azure SQL setup, SQL statements/scripts and troubleshooting
  • Ability to document architectural designs along with operational processes and procedures to support ongoing administration of cloud systems
Job Responsibility
Job Responsibility
  • Manage, implement, and improve automation (CI/CD Infrastructure) and tooling through Azure DevOps, scripting, developing tools and proprietary systems
  • Automate Azure cloud-based deployments, resource provisioning and other Azure infrastructure related tasks
  • Troubleshoot and resolve issues related to application development, deployment, and operations
  • Dive deep into availability, performance and outages for infrastructure and systems, and provide technical leadership for proactive resolutions
  • Ensure compliance with industry’s best practices and organizational policies
  • Continuously improving processes and tools to enhance efficiency and productivity
  • Maintain monitoring and alerting and participate as a member of a rotating on-call schedule
  • Share on-call responsibilities, including collaborating with other engineers to triage and fix issues that come up in production for our users
What we offer
What we offer
  • A great Team and culture
  • An exciting career as an integral part of a world-leading software company providing solutions for architecture, engineering, and construction
  • An attractive salary and benefits package
  • A commitment to inclusion, belonging and colleague wellbeing through global initiatives and resource groups
  • A company committed to making a real difference by advancing the world’s infrastructure for better quality of life, where your contributions help build a more sustainable, connected, and resilient world
Read More
Arrow Right

Senior Site Reliability Engineer

Are you ready to start a new journey with a team of energized professionals adva...
Location
Location
Australia , North Sydney; Perth; Brisbane
Salary
Salary:
Not provided
bentley.com Logo
Bentley Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Degree in computer science, software engineering or relevant training and/or experience
  • +8 years of experience with Cloud Services development, deployment and/or IT Cloud infrastructure setup and maintenance (Azure Cloud or AWS or GCP)
  • Expertise in containerization and orchestration technologies (Docker, Kubernetes)
  • Experience with Scripting and automation skills using languages like PowerShell, Bash, Ansible, JavaScript or similar
  • Programming experience, preferably in a high-level language like C#, Python, Golang, Ruby, or equivalent
  • Knowledge of AD and DNS, IIS, and networking
  • Experience with FedRamp background screening
  • Experience with Azure DevOps (Pipelines, YAML) or GitHub enterprise (Git, Actions)
  • Good knowledge of Microsoft SQL Server/Azure SQL setup, SQL statements/scripts and troubleshooting
  • Ability to document architectural designs along with operational processes and procedures to support ongoing administration of cloud systems
Job Responsibility
Job Responsibility
  • Manage, implement, and improve automation (CI/CD Infrastructure) and tooling through Azure DevOps, scripting, developing tools and proprietary systems
  • Automate Azure cloud-based deployments, resource provisioning and other Azure infrastructure related tasks
  • Troubleshoot and resolve issues related to application development, deployment, and operations
  • Dive deep into availability, performance and outages for infrastructure and systems, and provide technical leadership for proactive resolutions
  • Ensure compliance with industry’s best practices and organizational policies
  • Continuously improving processes and tools to enhance efficiency and productivity
  • Maintain monitoring and alerting and participate as a member of a rotating on-call schedule
  • Share on-call responsibilities, including collaborating with other engineers to triage and fix issues that come up in production for our users
What we offer
What we offer
  • A great Team and culture
  • An exciting career as an integral part of a world-leading software company providing solutions for architecture, engineering, and construction
  • An attractive salary and benefits package
  • A commitment to inclusion, belonging and colleague wellbeing through global initiatives and resource groups
  • A company committed to making a real difference by advancing the world’s infrastructure for better quality of life, where your contributions help build a more sustainable, connected, and resilient world
Read More
Arrow Right
New

Site Reliability Engineer (FedRAMP / Security) - NY

Coralogix is a modern, full-stack observability platform transforming how busine...
Location
Location
United States , New York
Salary
Salary:
170000.00 - 220000.00 USD / Year
coralogix.com Logo
Coralogix
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 5 years of experience as a DevOps Engineer/ SRE in production environments
  • In-depth experience with Kubernetes - operating & monitoring are key parts
  • At least 2 years of experience Experience with FedRAMP compliance (High/Moderate levels), vulnerability management, and continuous monitoring, including scanning, patching, and reporting - advantage
  • High familiarity with monitoring tools such as Coralogix, Grafana, Prometheus
  • Experience in AWS or other cloud providers
  • Experience with infrastructure as a code (Terraform, Crossplane, etc.)
  • Understanding of networking - from networking layers to different networking protocols (http, grpc, ssl)
  • Some software engineering experience, preferably in Golang
  • An advantage - operating data pipelines
  • An advantage - familiarity with Apache Kafka
Job Responsibility
Job Responsibility
  • Work in high scale environments - Coralogix data pipeline processes 55Tb of data each day
  • Adopt cutting edge technologies with end-to-end responsibility
  • Building internal tools to expand our platform capabilities
  • Collaborate with R&D to improve stability & reliability of the system
  • Lead the product roadmap - our product is designed for engineers
  • Perform operational duties for FedRAMP cloud products, including deployments, on-call support, and incident management
What we offer
What we offer
  • Comprehensive and inclusive employee benefits for healthcare, dental, and mental health benefits
  • 401(k) plan and match
  • Paid sick time and paid time off
  • Fulltime
Read More
Arrow Right
New

Principal Site Reliability Engineer (DNS Security)

We are seeking development-heavy Site Reliability Engineers (SREs) who are passi...
Location
Location
United States , Santa Clara
Salary
Salary:
151600.00 - 245300.00 USD / Year
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or higher degree in Computer Science, Engineering, or related field or equivalent military experience required
  • 6+ years of experience in DevOps, SRE, or related roles
  • Cloud Experiences: GCP/AWS/OCI/Azure
  • Container Docker, Kubernetes operational experiences
  • Knowledge of TCP/IP, DNS, HTTP, GRPC
  • Proven experience in designing, implementing, and maintaining scalable and reliable infrastructure
  • Strong proficiency in automation scripting and infrastructure as code (IaC)
  • Excellent problem-solving skills and the ability to troubleshoot complex issues
  • Effective communication skills, both written and verbal
  • Experience working in collaborative, cross-functional environments
Job Responsibility
Job Responsibility
  • Build Terraform to deploy infrastructures and services to multiple cloud platforms
  • Build automation for provisioning and operating infrastructure at a massive scale using Python or Go code
  • Work with Dev/QA teams to build pipelines and automation for delivering and deploying applications to production
  • Build observation (logging, metrics, alerting) systems to make sure system works well
  • Design and implement the infrastructure to ensure applications align with infrastructure requirements, focusing on scalability and reliability
  • Collaborate with PMs to deliver compliances (SOC2, Fedramp, IL5) and establish a vision for continuous improvement
  • On-call Support and Incident Resolution
  • Participate in occasional on-call rotations to support the infrastructure
  • Investigate incidents, formulate hypotheses, and identify root causes to solve issues promptly
  • Write postmortem reviews and provide remediation recommendations
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer II - FedRAMP

Trimble is seeking a Site Reliability Engineer to join their world class and glo...
Location
Location
India , Chennai
Salary
Salary:
Not provided
trimble.com Logo
Trimble Inc.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree or equivalent in Computer Science, Engineering or related field or equivalent experience
  • Recent college graduate or one year of experience in IT operations, including knowledge of networking, computing and storage
  • Experience with AWS and/or Azure public cloud
  • Windows system administration familiarity and scripting skills, such as Python, Powershell
  • Linux system administration familiarity and scripting skills, including Bash and Perl
  • Familiarity with application operations, including Incident Management, Change Management, and Capacity Management
  • Excellent written and verbal communication
  • Troubleshooting and problem solving skills
  • Strong desire to learn new things
Job Responsibility
Job Responsibility
  • Responsible for configuration, optimization, documentation and support of the infrastructure components of software products which are hosted primarily in cloud services (AWS and Azure)
  • Perform day-to-day server application management, monitoring, incident response/resolution and working with the customer application development and technical support teams to establish effective application monitoring and to identify application changes to improve operations
  • Develop new and enhance current shared public cloud services with consideration for Availability, Operations, Performance, Capacity, Security, and User Experience
  • Responsible for management of security posture and adherence to corporate security best practices
  • Develop and maintain documentation including but not limited to architecture diagrams, service descriptions, build and deploy documentation and operations run book documentation
  • Provide design and deployment assistance for divisions needing help on a project basis
  • Manage AWS & FedRAMP best practice expectations (incorporating Trimble Cloud Core Platform standards)
  • Work with a global team and are able to occasionally meet or perform tasks off-hours
Read More
Arrow Right
New

Site Reliability Engineer (FedRAMP / Security) - CA

Coralogix is a modern, full-stack observability platform transforming how busine...
Location
Location
United States , Los Angeles
Salary
Salary:
170000.00 - 220000.00 USD / Year
coralogix.com Logo
Coralogix
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 5 years of experience as a DevOps Engineer/ SRE in production environments
  • In-depth experience with Kubernetes - operating & monitoring are key parts
  • At least 2 years of experience Experience with FedRAMP compliance (High/Moderate levels), vulnerability management, and continuous monitoring, including scanning, patching, and reporting - advantage
  • High familiarity with monitoring tools such as Coralogix, Grafana, Prometheus
  • Experience in AWS or other cloud providers
  • Experience with infrastructure as a code (Terraform, Crossplane, etc.)
  • Understanding of networking - from networking layers to different networking protocols (http, grpc, ssl)
  • Some software engineering experience, preferably in Golang
  • An advantage - operating data pipelines
  • An advantage - familiarity with Apache Kafka
Job Responsibility
Job Responsibility
  • Work in high scale environments - Coralogix data pipeline processes 55Tb of data each day
  • Adopt cutting edge technologies with end-to-end responsibility
  • Building internal tools to expand our platform capabilities
  • Collaborate with R&D to improve stability & reliability of the system
  • Lead the product roadmap - our product is designed for engineers. Therefore, our engineers promote, enhance, and take a crucial part in influencing the product roadmap
  • Perform operational duties for FedRAMP cloud products, including deployments, on-call support, and incident management
What we offer
What we offer
  • Healthcare
  • Dental
  • Mental health benefits
  • 401(k) plan and match
  • Paid sick time
  • Paid time off
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer - FedRAMP

We’re not just building better tech. We’re rewriting how data moves and what the...
Location
Location
Canada , Toronto
Salary
Salary:
113200.00 - 130200.00 CAD / Year
confluent.io Logo
Confluent
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 0-2 years of relevant SRE experience
  • Experience in Cloud Native technologies with experience operating production services in the cloud
  • Fundamentals of Distributed Systems and their design
  • Knowledge of Kubernetes and containerization
  • Proficiency in infrastructure as code (Terraform preferred)
  • Experience with telemetry tooling to monitor production systems (DataDog, Grafana, Prometheus)
  • Exposure and understanding of BCP/DR and high availability exercises
  • Ability to quickly problem-solve and troubleshoot critical services
  • Proficiency with scripting and automation (e.g Go, Java, Python, Bash)
  • Exceptional teamwork, collaboration skills, and the ability to act critically with minimal supervision at times in a remote first environment
Job Responsibility
Job Responsibility
  • Understand and participate in the changing FedRAMP space by quickly ramping up with the 20x controls and building upon these to maintain federal compliance
  • Own and champion high operational standards of Confluent Cloud systems leveraged by federal agencies
  • Deploy production changes to Confluent Cloud systems and infrastructure through established change management processes
  • Assist with process improvements and adoption of change management
  • Own monitoring and incident handling of complex distributed systems, engaging engineering teams when needed through an escort model system
  • Act as a core member of Confluents Business Continuity Plan and Disaster Recovery team with efforts across 3 large verticals
  • Innovate and design solutions to reduce toil, bolster operational maturity, and make day-to-day worklife easier
  • Participate in a 24/7 on-call rotation to maintain the integrity of Confluent Cloud for Government systems
What we offer
What we offer
  • Remote-First Work
  • Robust Insurance Benefits
  • Flexible Time Away
  • The Best Teammates
  • Experience Ambassadors
  • Open and Honest Culture
  • Well-Being and Growth
  • Offers Equity
  • Fulltime
Read More
Arrow Right