CrawlJobs Logo

Database Reliability Engineer

Viant

Location Icon

Location:
United States, Irvine

Category Icon
Category:
IT - Software Development

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

130000.00 - 150000.00 USD / Year

Job Description:

We are looking for a skilled and motivated Database Reliability Engineer to join our growing team. In this role, you will support the design, implementation, and day-to-day operations of our database infrastructure across cloud platforms including AWS and Google Cloud Platform (GCP). This position offers a great opportunity to grow technically while contributing to performance, automation, and reliability of our data systems.

Job Responsibility:

  • Database Maintenance and Operations - Maintain database health by managing backups, replication, and routine maintenance tasks across environments (e.g., MySQL, PostgreSQL, SQL Server)
  • Cloud Database Support - Assist with administration of cloud-based databases such as AWS RDS, Aurora, DynamoDB, and Google Cloud SQL, ensuring reliability and performance
  • Monitoring and Alerting - Set up and maintain monitoring and alerting systems using Prometheus and Grafana, as well as cloud-native tools (e.g., CloudWatch, Stackdriver) to proactively detect and resolve database issues
  • Performance Tuning - Collaborate with senior DBAs and developers to optimize queries, indexes, and configurations for better performance
  • Automation and Scripting - Automate recurring tasks using scripts and contribute to deployment pipelines and database change management processes
  • Security and Access Management - Implement role-based access controls, audit trails, and enforce best practices for data security and compliance
  • Documentation and Support - Document database configurations, procedures, and incident reports. Provide support during incidents and collaborate with engineers to troubleshoot issues

Requirements:

  • 2–5 years of experience in database administration in production environments
  • Experience with relational databases such as MySQL, PostgreSQL, or SQL Server
  • Hands-on exposure to AWS (e.g., RDS, Aurora) and/or GCP (e.g., Cloud SQL, BigQuery)
  • Experience with Linux systems and cloud monitoring tools (e.g., CloudWatch, Stackdriver)
  • Proficient in scripting (e.g., Bash, Python) and automation tools
  • Familiar with CI/CD and infrastructure automation (e.g., Terraform, GitHub Actions, Jenkins)
  • Hands-on experience with Grafana and Prometheus for database and infrastructure monitoring
  • Understanding of backup and recovery strategies, replication, and high availability
  • Basic knowledge of performance tuning and monitoring tools (e.g., EverSQL)
  • Strong analytical and troubleshooting skills
  • ability to work independently and collaboratively

Nice to have:

  • Exposure to NoSQL or distributed database systems (e.g., MongoDB, Aerospike)
  • Experience with Git, CI/CD pipelines, and infrastructure as code tools (e.g., Terraform, GitHub Actions)
  • Familiarity with containerized environments (Docker, Kubernetes)
  • Experience working in regulated environments or with data compliance standards
What we offer:
  • fully paid health insurance
  • paid parental leave
  • unlimited PTO

Additional Information:

Job Posted:
December 13, 2025

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Database Reliability Engineer

New

Database Reliability Engineer

The Database Reliability Engineer (DBRE) is responsible for managing, building, ...
Location
Location
United States
Salary
Salary:
120000.00 - 179000.00 USD / Year
pointclickcare.com Logo
PointClickCare
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience working with relational database systems
  • Strong hands-on experience with MySQL (administration, performance tuning, replication, HA/DR)
  • 1+ years in a DBRE or database-focused engineering role
  • Experience working in cloud environments (AWS, GCP, or Azure — Azure preferred)
  • Coding and automation experience (Python, PowerShell, SQL, etc.)
  • Experience with Infrastructure-as-Code tools such as Ansible and Terraform
  • Experience working with source control systems such as Git
  • MySQL experience preferred
  • PostgreSQL is a plus
  • Experience working with VLDBs (1+ TB) and managing large database fleets (100+ instances)
Job Responsibility
Job Responsibility
  • Managing, building, maintaining, monitoring, and troubleshooting the cloud-based MySQL database infrastructure that our mission-critical SaaS application depends on
  • Focuses heavily on automation and coding to reduce operational toil
  • Collaborate closely with Engineering and SRE teams to support new product development and ensure reliable database integration across the platform
  • Work on observability of MySQL database metrics and ensure database performance and reliability objectives are consistently met
  • Work with the DBA team to identify areas of operational toil and implement automations/processes to manage PCC’s MySQL database systems at scale
  • Apply a data-driven approach to performance tuning, availability improvements, and operational optimization
  • Provide database support to Engineering and SRE teams, including review of database migrations, query performance, schema/design improvements, and standardizing MySQL configuration and deployment patterns
  • Assist the DBA team with performance troubleshooting and root-cause analysis
What we offer
What we offer
  • Benefits starting from Day 1!
  • Retirement Plan Matching
  • Flexible Paid Time Off
  • Wellness Support Programs and Resources
  • Parental & Caregiver Leaves
  • Fertility & Adoption Support
  • Continuous Development Support Program
  • Employee Assistance Program
  • Allyship and Inclusion Communities
  • Employee Recognition … and more!
  • Fulltime
Read More
Arrow Right
New

Database Reliability Engineer - Core Team

We are committed to providing our customers with reliable and secure services at...
Location
Location
United Kingdom
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA or customer facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL, particularly ClickHouse is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in Clickhouse Core to identify the root cause of problems and submit bug fixes, issue reports and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core related outages including working with support and Cloud teams to communicate to the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right
New

Database Reliability Engineer

We are committed to providing our customers with reliable and secure services at...
Location
Location
Netherlands
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA or customer facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL, particularly ClickHouse is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in Clickhouse Core to identify the root cause of problems and submit bug fixes, issue reports and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core related outages including working with support and Cloud teams to communicate to the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime
Read More
Arrow Right
New

Database Reliability Engineer

We are committed to providing our customers with reliable and secure services at...
Location
Location
Germany
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA or customer facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL, particularly ClickHouse is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in Clickhouse Core to identify the root cause of problems and submit bug fixes, issue reports and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core related outages including working with support and Cloud teams to communicate to the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right
New

Site Reliability Engineer

About LogRocket: Founded in 2016, LogRocket's goal is to make every experience o...
Location
Location
United States , Boston
Salary
Salary:
135000.00 - 220000.00 USD / Year
logrocket.com Logo
LogRocket
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 4 years of experience as a Site Reliability Engineer, or related job
  • Ability to read and understand product code
  • Familiarity with the state of the art in cloud technologies, including common providers, specific tools of the trade, and their strengths and weaknesses
  • Experience operating applications and databases with demanding scalability or availability requirements
  • Proven expertise in modern container orchestration practices
  • A strong understanding of the performance, architecture, tooling, and cost of cloud systems
  • A security focused mindset with a solid understanding of incident response and risk mitigation
  • A strong collaborator who is transparent about progress on tasks, seeks feedback early and often, works effectively with the team and customers
Job Responsibility
Job Responsibility
  • Improve quality of pager alerts while reducing noise
  • Maintain awareness of engineering initiatives across the organization and monitor their impact on stability, cost, and performance
  • Keep infrastructure up-to-date to take advantage of security patches and new features
  • Improve operational security without sacrificing engineering independence
What we offer
What we offer
  • Catered lunch and an impressive array of your favorite snacks
  • Unlimited vacation policy
  • Health, Dental, Vision benefits, 401k, commuter benefits
  • Generous stock options
  • Regular team outings and activities
  • Fulltime
Read More
Arrow Right

Senior Site Reliability Engineer

This is a role at Baxter where your work impacts saving and sustaining lives thr...
Location
Location
United States , Deerfield
Salary
Salary:
96000.00 - 132000.00 USD / Year
https://www.baxter.com/ Logo
Baxter
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in computer science, IT, or related field (or equivalent experience)
  • Prior experience in Site Reliability Engineering and cloud-based infrastructure management
  • Experience in enterprise engineering, including 24x7 uptime, regulated environments, and planning/operations
  • Azure administration and operations experience, with certifications a plus
  • Knowledge of related technologies, including cloud, encryption, and security protocols
  • Systems administration experience in Windows and Linux environments
  • Proven problem-solving skills and experience with scripting and automation tools
  • Ability to create accurate documentation and reports, with excellent communication skills
  • Applicants must be authorized to work for any employer in the U.S.
  • Unable to sponsor or take over sponsorship of an employment visa at this time.
Job Responsibility
Job Responsibility
  • Drive strategies to ensure 24x7 availability of services and business continuity for customer-facing healthcare software applications and platforms hosted on Microsoft Azure cloud
  • Manage and administer Azure resources, including virtual machines, databases, and networking components
  • Define and document operating procedures to ensure required security, privacy and other compliance standards are maintained for digital solutions deployed in cloud
  • Manage process, planning, and execution for Disaster Recovery (DR) and Business Continuity Planning (BCP)
  • Define and refine Operations SLAs to maintain high level of Customer Satisfaction
  • Establish non-functional requirements to meet SLAs
  • Establish infrastructure and application monitoring dashboards and workflow for automatic routing of notifications
  • Define key performance indicators that can be monitored, measured, and used to derive opportunities
  • Standardize site metrics for stakeholders, reporting on various KPIs including SLAs, availability, capacity utilization, service metrics and cost utilization
  • Work closely with DevOps Engineers to automate infrastructure provisioning and deployment processes.
What we offer
What we offer
  • Support for Parents
  • Continuing Education/Professional Development
  • Employee Health & Well-Being Benefits
  • Paid Time Off
  • 2 Days a Year to Volunteer
  • Medical and dental coverage starting day one
  • Insurance coverage for basic life, accident, short-term and long-term disability
  • Business travel accident insurance
  • Employee Stock Purchase Plan (ESPP)
  • 401(k) Retirement Savings Plan
  • Fulltime
Read More
Arrow Right
New

Senior / Staff Software Engineer (Database)

Our database technology sits at the heart of the Materialize product—a product t...
Location
Location
United States , New York
Salary
Salary:
164050.00 - 250000.00 USD / Year
materialize.com Logo
Materialize
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Several years of experience developing software
  • Passionate about distributed systems and/or databases
  • Excited to learn Rust if not already using it
  • Pride in owning work end-to-end
  • Ability to write clear design docs and well-documented code
  • Love solving hard problems in service of the customer
  • Excited about working at the intersection of frontier academic research and a venture-backed startup
Job Responsibility
Job Responsibility
  • Design and deliver improvements to the Database, with an eye on correctness, reliability, and performance
  • Own projects end-to-end, from early stage design to holding the pager
  • Debug and resolve complex distributed systems issues, sometimes directly with customers
  • Continually improve system reliability, observability, and automation
  • Collaborate across your team, with Product, with Field Eng, and all other stakeholders to align on direction, carefully prioritize, and build the best product for our users
  • Share your work through mentorship, demos, blog posts, and any other relevant channels
What we offer
What we offer
  • Equity
  • Fulltime
Read More
Arrow Right

Database Performance Engineer

We're seeking a Database Performance Engineer to join our Engineering Infrastruc...
Location
Location
United States
Salary
Salary:
Not provided
https://seamless.ai/ Logo
Seamless.AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience as a Database Engineer, Database Administrator (DBA), or Performance Engineer working on PostgreSQL in production
  • Deep understanding of query planning, indexing strategies, and PostgreSQL internals
  • Experience with OpenSearch, Elasticsearch, or other document-based databases (preferred)
  • Strong knowledge of SQL optimization, database partitioning, replication, and caching strategies
  • Hands-on experience with performance profiling, benchmarking tools, and observability stacks
  • Proficiency in scripting or automation tools (Python, Bash, Ansible, Terraform)
  • Familiarity with cloud-based database services (AWS RDS, Aurora, or GCP Cloud SQL)
  • Solid understanding of data security, compliance, and access control principles
  • Excellent problem-solving skills and ability to work in fast-paced, cross-functional environments
  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
Job Responsibility
Job Responsibility
  • Design, implement, and optimize SQL queries, stored procedures, and indexes to achieve low-latency performance
  • Fine-tune PostgreSQL configurations for maximum throughput and minimal resource utilization
  • Collaborate with backend and infrastructure engineers to diagnose and resolve performance issues
  • Manage and optimize large-scale database clusters, including replication, sharding, and partitioning
  • Ensure database uptime, backup, recovery, and disaster recovery objectives (RTO/RPO) are consistently met
  • Set up and monitor performance metrics, alerts, and dashboards (e.g., using pg_stat_statements, Prometheus, Grafana)
  • Conduct capacity planning and load testing to prepare for traffic spikes and data growth
  • Partner with Security and Compliance teams to ensure database access, data encryption, and retention policies meet regulatory standards
  • Develop and maintain documentation for database configurations, maintenance routines, and operational procedures
  • Support query performance reviews and educate developers on best practices for writing efficient SQL
  • Fulltime
Read More
Arrow Right
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.