CrawlJobs Logo

Infra & Database Engineer

India, Coimbatore · Job Posted March 05, 2026
Apply Position
Job Link Share

Job Description

We are seeking a senior-level Infrastructure & Database Engineer with deep expertise across Windows Server administration and SQL Server database management. This is a high-ownership, production-critical role supporting a regulated healthcare and workers’ compensation environment operating in a hybrid (on-premises + Azure) infrastructure model. The ideal candidate is equally strong in Windows infrastructure operations and SQL Server database administration, capable of independently managing stability, performance, patching, automation, monitoring, and compliance across both layers of the stack. This is not a ticket-driven support role. It is a proactive, engineering-focused position requiring technical depth, structured documentation discipline, and operational ownership.

Job Responsibility

  • Administer and maintain enterprise Windows Server environments (2016/2019/2022)
  • Manage OS patching lifecycle including security updates, cumulative updates, emergency remediation, and rollback planning
  • Own infrastructure monitoring via PRTG and Azure Monitor — configure sensors, dashboards, and proactive alerting
  • Manage Azure VMs and hybrid infrastructure integrations
  • Perform system health analysis, capacity planning, and reliability improvements
  • Support virtualization technologies (VMware, Citrix or equivalent)
  • Automate operational tasks using PowerShell scripting
  • Maintain infrastructure documentation including runbooks, SOPs, escalation matrices, and RACI definitions
  • Administer and support SQL Server 2016+ production environments
  • Perform advanced performance tuning (execution plans, indexing strategy, statistics management, wait analysis)
  • Execute core DBA functions: backups, restore validation, integrity checks, index maintenance, log management, and security reviews
  • Monitor SQL Server and DB2 environments using Redgate, PRTG, Azure dashboards, and native tooling
  • Manage database deployments using Azure DevOps pipelines, DACPACs, DB Maestro, and scripted releases
  • Configure and maintain database monitoring dashboards and alerting frameworks
  • Conduct database capacity planning and proactive health assessments
  • Support database compliance in regulated environments
  • Build and continuously improve runbooks and knowledge documentation
  • Operate in structured change management processes
  • Provide root cause analysis for recurring incidents
  • Define and maintain clear escalation paths for infrastructure and database issues
  • Collaborate with development, application, and infrastructure teams to ensure production stability

Requirements

  • Bachelor’s degree in computer science, Information Technology, Engineering, or equivalent experience
  • 8+ years of combined hands-on experience in Windows Server administration and SQL Server database administration
  • Strong production experience in hybrid infrastructure environments (on-premises + Azure)
  • Proven track record managing enterprise patching and database performance tuning at scale
  • Experience operating in regulated industries (healthcare, insurance, financial services) preferred
  • Windows Server 2016/2019/2022 administration
  • WSUS / SCCM / MECM patch management
  • PowerShell automation
  • Azure VM administration and monitoring
  • PRTG configuration (sensors, alerts, dashboards)
  • Virtualization platforms (VMware / Citrix)
  • SQL Server 2016+ administration
  • Advanced query optimization and execution plan analysis
  • T-SQL scripting and troubleshooting
  • Redgate SQL Monitor / SQL Compare
  • Azure DevOps for database CI/CD
  • DB Maestro or equivalent database change automation tool
  • Working knowledge of IBM DB2

Nice to have

  • Microsoft Certified: Windows Server
  • Microsoft Certified: Azure Administrator
  • Microsoft Certified: Azure Database Administrator
  • SQL Server certifications
  • ITIL Foundation (nice to have)

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Infra & Database Engineer

8 matching positions

Database Engineer

Scale is powering this generative AI wave by providing the data and infrastructu...
Location
Location
United States , San Francisco, CA; New York, NY
Salary
Salary:
162400.00 - 203000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of industry experience as a database engineer post graduation
  • Engineering experience with building real-time and distributed system architecture
  • Experience designing and self hosting databases on industry standard public cloud platforms
  • Deep familiarity with design, architecture, optimization, and tuning multiple database platforms such as MongoDB, Postgres, MySQL, DynamoDB, Redis
  • Deep familiarity with SQL query optimization, database indexing, scalability (partitioning/sharding), and replication
  • Experience developing and optimizing backup and restore functionality to meet RTO goals
  • Intermediate experience in at least one coding language: Typescript, Python, Go, Java, C++
  • Experience working with Docker, Kubernetes, and Infra-as-Code (e.g. Terraform)
  • bonus points for experience supporting GPU/ML workloads
Job Responsibility
Job Responsibility
  • Build and mature database foundations for Scale, leveraging industry-standard platforms
  • Collaborate with stakeholders across the organization, such as software developers, platform engineers, machine learning scientists, customer operations, etc
  • Own services or systems and define their long-term health goals, while also improving the health of surrounding components
  • Mentor other engineers and become deeply involved in architectural design and database best-practices
  • Improve our high engineering standards, tooling, and process
  • Work directly with our engineering and sales teams to create backend database solutions to meet their challenging data and security needs
  • Work with our Security Team on security compliance, pen tests and mitigations that improve security across Scale
  • Build systems capable of handling millions of frames of data every day, making it available to both our workforce and our internal teams with high availability
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • commuter stipend
  • Fulltime
Read More
Arrow Right

Staff Software Engineer - AI/ML Infra

GEICO AI platform and Infrastructure team is seeking an exceptional Senior ML Pl...
Location
Location
United States , Chevy Chase; New York City; Palo Alto
Salary
Salary:
115000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, Engineering, or related technical field (or equivalent experience)
  • 8+ years of software engineering experience with focus on infrastructure, platform engineering, or MLOps
  • 3+ years of hands-on experience with machine learning infrastructure and deployment at scale
  • 2+ years of experience working with Large Language Models and transformer architectures
  • Proficient in Python
  • strong skills in Go, Rust, or Java preferred
  • Proven experience working with open source LLMs (Llama 2/3, Qwen, Mistral, Gemma, Code Llama, etc.)
  • Proficient in Kubernetes including custom operators, helm charts, and GPU scheduling
  • Deep expertise in Azure services (AKS, Azure ML, Container Registry, Storage, Networking)
  • Experience implementing and operating feature stores (Chronon, Feast, Tecton, Azure ML Feature Store, or custom solutions)
Job Responsibility
Job Responsibility
  • Design and implement scalable infrastructure for training, fine-tuning, and serving open source LLMs (Llama, Mistral, Gemma, etc.)
  • Architect and manage Kubernetes clusters for ML workloads, including GPU scheduling, autoscaling, and resource optimization
  • Design, implement, and maintain feature stores for ML model training and inference pipelines
  • Build and optimize LLM inference systems using frameworks like vLLM, TensorRT-LLM, and custom serving solutions
  • Ensure 99.9%+ uptime for ML platforms through robust monitoring, alerting, and incident response procedures
  • Design and implement ML platforms using DataRobot, Azure Machine Learning, Azure Kubernetes Service (AKS), and Azure Container Instances
  • Develop and maintain infrastructure using Terraform, ARM templates, and Azure DevOps
  • Implement cost-effective solutions for GPU compute, storage, and networking across Azure regions
  • Ensure ML platforms meet enterprise security standards and regulatory compliance requirements
  • Evaluate and potentially implement hybrid cloud solutions with AWS/GCP as backup or specialized use cases
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Staff Software Engineer - AI/ML Infra

GEICO AI platform and Infrastructure team is seeking an exceptional Senior ML Pl...
Location
Location
United States , Palo Alto
Salary
Salary:
90000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, Engineering, or related technical field (or equivalent experience)
  • 8+ years of software engineering experience with focus on infrastructure, platform engineering, or MLOps
  • 3+ years of hands-on experience with machine learning infrastructure and deployment at scale
  • 2+ years of experience working with Large Language Models and transformer architectures
  • Proficient in Python
  • strong skills in Go, Rust, or Java preferred
  • Proven experience working with open source LLMs (Llama 2/3, Qwen, Mistral, Gemma, Code Llama, etc.)
  • Proficient in Kubernetes including custom operators, helm charts, and GPU scheduling
  • Deep expertise in Azure services (AKS, Azure ML, Container Registry, Storage, Networking)
  • Experience implementing and operating feature stores (Chronon, Feast, Tecton, Azure ML Feature Store, or custom solutions)
Job Responsibility
Job Responsibility
  • Design and implement scalable infrastructure for training, fine-tuning, and serving open source LLMs (Llama, Mistral, Gemma, etc.)
  • Architect and manage Kubernetes clusters for ML workloads, including GPU scheduling, autoscaling, and resource optimization
  • Design, implement, and maintain feature stores for ML model training and inference pipelines
  • Build and optimize LLM inference systems using frameworks like vLLM, TensorRT-LLM, and custom serving solutions
  • Ensure 99.9%+ uptime for ML platforms through robust monitoring, alerting, and incident response procedures
  • Design and implement ML platforms using DataRobot, Azure Machine Learning, Azure Kubernetes Service (AKS), and Azure Container Instances
  • Develop and maintain infrastructure using Terraform, ARM templates, and Azure DevOps
  • Implement cost-effective solutions for GPU compute, storage, and networking across Azure regions
  • Ensure ML platforms meet enterprise security standards and regulatory compliance requirements
  • Evaluate and potentially implement hybrid cloud solutions with AWS/GCP as backup or specialized use cases
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Database operations engineer

Shape the future of payments as a database operations engineer in fintech. A fin...
Location
Location
Japan , Tokyo
Salary
Salary:
7000000.00 - 9000000.00 JPY / Year
https://www.randstad.com Logo
Randstad
Expiration Date
February 17, 2027
Flip Icon
Requirements
Requirements
  • Hands-on experience with MySQL operation
  • More than 1.5 years of AWS experience
  • Hands-on experience with Infra as Code (IaC)
  • Interested in new technologies and willing to improve operational quality and efficiency
  • Familiar with shell and programming language for scripting (Python, Go etc.)
Job Responsibility
Job Responsibility
  • Design the total UI/UX for the product experience in our payment application (Design the payment experience)
  • Design the persona of our users, and research by using the storyboard for it
  • Define the UI architecture and design the concept diagram, wire frame and virtual design
  • Promote the projects
  • Have a broad range of tasks related to the product design such as holding the splint and design workshop for the product development etc.
What we offer
What we offer
  • Language Learning support
  • Translation/Interpretation support
  • VISA sponsor + Relocation support
  • Health insurance
  • Workers' compensation insurance
  • Fulltime
Read More
Arrow Right

Software Engineer, Distributed Systems - Infra

You'll build and scale the application and data infrastructure that supports 70M...
Location
Location
United States , San Francisco
Salary
Salary:
180000.00 - 275000.00 USD / Year
gamma.app Logo
Gamma
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3–5+ years of backend engineering experience building scalable systems
  • Strong proficiency in backend technologies (Node.js, Python, or similar) and databases (PostgreSQL, Redis)
  • Experience with high-traffic production systems and performance optimization
  • Track record shipping high-quality, complex applications under tight timelines
  • Product-minded approach with understanding of how technical decisions impact user experience and business metrics
  • Thrives in fast-paced, product-led environments where shipping quality directly impacts growth
  • Experience with real-time collaboration systems, event pipelines, or AI-powered applications (Nice to have)
Job Responsibility
Job Responsibility
  • Design and implement scalable APIs, distributed systems, and data infrastructure that serve millions of users
  • Help define and evolve the core data model and storage systems powering Gamma's business
  • Ship backend systems that directly impact growth metrics and user experience
  • Work on real-time collaborative editing, databases, public APIs, and high-volume event pipelines
  • Balance long-term technical investments with rapid shipping velocity
  • Collaborate across frontend, product, and data teams to deliver high-quality solutions under tight timelines
What we offer
What we offer
  • Equity
  • Fulltime
Read More
Arrow Right

Senior Engineer – (Systems Engineering, Enterprise Infra & Platform Support)

The Senior Infrastructure & Platform Support Engineer provides end-to-end techni...
Location
Location
United States , Chevy Chase
Salary
Salary:
80000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience leading engineering efforts or owning internal, enterprise-scale platforms and working directly with enterprise customers
  • Familiarity with enterprise application lifecycle (selection, deployment, user adoption, decommission, integration layers)
  • Strong background in enterprise infrastructure supporting Windows and Linux systems, including builds, configuration, hardening, and troubleshooting
  • Advanced skills with Windows Server, Active Directory, authentication protocols (Kerberos / LDAP / SAML / OAuth), and Azure AD/identity integrations
  • Solid Linux administration experience (Ubuntu, RHEL, or equivalent), with certifications preferred
  • Proficiency in automation and scripting (PowerShell, Bash, Python)
  • Strong understanding of networking fundamentals: TCP/IP, DNS, DHCP, routing, VPNs, firewalls, load balancers, VLANs, and secure connectivity
  • Hands-on experience with cloud platforms (Azure/AWS), hybrid environments, virtualization (vSphere/Hyper-V), and containers (Docker, Kubernetes)
  • Knowledge of monitoring and observability tools, such as Prometheus, Grafana, or equivalent solutions
  • Familiarity with database concepts, performance tuning, and integration of MySQL/PostgreSQL/SQL Server/Oracle with enterprise systems
Job Responsibility
Job Responsibility
  • Provide technical leadership to ensure strong engineering standards and operational excellence
  • Support, configure, and maintain both Linux and Windows server platforms, including application servers, integration components, and system services
  • Design and implement infrastructure solutions for workplace technologies including but not limited to digital mailroom, physical security & safety, and real estate facility management technology platforms—covering on-prem systems, hybrid setups, and SaaS applications
  • Build production-ready configurations emphasizing reliability, maintainability, scalability, and testability
  • Lead incident response, troubleshooting, root-cause analysis, and drive ongoing performance optimization
  • Execute DevOps activities including CI/CD pipeline management, automation scripting, monitoring setup, and Infrastructure as Code
  • Ensure platform observability through logging, alerting, dashboards, and automated health checks
  • Apply secure design practices, compliance controls, network segmentation, encryption, and access management
  • Manage platform lifecycle activities such as patching, upgrades, capacity planning, backups, disaster recovery and identifying opportunities for automation and standardization
  • Collaborate with cross-functional teams, vendors, and senior engineers, communicating clearly with technical and non-technical stakeholders
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Software Engineer II (Backend, Healthcare Infra)

As a Software Engineer II on the Healthcare Infra team, you will help build and ...
Location
Location
United States , Boston
Salary
Salary:
125000.00 - 170000.00 USD / Year
whoop.com Logo
Whoop
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Professional experience in backend development, with a strong foundation in object-oriented programming, API design, and relational databases (RESTful APIs, Postgres)
  • Familiarity with asynchronous processing systems (Kafka, SQS)
  • Experience writing automated tests and documenting code for a variety of audiences
  • A passion for approaching large-scale problems guided by data-driven insights and a commitment to agile, iterative development
  • A proactive, collaborative team player, eager to take on new challenges, continuously learn, and adapt in a fast-paced, data-informed environment
Job Responsibility
Job Responsibility
  • Contribute to engineering efforts within a cross-functional team, collaborating with designers, product managers, other engineers, and our Digital Health team to refine and advance the WHOOP platform
  • Develop and maintain robust backend services using Java, Kafka, Postgres, and other AWS technologies, ensuring stability and performance
  • Contribute to the ideation, technical design, and implementation of new features and platforms, transforming complex requirements into reliable, scalable solutions
  • Work on scaling challenges that span multiple systems and demand high availability and reliability
  • Write clean, testable, and maintainable code, while participating in code reviews and documentation practices
What we offer
What we offer
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer

As a Staff Software Engineer, you will play a key role in designing, building, a...
Location
Location
United States , San Jose
Salary
Salary:
120500.00 - 243000.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum of 5 years of hands-on experience in Infra Ops, Dev Ops, or Site Reliability Engineering (SRE)
  • Proficiency with Linux systems, especially Debian-based distributions
  • Strong experience with cloud platforms such as AWS and GCP
  • Expertise in Infrastructure as Code tools like Terraform, Packer, and Ansible
  • Solid programming skills in Python and/or Golang
  • Deep understanding of containerization (Docker, Container) and orchestration tools (AWS EKS, GCP GKE)
  • Experience with GitOps workflows
  • Proven track record in implementing and maintaining CI/CD pipelines
  • Strong background in security and familiarity with security programs
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK)
Job Responsibility
Job Responsibility
  • Enhance Infrastructure as Code (IAC) and enforce best practices
  • Optimize cloud infrastructure for scalability, security, and cost-effectiveness
  • Develop internal tools to support and streamline cloud platform operations
  • Improve CI/CD pipelines and deployment workflows using FluxCD and Jenkins
  • Address container image vulnerabilities and standardize remediation processes
  • Build Amazon Machine Images (AMIs) aligned with CIS and STIG benchmarks
  • Strengthen monitoring, alerting, and observability using Prometheus, Grafana, and logging tools
  • Troubleshoot complex production issues to ensure system reliability and customer satisfaction
  • Fine-tune distributed systems such as Apache Kafka and Cassandra
  • Collaborate with development, security, and operations teams to align infrastructure with application needs
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right