CrawlJobs Logo

Kafka Infrastructure Engineer

citizensbank.com Logo

Citizens Bank

Location Icon

Location:
United States , Phoenix

Category Icon

Job Type Icon

Contract Type:
Employment contract

Salary Icon

Salary:

125000.00 - 155000.00 USD / Year

Job Description:

We are seeking a highly skilled and motivated Senior Infrastructure Engineer to join our Enterprise Data Engineering team. This full-time role is ideal for candidates with hands-on experience in infrastructure technologies, Apache Kafka including Confluent Kafka and Kafka Streams, MQ, SQL and NoSQL databases, and cloud engineering. If you are passionate about building efficient, scalable platforms, we would love to hear from you.

Job Responsibility:

  • Lead a team of engineers and technicians in monitoring, diagnosing, and resolving infrastructure issues using event-based management
  • Administer and troubleshoot Kafka clusters, including configuration and performance tuning
  • Integrate Kafka with various systems using connectors such as MQ, MongoDB, Oracle, SQL Server, PostgreSQL, and MySQL
  • Automate infrastructure setup across environments using Terraform
  • Provide senior-level support and troubleshooting across a wide range of technologies
  • Collaborate within agile teams to drive modern development practices and product vision
  • Serve as the primary point of contact for daily operations and incident management
  • Conduct proactive monitoring to identify and mitigate potential service disruptions
  • Document actions, create reports, and establish escalation procedures
  • Audit support tickets to identify patterns and reduce downtime
  • Ensure compliance with internal policies and industry standards
  • Develop and maintain runbooks and playbooks for operational excellence

Requirements:

  • 7 or more years of experience in Kafka administration on-prem and cloud, messaging systems, and database integration
  • Proficiency with cloud platforms such as AWS, GCP, or Azure, event-driven architecture, DevOps, and containerization
  • Experience deploying Kafka clients and brokers in production and disaster recovery environments
  • Proven ability to scale Kafka clusters and connector infrastructure
  • Hands-on experience building real-time data pipelines using Kafka producers and Spark Streaming consumers
  • Familiarity with monitoring tools such as Splunk, Datadog, and Grafana
  • Strong knowledge of source control systems such as SVN and Git
  • Solid understanding of networking protocols, operating systems, and diagnostic tools
  • Proficiency in scripting languages such as PowerShell, Bash, Python, and Perl
  • Strong analytical and decision-making skills, even with limited information
  • Ability to work independently and track issues to identify trends
  • Excellent communication and customer service skills
  • Sound judgment and autonomy in handling both emergency and routine situations
  • Experience collaborating with technical teams for issue resolution
  • Bachelor's degree in Computer Science, Computer Engineering, Electronics Engineering

Nice to have:

  • Experience with monitoring tools
  • Working experience in the financial services industry
What we offer:
  • Medical, dental and vision coverage
  • Retirement benefits
  • Maternity/paternity leave
  • Flexible work arrangements
  • Education reimbursement
  • Wellness programs

Additional Information:

Job Posted:
May 16, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Kafka Infrastructure Engineer

Software Engineer, Data Infrastructure

The Data Infrastructure team at Figma builds and operates the foundational platf...
Location
Location
United States , San Francisco; New York
Salary
Salary:
149000.00 - 350000.00 USD / Year
figma.com Logo
Figma
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of Software Engineering experience, specifically in backend or infrastructure engineering
  • Experience designing and building distributed data infrastructure at scale
  • Strong expertise in batch and streaming data processing technologies such as Spark, Flink, Kafka, or Airflow/Dagster
  • A proven track record of impact-driven problem-solving in a fast-paced environment
  • A strong sense of engineering excellence, with a focus on high-quality, reliable, and performant systems
  • Excellent technical communication skills, with experience working across both technical and non-technical counterparts
  • Experience mentoring and supporting engineers, fostering a culture of learning and technical excellence
Job Responsibility
Job Responsibility
  • Design and build large-scale distributed data systems that power analytics, AI/ML, and business intelligence
  • Develop batch and streaming solutions to ensure data is reliable, efficient, and scalable across the company
  • Manage data ingestion, movement, and processing through core platforms like Snowflake, our ML Datalake, and real-time streaming systems
  • Improve data reliability, consistency, and performance, ensuring high-quality data for engineering, research, and business stakeholders
  • Collaborate with AI researchers, data scientists, product engineers, and business teams to understand data needs and build scalable solutions
  • Drive technical decisions and best practices for data ingestion, orchestration, processing, and storage
What we offer
What we offer
  • equity
  • health, dental & vision
  • retirement with company contribution
  • parental leave & reproductive or family planning support
  • mental health & wellness benefits
  • generous PTO
  • company recharge days
  • a learning & development stipend
  • a work from home stipend
  • cell phone reimbursement
  • Fulltime
Read More
Arrow Right

Senior Infrastructure Engineer

Coralogix is seeking a Senior Infrastructure Engineer to join our Core SRE team ...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
coralogix.com Logo
Coralogix
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in DevOps, SRE, platform engineering, or infrastructure roles
  • Deep understanding of Kubernetes: API, CNI, scheduling, container runtimes and such
  • Strong hands-on experience with Kafka and Istio (or similar technologies ), and core networking protocols (HTTP, gRPC, TLS)
  • Proven experience managing large-scale cloud infrastructure (AWS, GCP, etc.)
  • Experience in incident response and troubleshooting complex distributed systems
  • Some software engineering experience, preferably in Golang
  • Passion for automation, performance tuning, and operational excellence
Job Responsibility
Job Responsibility
  • Act as a hands-on technical leader with deep expertise in modern cloud infrastructure
  • Serve as a go-to person in the team — leading through influence, not hierarchy
  • Collaborate cross-functionally to refine requirements and propose innovative, scalable solutions
  • Drive long-term, high-impact infrastructure projects across multiple teams, from design to implementation, within defined timelines
  • Contribute to improving system reliability, performance, and cost-efficiency at scale
  • Fulltime
Read More
Arrow Right

Senior Infrastructure Engineer

Coralogix is seeking a Senior Infrastructure Engineer to join our Core SRE team ...
Location
Location
Israel , Ramat Gan
Salary
Salary:
Not provided
coralogix.com Logo
Coralogix
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in DevOps, SRE, platform engineering, or infrastructure roles
  • Deep understanding of Kubernetes: API, CNI, scheduling, container runtimes and such
  • Strong hands-on experience with Kafka and Istio (or similar technologies ), and core networking protocols (HTTP, gRPC, TLS)
  • Proven experience managing large-scale cloud infrastructure (AWS, GCP, etc.)
  • Experience in incident response and troubleshooting complex distributed systems
  • Some software engineering experience, preferably in Golang
  • Passion for automation, performance tuning, and operational excellence
Job Responsibility
Job Responsibility
  • Act as a hands-on technical leader with deep expertise in modern cloud infrastructure
  • Serve as a go-to person in the team — leading through influence, not hierarchy
  • Collaborate cross-functionally to refine requirements and propose innovative, scalable solutions
  • Drive long-term, high-impact infrastructure projects across multiple teams, from design to implementation, within defined timelines
  • Contribute to improving system reliability, performance, and cost-efficiency at scale
  • Fulltime
Read More
Arrow Right

DevOps Engineer – Kafka Service

We are looking for a highly skilled DevOps Engineer to take ownership of the Kaf...
Location
Location
Luxembourg , Leudelange
Salary
Salary:
Not provided
https://www.soprasteria.com Logo
Sopra Steria
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in DevOps, Site Reliability Engineering (SRE), or Kafka administration
  • Strong hands-on experience with Apache Kafka (setup, tuning, and troubleshooting)
  • Proficiency in scripting (Python, Bash) and automation tools (Terraform, Ansible)
  • Experience with cloud environments (AWS, Azure, or GCP) and Kubernetes-based Kafka deployments
  • Familiarity with Kafka Connect, KSQL, Schema Registry, Zookeeper
  • Knowledge of logging and monitoring tools (Dynatrace, ELK, Splunk)
  • Understanding of networking, security, and access control for Kafka clusters
  • Experience with CI/CD tools (Jenkins, GitLab, ArgoCD)
  • Ability to analyze logs, debug issues, and propose proactive improvements
  • Excellent problem-solving and communication skills
Job Responsibility
Job Responsibility
  • Kafka Administration & Operations: Deploy, configure, monitor, and maintain Kafka clusters in a high-availability production environment
  • Performance Optimization: Tune Kafka configurations, partitions, replication, and producers/consumers to ensure efficient message streaming
  • Infrastructure as Code (IaC): Automate Kafka infrastructure deployment and management using Terraform, Ansible, or similar tools
  • Monitoring & Incident Management: Implement robust monitoring solutions (e.g., Dynatrace) and troubleshoot performance bottlenecks, latency issues, and failures
  • Security & Compliance: Ensure secure data transmission, access control, and compliance with security best practices (SSL/TLS, RBAC, Kerberos)
  • CI/CD & Automation: Integrate Kafka with CI/CD pipelines and automate deployment processes to improve efficiency and reliability
  • Capacity Planning & Scalability: Analyze workloads and plan for horizontal scaling, resource optimization, and failover strategies
What we offer
What we offer
  • Work among high-level professionals at the forefront of corporate software solutions and innovation at Europe’s Leading Digital Service Provider
  • Fulltime
Read More
Arrow Right

Software Engineer, Infrastructure

The Infrastructure team builds foundational systems at scale. We're hundreds o b...
Location
Location
United States , New York City; San Francisco Bay Area
Salary
Salary:
171200.00 - 246000.00 USD / Year
metronome.com Logo
Metronome
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years building infrastructure systems: Hands-on experience with distributed systems, cloud infrastructure, container orchestration, data pipelines, observability, CI/CD, or other foundational platforms
  • Ownership of production systems: Track record of operating mission-critical infrastructure with strong focus on reliability, scalability, and performance
  • Force multiplier mindset: You build platforms that enable others. You create abstractions that make complex systems approachable. You think about developer experience as a first-class concern
  • Cross-functional collaboration: You partner effectively with product teams, communicate technical decisions clearly, and mentor engineers across experience levels
Job Responsibility
Job Responsibility
  • Build platforms that scale: Design and operate foundational infrastructure—Kubernetes clusters, Kafka streaming platforms, Spark batch processing, observability systems—that handle billions of events and enable Metronome to grow with minimal friction
  • Enable product velocity: Create golden paths, abstractions, and tooling that let engineers ship faster and more reliably without becoming infrastructure experts themselves
  • Enable reliability as the product: Take accountability for system uptime, performance, and correctness. Build monitoring, alerting, and incident response systems that enable the entire team catch problems before customers notice
  • Drive technical direction: Shape Metronome's infrastructure strategy, make platform-level architectural decisions, and mentor engineers across the organization
What we offer
What we offer
  • Excellent medical, dental, vision, and life insurance coverage, including a One Medical membership
  • Paid parental leave
  • FSA (Flexible spending account)
  • Retirement planning - Traditional and ROTH 401(k)
  • Flexible time off
  • Employee assistance program (mental health benefits)
  • Culture where personal growth is highly valued
  • market-benched equity
  • sales incentive pay (for eligible roles)
  • comprehensive health benefits
  • Fulltime
Read More
Arrow Right

Senior Cloud Infrastructure Engineer

HPE Aruba Networking is a leading provider of next-generation networking solutio...
Location
Location
United States , San Jose
Salary
Salary:
133500.00 - 307000.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum expected industry experience is around 6 years
  • Minimum education at BS or MS level in Computer Science or related fields
  • Proven record of developing and releasing cloud applications in the production environment
  • Experience with DevOps and Cloud Infrastructure Deployment and Automation in Python, Terraform, Ansibles, GitOps, GitLabs, and Jenkins/Spinnaker
  • Experience in RDBMS (Postgres), GraphQL, and NoSQL (Cassandra, OpenSearch, Clickhouse, and etc.)
  • Experience in cloud stacks such as Redis, Kafka, RabbitMQ, Hazelcast
  • Experience in development in Kubernetes and Docker containers
  • Programming language experience with Shell Scripts, Python, Golang, or Java
  • Ability to deploy various techniques to ‘scale’ an application in a cloud environment
  • Demonstrated abilities to work with QA and Remote Teams
Job Responsibility
Job Responsibility
  • Participate in architecture and design discussions
  • Develop scalable applications that run on top of Next Generation Central
  • Contribute to multiple technical programs simultaneously
What we offer
What we offer
  • Health benefits
  • Comprehensive suite of benefits supporting physical, financial, and emotional wellbeing
  • Personal and professional development programs
  • Inclusion and diversity initiatives
  • Exciting and fun work culture
  • Innovation and growth opportunities
  • Fulltime
Read More
Arrow Right

Senior Infrastructure Kafka Engineer

We are seeking a Senior Infrastructure - Kafka Engineer to join a high-performin...
Location
Location
United States , Phoenix
Salary
Salary:
Not provided
technologent.com Logo
Technologent
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in infrastructure engineering with a strong focus on: Kafka administration across on-prem and cloud environments
  • Kafka ecosystem components including brokers, topics, consumer groups, replication, and failover
  • Messaging systems such as MQ
  • SQL and NoSQL database integration
  • Proven experience designing, deploying, and scaling Kafka clusters and connector infrastructure in production and DR environments
  • Hands-on experience building real-time data pipelines using Kafka producers and streaming consumers such as Spark Streaming
  • Strong proficiency with at least one major cloud platform: AWS, GCP, or Azure
  • Experience with event-driven architectures, containerization, and DevOps practices
  • Experience with observability and monitoring tools such as Splunk, Datadog, and Grafana
  • Solid understanding of networking, Linux/Windows operating systems, and core diagnostic tools
Job Responsibility
Job Responsibility
  • Administer, configure, and troubleshoot Kafka clusters across on-prem and cloud environments, including broker and cluster configuration, partitioning, and performance tuning
  • Design and implement scalable, highly available Kafka infrastructure, including disaster recovery and multi-environment strategies
  • Integrate Kafka with upstream and downstream systems using Kafka Connect and related connectors, including MQ, MongoDB, Oracle, SQL Server, PostgreSQL, and MySQL
  • Build and support real-time data pipelines using Kafka producers and streaming consumers such as Spark Streaming and Kafka Streams
  • Automate infrastructure provisioning and configuration across environments using Terraform and modern DevOps practices
  • Deploy and manage Kafka components and clients in production and disaster recovery environments, ensuring resilience and recoverability
  • Lead a small team of engineers and technicians in monitoring, diagnosis, and remediation of infrastructure issues
  • Implement and maintain comprehensive monitoring, logging, and alerting using tools such as Splunk, Datadog, and Grafana
  • Perform proactive health checks and capacity planning to identify and resolve issues before they impact service
  • Serve as a primary point of contact for daily operations, major incidents, and escalations related to Kafka and associated infrastructure
  • Fulltime
Read More
Arrow Right

Senior Engineering Manager, Big Data

Checkr is looking for a Senior Engineering Manager to lead the Criminal Data tea...
Location
Location
United States , San Francisco
Salary
Salary:
238000.00 - 280000.00 USD / Year
https://checkr.com Logo
Checkr
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years as an engineering manager
  • 8+ years as an engineer
  • Exceptional verbal and written communication skills
  • Unparalleled bar for quality (data quality metrics, QC gates, data governance, automated regression test suites, data validations, etc)
  • Experience working on data products at scale and understanding the legal, human impact, and technical nuances of supporting a highly regulated product
  • Experience designing and maintaining: Real-time & batch processing data pipelines serving up billions of data points
  • Normalizing and cleansing data across a medallion lakehouse architecture
  • Systems that rely on high-volume, low-latency messaging infrastructure (e.g. Kafka or similar)
  • Highly tolerant production systems with streamlined operations (data lineage, logging, telemetry, alerting, etc.)
  • Familiarity with AWS Glue, OpenSearch, EMR, etc
Job Responsibility
Job Responsibility
  • Drive a motivating technical vision for the team
  • Partner closely with product management to solve business problems
  • Work with the team to build a world-class architecture that can scale into the next phase of Checkr’s growth
  • Hire the best talent and continue to raise the bar for the team
  • Represent the team in planning and product meetings
  • Optimize engineering processes and policies to drive velocity and quality
What we offer
What we offer
  • A fast-paced and collaborative environment
  • Learning and development allowance
  • Competitive compensation and opportunity for advancement
  • 100% medical, dental, and vision coverage
  • Up to 25K reimbursement for fertility, adoption, and parental planning services
  • Flexible PTO policy
  • Monthly wellness stipend, home office stipend
  • Fulltime
Read More
Arrow Right