Senior Infrastructure Kafka Engineer Job at Technologent (Phoenix)

Kafka Infrastructure Engineer

We are seeking a highly skilled and motivated Senior Infrastructure Engineer to ...

Location

United States , Phoenix; Johnston; Iselin; Westwood; Plano

Salary:

125000.00 - 155000.00 USD / Year

Citizens Bank

Expiration Date

Until further notice

Requirements

7 or more years of experience in Kafka administration on-prem and cloud, messaging systems, and database integration
Proficiency with cloud platforms such as AWS, GCP, or Azure, event-driven architecture, DevOps, and containerization
Experience deploying Kafka clients and brokers in production and disaster recovery environments
Proven ability to scale Kafka clusters and connector infrastructure
Hands-on experience building real-time data pipelines using Kafka producers and Spark Streaming consumers
Familiarity with monitoring tools such as Splunk, Datadog, and Grafana
Strong knowledge of source control systems such as SVN and Git
Solid understanding of networking protocols, operating systems, and diagnostic tools
Proficiency in scripting languages such as PowerShell, Bash, Python, and Perl
Strong analytical and decision-making skills, even with limited information

Job Responsibility

Lead a team of engineers and technicians in monitoring, diagnosing, and resolving infrastructure issues using event-based management
Administer and troubleshoot Kafka clusters, including configuration and performance tuning
Integrate Kafka with various systems using connectors such as MQ, MongoDB, Oracle, SQL Server, PostgreSQL, and MySQL
Automate infrastructure setup across environments using Terraform
Provide senior-level support and troubleshooting across a wide range of technologies
Collaborate within agile teams to drive modern development practices and product vision
Serve as the primary point of contact for daily operations and incident management
Conduct proactive monitoring to identify and mitigate potential service disruptions
Document actions, create reports, and establish escalation procedures
Audit support tickets to identify patterns and reduce downtime

What we offer

Medical, dental and vision coverage
Retirement benefits
Maternity/paternity leave
Flexible work arrangements
Education reimbursement
Wellness programs

Fulltime

Senior Infrastructure Engineer - GenAI

We are seeking an experienced Senior Backend Engineer to design, develop, and ma...

Location

India , Chennai

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

Bachelor’s degree in computer science, Engineering, or related technical field, or equivalent practical experience
4–6 years of experience in backend engineering with focus on scalable, production systems
2+ years of hands-on experience with containerization, Kubernetes, and cloud infrastructure in production environments
Demonstrated experience with AI/ML model deployment and serving in production systems
Strong experience with backend development using Python, with familiarity in Go, Node.js, or Java for building scalable web services and APIs
Hands-on experience with containerization using Docker and orchestration platforms including Kubernetes, OpenShift, and AWS ECS in production environments
Proficient with cloud infrastructure, particularly AWS services (Lambda, ECS, EKS, S3, RDS, ElastiCache) and serverless architectures
Experience with CI/CD pipelines using Jenkins, GitLab CI, GitHub Actions, or similar tools, including Infrastructure as Code with Terraform or CloudFormation
Strong knowledge of databases including PostgreSQL, MongoDB, Redis, and experience with vector databases for AI applications
Familiarity with message queues (RabbitMQ, Apache Kafka, AWS SQS/SNS) and event-driven architectures

Job Responsibility

Design and implement scalable backend services and APIs for generative AI applications using microservices architecture and cloud-native patterns
Build and maintain model serving infrastructure with load balancing, auto-scaling, caching, and failover capabilities for high-availability AI services
Deploy and orchestrate containerized AI workloads using Docker, Kubernetes, ECS, and OpenShift across development, staging, and production environments
Develop serverless AI functions using AWS Lambda, ECS Fargate, and other cloud services for scalable, cost-effective inference
Implement robust CI/CD pipelines for automated deployment of AI services, including model versioning and gradual rollout strategies
Create comprehensive monitoring, logging, and alerting systems for AI service performance, reliability, and cost optimization
Integrate with various LLM APIs (OpenAI, Anthropic, Google) and open-source models, implementing efficient batching and optimization techniques
Build data pipelines for training data preparation, model fine-tuning workflows, and real-time streaming capabilities
Ensure adherence to security best practices, including authentication, authorization, API rate limiting, and data encryption
Collaborate with AI researchers and product teams to translate AI capabilities into production-ready backend services

Fulltime

Senior Infrastructure Engineer

Coralogix is seeking a Senior Infrastructure Engineer to join our Core SRE team ...

Location

Germany , Berlin

Salary:

Not provided

Coralogix

Expiration Date

Until further notice

Requirements

5+ years of experience in DevOps, SRE, platform engineering, or infrastructure roles
Deep understanding of Kubernetes: API, CNI, scheduling, container runtimes and such
Strong hands-on experience with Kafka and Istio (or similar technologies ), and core networking protocols (HTTP, gRPC, TLS)
Proven experience managing large-scale cloud infrastructure (AWS, GCP, etc.)
Experience in incident response and troubleshooting complex distributed systems
Some software engineering experience, preferably in Golang
Passion for automation, performance tuning, and operational excellence

Job Responsibility

Act as a hands-on technical leader with deep expertise in modern cloud infrastructure
Serve as a go-to person in the team — leading through influence, not hierarchy
Collaborate cross-functionally to refine requirements and propose innovative, scalable solutions
Drive long-term, high-impact infrastructure projects across multiple teams, from design to implementation, within defined timelines
Contribute to improving system reliability, performance, and cost-efficiency at scale

Fulltime

Senior Infrastructure Engineer

Coralogix is seeking a Senior Infrastructure Engineer to join our Core SRE team ...

Location

Israel , Ramat Gan

Salary:

Not provided

Coralogix

Expiration Date

Until further notice

Requirements

5+ years of experience in DevOps, SRE, platform engineering, or infrastructure roles
Deep understanding of Kubernetes: API, CNI, scheduling, container runtimes and such
Strong hands-on experience with Kafka and Istio (or similar technologies ), and core networking protocols (HTTP, gRPC, TLS)
Proven experience managing large-scale cloud infrastructure (AWS, GCP, etc.)
Experience in incident response and troubleshooting complex distributed systems
Some software engineering experience, preferably in Golang
Passion for automation, performance tuning, and operational excellence

Job Responsibility

Act as a hands-on technical leader with deep expertise in modern cloud infrastructure
Serve as a go-to person in the team — leading through influence, not hierarchy
Collaborate cross-functionally to refine requirements and propose innovative, scalable solutions
Drive long-term, high-impact infrastructure projects across multiple teams, from design to implementation, within defined timelines
Contribute to improving system reliability, performance, and cost-efficiency at scale

Fulltime

Senior Infrastructure Engineer

Senior Infrastructure - Kafka Engineer, Enterprise Data Engineering. We are seek...

Location

United States , Phoenix; Westwood; Johnston; Iselin; Plano

Salary:

125000.00 - 145000.00 USD / Year

Citizens Bank

Expiration Date

Until further notice

Requirements

7 or more years of experience in Kafka administration on-prem and cloud, messaging systems, and database integration
Proficiency with cloud platforms such as AWS, GCP, or Azure, event-driven architecture, DevOps, and containerization
Experience deploying Kafka clients and brokers in production and disaster recovery environments
Proven ability to scale Kafka clusters and connector infrastructure
Hands-on experience building real-time data pipelines using Kafka producers and Spark Streaming consumers
Familiarity with monitoring tools such as Splunk, Datadog, and Grafana
Strong knowledge of source control systems such as SVN and Git
Solid understanding of networking protocols, operating systems, and diagnostic tools
Proficiency in scripting languages such as PowerShell, Bash, Python, and Perl
Strong analytical and decision-making skills, even with limited information

Job Responsibility

Lead a team of engineers and technicians in monitoring, diagnosing, and resolving infrastructure issues using event-based management
Administer and troubleshoot Kafka clusters, including configuration and performance tuning
Integrate Kafka with various systems using connectors such as MQ, MongoDB, Oracle, SQL Server, PostgreSQL, and MySQL
Automate infrastructure setup across environments using Terraform
Provide senior-level support and troubleshooting across a wide range of technologies
Collaborate within agile teams to drive modern development practices and product vision
Serve as the primary point of contact for daily operations and incident management
Conduct proactive monitoring to identify and mitigate potential service disruptions
Document actions, create reports, and establish escalation procedures
Audit support tickets to identify patterns and reduce downtime

What we offer

comprehensive medical, dental and vision coverage
retirement benefits
maternity/paternity leave
flexible work arrangements
education reimbursement
wellness programs
competitive pay
opportunity to earn an annual discretionary bonus

Fulltime

Senior Software Engineer - Infrastructure Reliability

We are seeking a Senior Software Engineer to join our Security Product team, foc...

Location

India , Bangalore

Salary:

Not provided

JFrog

Expiration Date

Until further notice

Requirements

7+ years of experience in software engineering, with at least 3+ years focused on debugging and solving infrastructure-level problems in distributed systems
Strong proficiency in Go
familiarity with Python and Helm is a plus
Deep hands-on experience with RabbitMQ or similar message brokers (Kafka, ActiveMQ) - including queue management, clustering, monitoring, and production troubleshooting
Solid working knowledge of Kubernetes (pod lifecycle, resource management, networking, debugging CrashLoopBackOff / OOMKilled scenarios) and Docker
Experience investigating production incidents and conducting post-incident reviews with clear root cause analysis and follow-through
Strong understanding of Linux systems, networking fundamentals, and cloud infrastructure (AWS, Azure, or GCP)
Ability to read and interpret logs, thread dumps, heap dumps, and system metrics to isolate root causes under time pressure
Excellent analytical and problem-solving skills with a methodical approach to debugging
Strong written and verbal communication skills - ability to produce clear incident reports, root cause analyses, and playbooks, and to communicate effectively across engineering, SRE, and customer-facing teams

Job Responsibility

Investigate system outages and production failures across customer environments (SaaS and self-hosted), spanning RabbitMQ, Kubernetes, Docker, Postgres, and cloud infrastructure (AWS, Azure, GCP)
Identify recurring failure patterns and systemic weaknesses from incident data, and drive them to resolution - whether by writing Go code yourself (resilience features, infrastructure fixes, observability) or by collaborating with service owners to prioritize and address reliability gaps
Lead and participate in post-incident reviews - document root causes, corrective actions, and follow through to ensure issues are properly resolved
Collaborate with production engineering and SRE teams to develop and maintain operational playbooks and runbooks that reduce time-to-resolution
Diagnose root causes across the full stack - message queue failures, container lifecycle issues, cloud networking, disk and memory pressure, and deployment topology mismatches
Design and implement data migrations and lifecycle management for infrastructure components such as queue management and vhost operations
Emit and monitor operational metrics to proactively detect infrastructure degradation and measure service reliability

Senior Cloud Infrastructure Engineer

HPE Aruba Networking is a leading provider of next-generation networking solutio...

Location

United States , San Jose

Salary:

133500.00 - 307000.00 USD / Year

Hewlett Packard Enterprise

Expiration Date

Until further notice

Requirements

Minimum expected industry experience is around 6 years
Minimum education at BS or MS level in Computer Science or related fields
Proven record of developing and releasing cloud applications in the production environment
Experience with DevOps and Cloud Infrastructure Deployment and Automation in Python, Terraform, Ansibles, GitOps, GitLabs, and Jenkins/Spinnaker
Experience in RDBMS (Postgres), GraphQL, and NoSQL (Cassandra, OpenSearch, Clickhouse, and etc.)
Experience in cloud stacks such as Redis, Kafka, RabbitMQ, Hazelcast
Experience in development in Kubernetes and Docker containers
Programming language experience with Shell Scripts, Python, Golang, or Java
Ability to deploy various techniques to ‘scale’ an application in a cloud environment
Demonstrated abilities to work with QA and Remote Teams

Job Responsibility

Participate in architecture and design discussions
Develop scalable applications that run on top of Next Generation Central
Contribute to multiple technical programs simultaneously

What we offer

Health benefits
Comprehensive suite of benefits supporting physical, financial, and emotional wellbeing
Personal and professional development programs
Inclusion and diversity initiatives
Exciting and fun work culture
Innovation and growth opportunities

Fulltime

Senior Software Engineer, Streaming Infrastructure

Join us in building the future of finance. Our mission is to democratize finance...

Location

United States , Bellevue

Salary:

196000.00 - 230000.00 USD / Year

Robinhood

Expiration Date

Until further notice

Requirements

5+ years of professional experience in software engineering, including building distributed systems at scale
A background in tools like Kafka, Flink and Debezium
Proficiency in designing and implementing event-driven architectures and stream processing systems
A passion for platform engineering and creating great experiences for other developers
Strong communication and collaboration skills to work across technical teams

Job Responsibility

Design and operate distributed data streaming platforms that scale to billions of events per day
Develop secure, performant, and highly reliable systems using technologies like Kafka, Flink, and Debezium
Collaborate closely with product, infrastructure, data, and ML teams to ensure the platform supports diverse use cases
Build tools and documentation to deliver a smooth, empowering experience for internal developers
Mentor and support other engineers to drive architectural decisions and long-term technical strategy

What we offer

Performance-driven compensation with multipliers for outsized impact, bonus programs, equity ownership, and 401(k) matching
100% paid health insurance for employees with 90% coverage for dependents
Lifestyle wallet — a highly flexible benefits spending account for wellness, learning, and more
Employer-paid life & disability insurance, fertility benefits, and mental health benefits
Time off to recharge including company holidays, paid time off, sick time, parental leave, and more
Exceptional office experience with catered meals, events, and comfortable workspaces

Fulltime

Select Country

Senior Infrastructure Kafka Engineer

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?