This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
AutoRABIT is looking for a Senior Site Reliability/DevSecOps Engineer to help develop, scale and operate our cloud services. In this role you will be an experienced business professional able to implement and execute best practice operations and improvements across teams by providing visibility and recommendations for improved reliability and automation. Responsible for the security, availability, performance, efficiency, change management, monitoring, emergency response, capacity planning, back-up, and disaster recovery of our technical ecosystem, as well as drive automation while building a robust and agile DevSecOps framework.
Job Responsibility:
Contribute to the development and maintenance of frameworks for monitoring, automation and code to increase the scalability and reliability of the service
Assist both internal and customer facing teams with deployment of new software releases, VPN and other related security infrastructure interfacing
Assist with resolution of AutoRABIT service or customer issues as required
Participate in and practice sustainable incident response and blameless postmortems
Contribute to the automation of manual tasks, such as the provisioning of users in production and test environments
Help and develop peers’ capabilities through knowledge sharing, mentoring, and collaboration
Work within a small agile team to develop and improve SRE software, support your peers, plan and self-improve
Participate in a regular on-call or rotational schedule needed to support AutoRABIT servers, including weekends and holidays
Requirements:
Design, implement, and maintain scalable, resilient, and secure infrastructure using AWS
Develop and manage infrastructure as code using Terraform
Implement and manage CI/CD pipelines to automate deployments and ensure smooth delivery of applications
Monitor system performance, identify bottlenecks, and implement solutions to improve reliability and performance
Troubleshoot, resolve, and perform RCAs for incidents, while ensuring minimal disruption to services
Collaborate with development teams to ensure applications are designed for reliability and performance
Working Experience with Shell Scripting (Bash), Python or equivalent is required
Good Knowledge of programming languages such as Python, Go, or Java
Working Experience with configuration management tools such as Ansible or Chef
Implement and maintain monitoring, logging, and alerting systems to ensure the health and performance of our infrastructure
Ensure security best practices are followed and compliance requirements are met
Excellent written and verbal US English communication skills for working across a global team environment
Bachelors in Computer Science, Engineering, or equivalent degree or experience
5+ years of experience in site reliability engineering, DevOps, or a related field
AWS, GCP and/or Azure Certified
3+ Years of Kubernetes experience
3+ years' experience managing Linux-based systems in a public cloud such as AWS, GCP, or Azure
3+ years of experience with systems monitoring and logging
knowledge of ELK is preferred
Solid understanding of standard TCP/IP networking and common protocols like DNS, load balancers, HTTP, etc.
Must be a US citizen/permanent resident, and capable of obtaining a Government Security clearance if required and live in and work from the US. Green card holders qualify, but H1B or other work visa holders do not qualify for this role
Welcome to CrawlJobs.com – Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.
We use cookies to enhance your experience, analyze traffic, and serve personalized content. By clicking “Accept”, you agree to the use of cookies.