Take ownership of project priorities, deadlines and deliverables using Agile methodologies
Understand and refine business and functional requirements
Lead with vision and act as a partner in brainstorming solutions that improve team productivity and efficiency, guiding and motivating others
Drive priority setting of the engineering teams
Improve Engineering culture by encouraging continuous focus on reliability across the entire application lifecycle
Participate proactively in architectural and design decisions
Design and implement end-to-end monitoring solutions for Application and Infrastructure components, based on cutting-edge SLO-based telemetry tools
Drive the engineering of capacity management and demand forecasting solutions
Act as a culture carrier and leader, passing on SRE knowledge and best practices to the engineering team
Drive detailed root cause investigations for production incidents with rigorous focus on issue avoidance
Create and coordinate retrospectives for significant incidents
Perform additional core engineering functions, such as adding custom telemetry metrics to the code base of in-scope Applications
Anticipate new opportunities to continuously evolve the resiliency profile of scoped Applications and Infrastructure
Requirements:
B.S. / M.S. degree in Computer Science, Engineering or a related field with 8 – 12 years of experience
Experience with troubleshooting performance issues
A passion for providing engineering support for highly available, performant full stack applications with a “Student of Technology” attitude
Experience with CI/CD practices
Passion for working in a team environment, with multitasking and effective communication skills
Excellent analytical, problem-solving and debugging skills
Demonstrated ability to build positive relationships
Self-motivated
Knowledge of software development methodologies (analysis, design, development, testing) and basic understanding of Agile / Scrum methodology and practices
Good hands-on object-oriented programming experience in Java, including proficiency in object-oriented principles and design patterns
Experience with Telemetry solutions and use cases – Log monitoring, performance monitoring and visualization tools like Grafana
Exposure to Infrastructure platform problem solving
Nice to have:
Understanding of Agile work environments, including knowledge of Git, CI/CD, and at least one scripting language such as Perl or Python
Experience with automated configuration management tools
Experience with AI or ML for problem solving or improving developer experience