This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Xometry is seeking a Site Reliability Engineer II to join our Site Reliability Engineering (SRE) Organization. In this role as an individual contributor, you will guide the reliability and performance of our infrastructure and software systems across several engineering teams and influence decisions across our technology organization. You will utilize your technical skills and expertise to help us build reliable and flexible infrastructure solutions that empower our technology organization to quickly and safely deploy new features for our customers.
Job Responsibility:
Take ownership of assigned problem statements and drive them to completion with guidance from senior engineers
Write clean, efficient, and well-documented code while improving existing systems and features
Accurately estimate timelines for features and tasks, learning to balance effort, risk, and impact
Collaborate effectively across teams, communicating clearly on progress, blockers, and outcomes
Seek and apply feedback from peers and managers to improve code quality, technical skills, and delivery consistency
Support team members and contribute to a positive, learning-oriented team culture
Take ownership of personal development goals, showing steady progress in technical and problem-solving skills
Demonstrate accountability, curiosity, and continuous improvement in all aspects of your work
Develop, configure, and maintain underlying platforms for deployed software (AWS accounts and networking, kubernetes clusters, and similar systems)
Develop, configure, and maintain observability and monitoring tools (Coralogix, Sentry, etc.)
Develop, configure, and maintain software development (CI/CD) tools (github actions runners, ArgoCD, etc)
Requirements:
3+ years of professional experience in infrastructure management or backend software development experience in a fast-paced, product-driven environment
Demonstrated technical expertise in one or more of the following languages: Python, Javascript, or Unix Shell
Experience with AWS, including deploying, monitoring, and scaling production workloads
General experience with Terraform, Kubernetes, CI/CD pipelines, and Docker
Comfortable working in an operational environment, including participation in an on-call schedule
Excellent communication and collaboration skills, comfortable engaging with both technical and non-technical stakeholders
What we offer:
401(k) match
medical, dental and vision insurance
life and disability insurance
generous paid time off including vacation, sick leave, floating and fixed holidays, maternity and bonding leave