This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
NetApp is looking for a Technical Operations Engineer to join our growing Instaclustr team in the USA. NetApp’s Instaclustr offering provides open source as-a-service, delivering reliability at scale. We manage cutting edge open-source technologies (Cassandra, Kafka, PostgreSQL, OpenSearch) for our customers around the world. NetApp Instaclustr makes it easy for our customers to run powerful open-source applications at the highest levels of scale. We have developed a platform that takes care of the whole lifecycle: provisioning infrastructure, installing applications and, most importantly, keeping the applications running reliably in production. Since being founded in 2013, Instaclustr has grown strongly, with over 300 customers worldwide, and over 20,000 nodes under management. Our Technical Operations Engineers are the frontline team keeping our large fleet of cloud hosted open-source clusters up and running. Your work will ensure the security, reliability and performance of world-class systems and databases. You will collaborate with our customer’s technical teams, from globally recognized companies in the gaming, banking and logistics industry sectors, ranging from big multinationals to emerging start-ups.
Job Responsibility:
Provide expert support on incidents, diagnosing and solving data modelling and architectural issues by liaising with customer’s engineers and maintaining a high standard of customer communication
Undertake complex cluster tasks including but not limited to real time migrations with no downtime, upgrades (minor & major versions), performance tuning and maintenance of our fleet of 13000+ nodes for our customers
Provide expert support to our nodes running in the cloud (AWS/Azure/GCP), using technologies such as Linux (Debian, Ubuntu), Docker, and languages including Java, Python and bash
Investigate issues and apply standard maintenance procedures to optimize the performance and stability of production and non-production clusters that we manage
Develop and continually improve our suite of internal automation tools, applications, and processes
Be a proactive, reliable and supportive member of the support team, and participate in a 24/7 rotating on-call roster
Requirements:
Minimum of 6 years working experience
Designing and maintaining database architecture, data structures, tables, dictionaries and naming conventions to ensure the accuracy and completeness of all data master files. Experience with Apache Cassandra or similar NoSQL databases preferred
Testing systems and upgrades such as debugging, tracking, reproduction, logging and resolving all identified problems, according to approved quality testing scripts, procedures and processes
Experience in identifying architectural issues at scale in the cloud such as AWS/GCP/Azure
Developing and managing documentation, standards, policies, and procedures related to database operations
Programming skills in Python, Java, bash scripting, SQL, and source code control using Git
Exceptional ability to communicate clearly and professionally in written and verbal English (essential)
Any customer service experience is favorable
US Citizen
Bachelor's degree in a technical or engineering field OR Master's degree with less related experience
Nice to have:
Any customer service experience is favorable
What we offer:
Health Insurance
Life Insurance
Retirement or Pension Plans
Paid Time Off (PTO)
various Leave options
Performance-Based Incentives
employee stock purchase plan
restricted stocks (RSU’s)
Volunteer time off (40 hours of paid volunteer time each year)
Employee Assistance Program, fitness, and mental health resources