Senior Systems Operations Engineer – Infrastructure Development

Wells Fargo

Location:
United States, Iselin

Contract Type:
Not provided

Salary:

41.83 - 80.77 USD / Hour
Job offer has expired

Job Description:

Wells Fargo is seeking a seasoned Senior Systems Operations Engineer to join our App and Web Engineering team and build the automation foundations that provision, manage, and scale our enterprise application and web server hosting platforms. This role is ideal for a hands‑on engineer who understands the operational complexity of running large‑scale server environments and is passionate about enabling frictionless self‑service through automation. You will lead by example, designing and implementing modular IaC components and GitOps workflows that abstract the complexity of provisioning and managing application/web servers, configuring runtime settings, tuning performance parameters, enforcing security policies, integrating with routing layers, and ensuring end‑to‑end observability.

Job Responsibility:

  • Lead large‑scale initiatives to automate provisioning, configuration, and lifecycle operations for application and web server platforms (Tomcat, Apache HTTPD, IBM Liberty, NGINX, etc.)
  • Architect and develop reusable IaC components (Ansible) for server installation, configuration management, clustering, routing, JVM tuning, certificate automation, and policy enforcement
  • Develop automation scripts and workflows using Python or Java to support provisioning, configuration, governance, certificate management, and operational efficiency
  • Develop robust APIs using Java Spring Boot or Python to expose provisioning, configuration, deployment governance, certificate management, and capacity automation workflows
  • Design and implement GitOps‑driven workflows to automate server configuration updates—such as routing rules, reverse proxy updates, JVM or container runtime changes, TLS rotation, module/plugin configuration, and environment policies
  • Build and maintain self‑service platform capabilities enabling developers to request server instances, deploy applications, configure routing, request certificates, manage JNDI/resources, and consume metrics through APIs or service catalogs
  • Collaborate across engineering, security, and product teams to ensure platform automation aligns with organizational goals and best practices
  • Participate in architecture and code reviews while mentoring engineers on server operations, IaC design patterns, and automation best practices
  • Continuously improve platform reliability, performance, scalability, and operational efficiency through automation modernization and engineering excellence

Requirements:

  • 4+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 3+ years of full‑stack or backend software development experience using Java and/or Python
  • 3+ years of hands-on experience deploying and operating one or more application/web server technologies (Tomcat, Apache HTTPD, IBM Liberty, NGINX, etc.)
  • 3+ years of experience with IaC tools such as Terraform and Ansible
  • 3+ years of experience implementing GitOps or similar automation practices
  • 1+ year of experience with Kubernetes/OCP, containerization, and hybrid cloud platforms (AWS, Azure, GCP)
  • 2+ years of experience designing and consuming RESTful APIs and integrating automation into platform services

Nice to have:

  • Deep understanding of server internals across one or more platforms (e.g., connectors, routing engines, thread pools, modules/plugins, classloading, cluster coordination, reverse proxy behavior)
  • Experience integrating application/web servers with load balancers, API gateways, and service meshes
  • Experience implementing enterprise features such as JNDI/JDBC (Tomcat/Liberty), reverse‑proxy modules (Apache/NGINX), or Liberty features/packs
  • Familiarity with designing scalable hosting architectures and deployment pipelines for Java and web applications
  • Experience with HA and DR patterns spanning multi‑AZ or multi‑region deployments across server platforms
  • Hands‑on experience with observability tools (Prometheus, Grafana, ELK) and platform‑specific monitoring interfaces (e.g., JMX, mod_status, NGINX stub_status, Liberty admin metrics)

What we offer:

  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
  • Scholarships for dependent children
  • Adoption reimbursement

Additional Information:

Job Posted:
March 18, 2026

Expiration:
March 19, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work

Similar Jobs for Senior Systems Operations Engineer – Infrastructure Development

Senior Systems Engineer

We are looking for a versatile and driven Senior Systems Engineer to join our En...
Location:
United States, Chicago
Salary:
130000.00 USD / Year
AKUNA CAPITAL
Expiration Date:
Until further notice
Requirements:
  • Degree in Computer Science, Information Systems, or a related field
  • 5-7 years of systems engineering experience
  • Advanced Linux knowledge including kernel bypass, kernel tuning, and customizing kernels
  • Deep understanding of virtualization and containerization technologies
  • Extensive experience with a variety of Linux distributions (RedHat, Ubuntu, etc.)
  • Deep understanding of system monitoring and configuration management tools (Ansible, Foreman, Prometheus and Icinga/Nagios)
  • Proficiency in scripting and using automation and orchestration tools such as Python and Bash
  • Expertise in troubleshooting multicast and TCP related performance issues
  • Experience automating daily software and hardware related tasks
  • Demonstrated ability to lead large technical projects
Job Responsibility:
  • Analyze complex technical problems and collaborate on designing solutions for Akuna’s global Infrastructure platform
  • Drive projects and solutions to completion in a fast-paced environment
  • Design, develop and maintain orchestration and configuration solutions
  • Collaborate with developers and other infrastructure engineers to research new products and techniques that drive innovation and improve efficiency and performance in the environment
  • Architect and maintain multi-vendor, tier-based storage solutions
  • Build out a test automation framework for systems performance testing and tuning
  • Create and institute process enforcement across environments
  • Create tools that assist teams to optimize the available infrastructure
  • Develop and maintain comprehensive technical documentation, including system configurations, procedures, and troubleshooting guides
  • Lead knowledge transfer sessions and mentor team members to ensure continuity and operational excellence
What we offer:
  • Discretionary performance bonus
  • Comprehensive benefits package that may encompass employer-paid medical, dental, vision, retirement contributions, paid time off, and other benefits
  • Fulltime

Senior Systems Engineer

AnaVation is seeking a highly skilled Senior Systems Engineer to join our Cross ...
Location:
United States, Vienna
Salary:
Not provided
AnaVation
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Engineering, Computer Science, or related technical discipline
  • 7–9 years of documented experience in Information Systems Engineering
  • Hardware and network designs for large-scale enterprise applications
  • Implementing and maintaining security best practices, creating and maintaining documentation for architecture, configuration and processes
  • Experience establishing and maintaining monitoring and alerting systems for cloud and on-premises resources
  • Optimizing on-premises and cloud infrastructure for cost efficiency and performance
  • Troubleshooting and resolving issues related to performance and availability
  • Documented and demonstrated experience with troubleshooting and problem solving
  • Experience with software development
  • Experience scripting and programming for automation
Job Responsibility:
  • Architect, develop and support a highly available resource for mission-critical programs composed of numerous AWS services and on-premises servers across multiple locations
  • Automation and Cloud Integration: Automate the creation and management of AWS resources using AWS CloudFormation, AWS Lambda, GitLab, BASH, and Python scripting
  • Infrastructure Lifecycle Automation: Design and implement an automated, hands-free monthly server rebuild and switchover process leveraging CloudFormation, Lambda, and EventBridge
  • Linux Automation and Monitoring: Develop and maintain a comprehensive system of scripts and processes to automate configuration, maintenance, and monitoring of UNIX systems
  • Maintain network hardware and server infrastructure, including analysis, configuration, installation, and testing of new hardware and software
  • Support daily network operations, evaluating utilization, monitoring response times, and detecting and resolving operational problems
  • Troubleshoot issues at both the physical and logical levels of the network, using diagnostic tools and communication protocol analysis
  • Participate in planning, design, technical reviews, and implementation of network and infrastructure projects supporting voice and data communications
  • Maintain and enhance network infrastructure standards, including TCP/IP communication protocols, and ensure adherence to industry and security best practices
  • Exhibit proficiency with virtualization technologies (VMware, AWS, etc.) and network administration, ensuring high system availability and scalability
What we offer:
  • Generous cost sharing for medical insurance for the employee and dependents
  • 100% company paid dental insurance for employees and dependents
  • 100% company paid long-term and short term disability insurance
  • 100% company paid vision insurance for employees and dependents
  • 401k plan with generous match and 100% immediate vesting
  • Competitive Pay
  • Generous paid leave and holiday package
  • Tuition and training reimbursement
  • Life and AD&D Insurance
  • Fulltime

Senior Systems Engineer

The Senior Systems Engineer will be responsible for monitoring and maintaining o...
Location:
Canada, Mississauga
Salary:
Not provided
Greenfield Global
Expiration Date:
Until further notice
Requirements:
  • Minimum of 10 years’ work experience in operations and infrastructure
  • 4+ years of experience working in public cloud, MS365 and similar technologies
  • 2+ years of experience as a senior team member executing complex projects and deliverables
  • Extensive knowledge and experience with enterprise hardware and virtualization stacks (Cisco UCS, VMware, SANs, iSCSI networks)
  • Experience with backup platforms, methodologies and validation techniques (Veeam preferred)
  • Strong knowledge of maintaining Active Directory domains and Entra ID environments
  • Exceptional written and verbal communication skills
Job Responsibility:
  • Collaborate closely with Operations team members and the wider IT Dept to offer solutions to business needs
  • Maintain and support VMware, Active Directory, SANs and other components that comprise an on-premises enterprise environment
  • Work with the IT Dept and business to develop and maintain backup and recovery strategies
  • Identify and research high-value improvements to physical and virtual infrastructure
  • Maintain and tune monitoring tools to provide validated alerts when needed and limit alert storms and fatigue
  • Develop methods to facilitate non-disruptive update and upgrade paths for enterprise systems
  • Ability to automate where possible using various tools (scripting, Power Automate/Logic Apps, Azure Arc functions)
  • Stay current and knowledgeable of new technologies and processes within enterprise IT and manufacturing spaces
  • Exceptional communicator who can collaborate with stakeholders to gather and understand business related needs
  • Develop and maintain process-related documentation
What we offer:
  • Formal and informal training opportunities
  • Comprehensive health and dental benefits
  • Income protection: short- and long-term disability coverage, life insurance, paid personal sick time
  • Vacation time exceeding industry standards
  • Company funded retirement savings program with individual contribution opportunities
  • Meaningful and challenging work
  • Curated intentional culture focused on growth and development, engagement, and communication

Senior Developer Success Engineer

Temporal is an open source programming model that can simplify code, make applic...
Location:
India, Bangalore
Salary:
Not provided
Temporal
Expiration Date:
Until further notice
Requirements:
  • Must reside in and be eligible for employment in India with a strong preference for Bangalore
  • 6+ years of experience as a developer, preferably fluent in two of the following languages: Python, Java, Golang, TypeScript
  • Experience deploying and managing medium to large-scale architectures (e.g., Kubernetes or Docker)
  • Experience with monitoring tools such as Prometheus and Grafana and troubleshooting performance and availability issues
  • Minimum of one year experience in an internal or external customer-facing role
  • Passion for helping others regardless of who they are or how they act
  • Experience working with or as part of remote teams
  • Strong written and verbal communication skills
  • Seek to understand first, lead with data, and rely on facts
Job Responsibility:
  • Be the frontline technical expert for our developer community in India
  • Help users deploy and scale Temporal in cloud-native environments
  • Troubleshoot complex infrastructure issues, optimize performance, and develop automation solutions
  • Work with cloud-native, highly scalable infrastructure spanning AWS, GCP, Kubernetes, and microservices
  • Gain deep expertise in container orchestration, networking, and observability while learning from complex, real-world customer use cases
  • Hone your programming skills in infrastructure automation, resilience engineering, and performance tuning
  • Tackle scalability, reliability, and troubleshooting challenges in distributed systems
  • Work directly with developers to debug complex infrastructure issues, optimize cloud performance, and enhance reliability for Temporal users
  • Develop observability solutions (Grafana, Prometheus), improve networking (load balancing, DNS, ingress/egress), and automate infrastructure operations (Terraform, IaC) to help customers run Temporal efficiently at scale
  • Independently drive technical solutions, whether debugging complex production issues or designing infrastructure best practices
What we offer:
  • Unlimited PTO, 12 Holidays + 2 Floating Holidays
  • 100% Premiums Coverage for Medical, Dental, and Vision
  • AD&D, LT & ST Disability, and Life Insurance (Standard & Supplemental Available)
  • Empower 401K Plan
  • Additional Perks for Learning & Development, Lifestyle Spending, In-Home Office Setup, Professional Memberships, WFH Meals, Internet Stipend and more
  • Paid Time Off (PTO) and Benefits outside the United States vary by country, and are issued in partnership with Remote.com
  • Perks to all international employees for learning & career development, a lifestyle spending account, in-home office setup (in addition to company-issued hardware), professional memberships, work-from-home meals, and access to the Calm app for mental wellness
  • $3,600 / Year Work from Home Meals
  • $1,500 / Year Career Development & Learning
  • $1,200 / Year Lifestyle Spending Account
  • Fulltime

Senior Engineering Manager, Search Infrastructure

Atlassians have flexibility in where they work; The Search Platform team is resp...
Location:
India, Bengaluru
Salary:
Not provided
Atlassian
Expiration Date:
Until further notice
Requirements:
  • 4+ years of experience managing high-performing software engineering teams running core services at scale
  • Deep technical experience building and scaling search applications and distributed systems using large amounts of data on cloud platforms, preferably AWS
  • Expert-level knowledge and understanding of low-latency distributed data management and query processing systems; experience with Lucene-based stacks is strongly preferred
  • Proven track record of consistent execution delivering outsized results with strong operational rigour
  • Strong organisation and communication skills with the ability to drive clarity in an ambiguous environment
  • Ability to hire, onboard, and retain top talent for your team and foster a culture of innovation, collaboration, and excellence
  • Passion for mentoring and coaching your team members on best practices, code quality, design patterns, testing and operational skills
  • Focus on business outcomes and the 80/20 rule
  • Proactive approach and a desire to innovate in a large, fast-paced organisation
Job Responsibility:
  • Own a part of the mission for the overall Search Platform team
  • Responsible for building the highest performing teams
  • Develop and work closely with senior engineers to drive technical solutions and architecture
  • Act as a role model for continuously upgrading deep technical skills, engineering judgment and operational rigour
What we offer:
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime

Senior Infrastructure Engineer - Postgres

ClickHouse is expanding its cloud data platform across AWS, GCP, and Azure—addin...
Location:
United States
Salary:
140000.00 - 208000.00 USD / Year
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • 7+ years in SRE, DevOps, or infrastructure engineering, with a track record of running distributed, production-grade systems
  • Solid understanding of Postgres operations, scaling, and performance tuning
  • Deep hands-on experience across AWS, with exposure to GCP and Azure; comfortable navigating multi-cloud topologies
  • Proficient with Terraform, Kubernetes, and container-based infrastructure
  • Strong Go development skills (or willingness to write and own production Go code)
  • Familiar with tools like Prometheus, Grafana, Loki, OpenTelemetry, or equivalents
  • Deep understanding of SLOs, incident response, and continuous improvement in service reliability
  • You operate with a founder’s mentality — hands-on, resourceful, and willing to dive deep to get things done. You take pride in hard work, autonomy, and shipping impactful systems
Job Responsibility:
  • Lead reliability and operations for ClickHouse’s Postgres integration — upgrades, patching, maintenance, and scaling
  • Design and implement automation for provisioning, deployments, and service lifecycle management across AWS, GCP, and Azure
  • Develop infrastructure-as-code using Terraform and modern CI/CD tooling to ensure consistent, repeatable deployments
  • Contribute Go-based tooling and services that improve automation, observability, and developer experience
  • Own observability and monitoring, ensuring robust alerting, metrics, and tracing across environments
  • Drive incident management and postmortem practices that strengthen reliability and learning loops
  • Collaborate cross-functionally with platform, networking, and product teams to improve service operability
  • Mentor and enable engineers, helping the team scale effectively as customer adoption grows
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime

Senior Software Engineer - Cloud Infrastructure

The Cloud Infrastructure Engineering team builds and manages the foundational bl...
Location:
Australia
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • Software development experience in Go, C/C++, Java, or another OOP language
  • Experience with cloud technologies such as AWS, Azure, or GCP, including infrastructure-as-code (IaC) tools such as Terraform or CloudFormation
  • Experience developing cloud infrastructure services, preferably with Kubernetes
  • Experience developing cloud-native edge or service mesh services, preferably with Envoy and Istio
  • Experience leading and shipping large scope technical projects in collaboration with multiple experienced engineers
  • Understanding of network topologies, protocols, and security principles, such as VPNs, firewalls, and load balancers
  • Knowledge of cloud security best practices, including encryption, access controls, and compliance standards like SOC2 and GDPR
  • You have excellent communication skills and the ability to work well within a global team
  • You are a strong problem-solver and have solid production debugging skills
Job Responsibility:
  • Architect and build a robust, scalable, and highly available distributed infrastructure
  • Build a cutting-edge cloud-native platform on top of the public cloud, and automate our cloud resource management
  • Work closely with our ClickHouse core database development team and security team, partnering with them to produce the SaaS offering
  • Work on routing and traffic components to improve the reliability and scalability of our cloud service
  • Systematically improve availability by applying industry and distributed systems best practices
  • Design and build security components & tooling: firewall, PKI and certificate infra, zero trust network, etc.
  • Improve performance and cost efficiency of our infrastructure
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites

Senior Software Engineer - Cloud Infrastructure

About ClickHouse: Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is on...
Location:
Singapore
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • Software development experience in Go, C/C++, Java, or another OOP language
  • Experience with cloud technologies such as AWS, Azure, or GCP, including infrastructure-as-code (IaC) tools such as Terraform or CloudFormation
  • Experience developing cloud infrastructure services, preferably with Kubernetes
  • Experience developing cloud-native edge or service mesh services, preferably with Envoy and Istio
  • Experience leading and shipping large scope technical projects in collaboration with multiple experienced engineers
  • Understanding of network topologies, protocols, and security principles, such as VPNs, firewalls, and load balancers
  • Knowledge of cloud security best practices, including encryption, access controls, and compliance standards like SOC2 and GDPR
  • You have excellent communication skills and the ability to work well within a global team
  • You are a strong problem-solver and have solid production debugging skills
Job Responsibility:
  • Architect and build a robust, scalable, and highly available distributed infrastructure
  • Build a cutting-edge cloud-native platform on top of the public cloud, and automate our cloud resource management
  • Work closely with our ClickHouse core database development team and security team, partnering with them to produce the SaaS offering
  • Work on routing and traffic components to improve the reliability and scalability of our cloud service
  • Systematically improve availability by applying industry and distributed systems best practices
  • Design and build security components & tooling: firewall, PKI and certificate infra, zero trust network, etc.
  • Improve performance and cost efficiency of our infrastructure
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites