CrawlJobs Logo

Cloud SRE Intern

allianceautomotive.co.uk Logo

Alliance Automotive UK LV Ltd

Location Icon

Location:
United States , Birmingham

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Ready for a challenging and rewarding internship? This is your opportunity to work hands on with project teams throughout the summer and see your development projects put into production to solve business needs and grow your capabilities! Join a leading industrial distribution company and unleash your technology skills to move our business forward!

Job Responsibility:

  • Participate on an Agile development team developing cloud-native services and integrations for company needs
  • Work on a capstone project on a topic in your discipline to present to IT leadership
  • Work alongside senior developers and architects on assigned tasks
  • Document, design, develop, test, and monitor solutions
  • Support deployment pipeline of products to production, and triage and solve issues

Requirements:

  • Working on a BS degree in a computer related field (e.g. Computer Science, Engineering)
  • Working knowledge of software development languages (Java preferred)
  • Familiarity with cloud platforms and technologies (Google Cloud preferred)
  • Familiarity with DevSecOps processes and tools (e.g. Git, CI/CD pipelines)
  • Familiarity with Linux shell and Windows scripting
  • High Level understanding of full software lifecycle development
  • Excellent communication skills (both verbal and written)
  • Must be self-motivated and know when to seek guidance
  • Individual must be a self-starter and capable of working independently as well as part of a team
  • Capable of learning new tools and technologies
  • Strong critical thinking and problem solving skills

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Parttime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Cloud SRE Intern

Hadoop SRE Engineer - VP

The Engineering Lead Analyst is a senior level position responsible for leading ...
Location
Location
United States , Tampa, Florida; Irving, Texas
Salary
Salary:
113840.00 - 170760.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6-10 years of relevant experience in an Engineering role
  • Experience working in Financial Services or a large complex and/or global environment
  • Project Management experience
  • Comprehensive knowledge of design metrics, analytics tools, benchmarking activities and related reporting to identify best practices
  • Demonstrated analytic/diagnostic skills
  • Ability to work in a matrix environment and partner with virtual teams
  • Ability to work independently, multi-task, and take ownership of various parts of a project or initiative
  • Ability to work under pressure and manage to tight deadlines or unexpected changes in expectations or requirements
  • Proven track record of operational process change and improvement
  • Proven track record of designing highly available platforms and services supporting various types of workloads
Job Responsibility
Job Responsibility
  • Serve as a technology subject matter expert for internal and external stakeholders
  • Provide direction for all firm mandated controls and compliance initiatives
  • Create a technology domain roadmap for Cloudera Hadoop Platform and Hadoop on Google Cloud Platform
  • Ensure that all integration of functions meet business goals
  • Define necessary system enhancements to deploy new products and process enhancements
  • Recommend product customization for system integration
  • Identify problem causality, business impact and root causes
  • Exhibit knowledge of how own specialty area contributes to the business and apply knowledge of competitors, products and services
  • Advise or mentor junior team members
  • Impact the engineering function by influencing decisions through advice, counsel or facilitating services
What we offer
What we offer
  • medical, dental & vision coverage
  • 401(k)
  • life, accident, and disability insurance
  • wellness programs
  • paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer - Core

We are looking for a Site Reliability Engineer to join our Core team to encourag...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
blockchain.com Logo
Blockchain
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with containerization and service orchestration, including best practices and security
  • Strong knowledge of at least one programming language
  • Linux, including an understanding of resource allocation, network and/or internals
  • Experience working with cloud solutions (GCP or AWS)
  • Deep understanding and demonstrable experience with modern monitoring tools such as Prometheus, Datadog, Grafana, Telegraf
  • Experience with infrastructure as code tools
  • Solid background with configuration management tools
  • Experience with using GitOps and CI to make changes, preferably Github Actions
  • Experience with messaging systems such as Kafka
  • Experience with database management
Job Responsibility
Job Responsibility
  • Play a critical role in evolving our infrastructure as we develop solutions to complex technical problems involving reliability, latency, bandwidth and most importantly security
  • Be an integral part of improving observability, monitoring and alerting throughout the platform
  • Help co-ordinate work across different areas of the company to ensure the most efficient path of execution
  • Centralize wherever possible common streams of work that are currently duplicated across developer teams
  • Focus heavily on writing tooling to replace manual, repetitive work in a scalable way
  • Work in a fast paced, and dynamic environment complementing our existing high calibre team
What we offer
What we offer
  • Full-time salary based on experience and meaningful equity in an industry-leading company
  • Hybrid model working from home & awesome office location in the heart of London
  • Unlimited vacation policy
  • work hard and take time when you need it
  • Work from Anywhere Policy: You can work remotely from anywhere in the world for up to 20 days per year
  • Apple equipment
  • The opportunity to be a key player and build your career at a rapidly expanding, global technology company in an emerging field
  • Flexible work culture
  • Fulltime
Read More
Arrow Right

Forward Deployed Engineer

As a Forward Deployed Engineer (FDE) at Virtru, you will play a pivotal role in ...
Location
Location
United States , Tampa
Salary
Salary:
190000.00 - 240000.00 USD / Year
virtru.com Logo
Virtru
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Active U.S. Secret Clearance required
  • Minimum of 5+ years of experience as a cloud engineer, demonstrating a strong understanding of SRE principles for highly scalable and reliable systems
  • Bachelor's degree in Computer Science or related field
  • Proficiency in DevSecOps practices, with experience in source code repositories and CI/CD pipeline solutions such as Team Foundation Server/Azure DevOps, Bitbucket, and GitHub
  • Expertise in Infrastructure as Code (IaC) and best practices for managing cloud infrastructure
  • Familiarity with containerization, Kubernetes (k8s) and orchestration tooling such as OpenShift, Rancher, and Helm
  • Ability to excel both independently and as part of a collaborative team
  • Effective communication and collaboration skills with the on-site customer and the support team
  • Willingness to work onsite 5 days a week in Tampa, FL.
Job Responsibility
Job Responsibility
  • Monitor platform and containerized applications to ensure optimal performance and availability
  • Identify and mitigate performance and availability risks and issues in real-time
  • Contribute to the development and optimization of core platform functions to establish a robust infrastructure
  • Collaborate closely with internal teams and government clients on a daily basis.
What we offer
What we offer
  • Flexible PTO policy
  • $1,500 annual Learning & Development Stipend
  • Home-Office Stipend
  • Internal mobility options
  • Frequent company-sponsored Team Celebrations
  • Access to an Employee Assistance Program
  • Access to Headspace, a mental health app
  • A high degree of flexibility
  • Competitive compensation
  • Generous parental, medical, and bereavement policies
  • Fulltime
Read More
Arrow Right

Forward Deployed Engineer

As a Forward Deployed Engineer (FDE) at Virtru, you will play a pivotal role in ...
Location
Location
United States , Columbia
Salary
Salary:
190000.00 - 240000.00 USD / Year
virtru.com Logo
Virtru
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Active U.S. TS/SCI Clearance required
  • Minimum of 5+ years of experience as a cloud engineer, demonstrating a strong understanding of SRE principles for highly scalable and reliable systems
  • Bachelor's degree in Computer Science or related field
  • Proficiency in DevSecOps practices, with experience in source code repositories and CI/CD pipeline solutions such as Team Foundation Server/Azure DevOps, Bitbucket, and GitHub
  • Expertise in Infrastructure as Code (IaC) and best practices for managing cloud infrastructure
  • Familiarity with containerization, Kubernetes (k8s) and orchestration tooling such as OpenShift, Rancher, and Helm
  • Ability to excel both independently and as part of a collaborative team
  • Effective communication and collaboration skills with the on-site customer and the support team
  • Willingness to work onsite 3-5 days per week in Columbia, MD
Job Responsibility
Job Responsibility
  • Monitor platform and containerized applications to ensure optimal performance and availability
  • Identify and mitigate performance and availability risks and issues in real-time
  • Contribute to the development and optimization of core platform functions to establish a robust infrastructure
  • Collaborate closely with internal teams and government clients on a daily basis
What we offer
What we offer
  • A Flexible PTO policy
  • A $1,500 annual Learning & Development Stipend
  • Home-Office Stipend
  • Internal mobility options
  • Frequent company-sponsored Team Celebrations
  • Access to an Employee Assistance Program
  • Access to Headspace, a mental health app
  • A high degree of flexibility
  • Competitive compensation
  • Generous parental, medical, and bereavement policies
  • Fulltime
Read More
Arrow Right

Staff Site Reliability Engineer

At Ledger, we are looking for an experienced Reliability Engineer to join our SR...
Location
Location
France , Paris
Salary
Salary:
Not provided
https://www.ledger.com Logo
Ledger
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years on cloud engineering at scale, on organizations operating SaaS solutions
  • Proficiency in working in Unix/Linux environments, Git, Python, Terraform, Kubernetes, AWS cloud solutions and architectures, CI/CD tools, Argocd, Ansible, configuration management, etc.
  • Strong knowledge on observability practices, with experience implementing and managing Logging, Monitoring and Alerting framework with solutions such as Datadog or Prometheus/Grafana/Loki.
  • Experience of cross-functional work and the ability to demonstrate a collaborative approach with regards to building key relationships across the organization and define projects scope, goals, plan and deliverables
  • Customer focused with the ability to identify and understand both internal and external customer's needs
  • Creative problem-solving and analysis skills with an ability to identify, develop, and implement solutions to meet the needs of the business
  • Excellent presentation and written communication
  • Ability to deal with ambiguity, high level of pressure and rapidly changing environments
  • Engineering degree.
Job Responsibility
Job Responsibility
  • Participate in building a DevOps / SRE culture and enable the transition to modern infrastructure management and deployment practices
  • Participate in building the SRE team roadmap (vision and delivery accountability). Anticipate stakeholder needs, game-changing technologies emergence and challenge scope / deadlines
  • Perform integration of platform software components
  • Participate to design and deliver solutions to improve the availability, scalability, latency, and efficiency of systems
  • Influence and create standards & best practices in support of service level objectives
  • Automate key SRE metrics including SLOs/SLAs and error budgets
  • Provide expert support to our level-2/application support team, to troubleshoot priority incidents, and conduct post-mortems
  • Apply analytics on past incidents and usage patterns to predict issues and take proactive actions
  • Ensure control of technical debt and promote quality practices
  • Follow SRE and chaos engineering approaches across all strategic systems to predict in coordination with Service Design and prevent outages and improve solution availability
What we offer
What we offer
  • Equity: Employees are the foundation of our success, and we award stock options so you can share in that success as we grow
  • Flexibility: A hybrid work policy
  • Social: Annual company outing for Ledgerdary Days, plus frequent social events, snacks and drinks
  • Medical: Comprehensive health insurance policy offering extensive medical, dental and vision care coverage
  • Well-being: Personal development, coaching & fitness with our dedicated partners
  • Vacation: Five weeks of paid leave per year, in addition to national holidays and rest & relaxation (RTT) days
  • High tech: Access to high performance office equipment and gadgets, including Apple products
  • Transport: Ledger reimburses part of your preferred means of transportation
  • Discounts: Employee discount on all our products.
  • Fulltime
Read More
Arrow Right

Database Reliability Engineer

We are committed to providing our customers with reliable and secure services at...
Location
Location
Netherlands
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA or customer facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL, particularly ClickHouse is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in Clickhouse Core to identify the root cause of problems and submit bug fixes, issue reports and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core related outages including working with support and Cloud teams to communicate to the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime
Read More
Arrow Right

Database Reliability Engineer - Core Team

We are committed to providing our customers with reliable and secure services at...
Location
Location
United Kingdom
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field
  • At least 5 years of experience in Reliability Engineering, QA or customer facing engineering
  • Previous experience operating ClickHouse or other SQL databases in production
  • Excellent understanding of distributed database internals and SQL, particularly ClickHouse is a major plus
  • Scripting experience with Shell or Python, and ability to read and understand C++ code
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
  • You are a strong problem-solver and have solid production debugging skills
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Continuously improve the reliability and performance of ClickHouse core
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers
  • Dig deeper into the most common problems encountered by customers in Clickhouse Core to identify the root cause of problems and submit bug fixes, issue reports and suggest improvements
  • Enhance and refine incident response processes and post-mortem analysis for ClickHouse core related outages including working with support and Cloud teams to communicate to the impacted customers
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right

Sre design & support engineer

We are looking for a self-driven, software engineering mindset SRE engineer to •...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
pepsico.com Logo
Pepsico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8-11 years of work experience evolving to a SRE engineer
  • 3-5 years of experience in continuously improving and transforming IT operations ways of working
  • Bachelor’s degree in Computer Science, Information Technology or a related field
  • Proven experience as an SRE in designing the events diagnostics, performance measures and alert solutions to meet the SLA/SLO/SLIs
  • The ideal Engineer will be highly quantitative, have great judgment, able to connect dots across ecosytems, and efficiently work cross-functionally across teams to ensure SRE orchestrating solutions are meeting customer/end-user expectations
  • The candidate will take a pragmatic approach resolving incidents, including the ability to systemically triangulate root causes and work effectively with external and internal teams to meet objectives
  • A strong expertise of SRE (Software Reliability Engineering) and IT Service Management (ITSM) processes with a track record for improving service offerings – pro-actively resolving incidents, providing a seamless customer/end-user experience and proactively identifying and mitigating areas of risk
  • Hands on experience in Python, SQL /No-SQl( MySQL, Mongo DB, Cassandra, Postgress), AppDynamics, ELK Stack Grafana, Splunk, Dynatrace, Kafka and any SRE Ops toolsets
  • A firm understanding of cloud archticture for distributed environments
  • Front-end technologies: HTML, CSS, JavaScript, and frameworks like React, Angular, or Vue.js
Job Responsibility
Job Responsibility
  • Engage & influence product and engineering teams during the design and development phases to embed reliability and operability into new services defining & enforce events, logging, monitoring, and observability standards across applications
  • Ensuring non-functional requirements (NFRs) are embedded early including SLA/SLO/SLI and error budgets into the product’s offerings as part of the engineering solution
  • Execute as Pro-active SRE Support engineer, preventing P1, P2, potential P3s, diagnosing any anomalies prior to any user and driving the necessary remediations across the teams involved in end-to-end ecosystem availability, performance and consumption of the cloud architected application ecosystem leveraging SRE Orchestration solutions
  • Collaborates with Engineering & support teams, including participation in escalations, , and blameless postmortems,
  • Work closely with customer-facing support teams to empower them with SRE insights and tooling
  • Observe, diagnose & improve the end-2-end ecosystem performance of the Modern architected application portfolio i.e. technical “understanding of interactions" of a full stack application alongside with peer SRE team member
  • Continuously optimize the L2/support operations work via SRE workflow automation
  • Shape the SRE orchestration platform design with inputs from Production Operations, Business usage & Product and engineering teams
  • Actively engage and drive AI Ops adoption across teams
Read More
Arrow Right