CrawlJobs Logo

Engineer, SRE GenAI

https://www.t-mobile.com Logo

T-Mobile

Location Icon

Location:
United States , Bellevue

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

92500.00 - 166800.00 USD / Year

Job Description:

As an Engineer in Site Reliability Engineering (SRE) for AI Systems, you will help ensure the reliability, scalability, and performance of AI platforms. This role includes participating in on-call rotations, improving system observability, and supporting operations across cloud-native infrastructure. This is a hands-on role ideal for someone with foundational SRE skills and a growth mindset to expand in GenAI and LLM infrastructure operations.

Job Responsibility:

  • Participate in on-call rotations to support AI platforms and respond to production incidents with urgency and precision
  • Monitor system health and performance using tools like Grafana, Splunk, and PowerBI
  • Support cloud-native infrastructure deployments, with a focus on Azure (primary), and exposure to AWS or GCP
  • Implement runbooks and automate repetitive operational tasks to reduce toil
  • Support CI/CD pipelines and IaC deployments using Gitlab pipelines, Databricks
  • Assist in the development and enforcement of Service Level Objectives (SLOs) and real-time alerts for AI APIs and services
  • Collaborate with senior engineers to improve platform reliability and scale LLM-based applications

Requirements:

  • Bachelor's Degree Computer Science, Engineering or a related field
  • 2–4 years of experience in DevOps, SRE, or cloud platform engineering
  • Hands-on experience with monitoring/logging systems such as Prometheus, Grafana, Splunk, or OpenSearch
  • Familiarity with cloud environments (preferably Azure
  • AWS/GCP a plus)
  • Experience in scripting or automation using Python, Bash, or PowerShell
  • Basic understanding of containerization (Docker, Kubernetes) and CI/CD concepts
  • Willingness to participate in an on-call schedule and incident resolution
  • Strong solving and root cause analysis skills
  • Communication
  • Customer Service
  • Analytics
  • Technical Writing
  • At least 18 years of age
  • Legally authorized to work in the United States

Nice to have:

  • Exposure to AI/ML infrastructure or LLM-based systems (e.g., OpenAI, ChatGPT, Azure OpenAI)
  • Experience with infrastructure-as-code tools like Terraform or ARM templates
  • Familiarity with LLM observability or API token usage metrics
  • Passion for learning AI reliability practices and collaborating with cross-functional teams
What we offer:
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Annual bonus or periodic sales incentive or bonus
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off and up to 12 paid holidays
  • Paid parental and family leave
  • Family building benefits
  • Back-up care
  • Enhanced family support
  • Childcare subsidy
  • Tuition assistance
  • College coaching
  • Short- and long-term disability
  • Voluntary AD&D coverage
  • Voluntary accident coverage
  • Voluntary life insurance
  • Voluntary disability insurance
  • Voluntary long-term care insurance
  • Mobile service & home internet discounts
  • Pet insurance
  • Access to commuter and transit programs

Additional Information:

Job Posted:
December 27, 2025

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Engineer, SRE GenAI

New

Senior AI Site Reliability Engineer

At Schwab, you will build a rewarding career while making a difference in the li...
Location
Location
United States , San Francisco
Salary
Salary:
190000.00 - 270000.00 USD / Year
schwab.com Logo
Charles Schwab
Expiration Date
January 20, 2026
Flip Icon
Requirements
Requirements
  • 8+ years of software development or reliability engineering experience, with 4+ years as a hands-on senior engineer in startups and/or large organizations
  • Bachelor’s degree in Computer Science or related field
  • 5+ years of experience building and operating complex products from scratch and running them in production
  • 3+ years of experience supporting applications that use Artificial Intelligence (AI) models to deliver real business impact
  • 3+ years of experience building and maintaining data pipelines and infrastructure for large datasets
  • 3+ years of experience with containers and cloud-native applications, and the ability to operationalize them in the public cloud with infrastructure as code
  • Experience implementing monitoring, alerting, and incident response for large-scale distributed systems
  • Proven track record in driving reliability, scalability, and performance improvements for production AI systems
Job Responsibility
Job Responsibility
  • Design, implement, and manage the reliability and operational excellence of GenAI applications and platforms
  • Work closely with architects, engineers, and business leaders to align reliability practices with Schwab’s enterprise strategy
  • Mentor and coach junior engineers, helping to build strong operational practices and foster a culture of continuous improvement
  • Lead by example in solving complex reliability challenges, advancing SRE standards, and driving rapid iteration from concept to production
What we offer
What we offer
  • 401(k) with company match and Employee stock purchase plan
  • Paid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions
  • Paid parental leave and family building benefits
  • Tuition reimbursement
  • Health, dental, and vision insurance
  • Bonus or incentive opportunities
  • Fulltime
Read More
Arrow Right

Senior DevOps Engineer (GCP)

Our client is a global UK-based financial services and investment banking organi...
Location
Location
Salary
Salary:
Not provided
n-ix.com Logo
N-iX
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in DevOps, Cloud Engineering, or SRE roles
  • Strong hands-on experience with Google Cloud Platform, including: GKE / Kubernetes, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage, VPC, IAM, networking, security
  • Expertise in Terraform, Helm, or other IaC tools
  • Experience building CI/CD pipelines (GitHub Actions, GitLab CI, CircleCI, Jenkins, etc.)
  • Strong understanding of containerization and orchestration: Docker, Kubernetes
  • Solid experience with monitoring, observability, and logging stacks
  • Familiarity with networking, load balancing, security hardening, and zero-trust principles
  • Experience supporting production systems in high-availability, distributed environments
  • Strong scripting skills (Python, Bash, or similar)
  • Experience working with agile engineering teams
Job Responsibility
Job Responsibility
  • Design, implement, and maintain cloud infrastructure on Google Cloud (GKE, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage)
  • Build and optimize CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar)
  • Develop infrastructure-as-code using Terraform or similar tools
  • Set up and maintain container orchestration (Kubernetes, GKE) and automated deployment workflows
  • Implement monitoring, alerting, and observability using tools such as Prometheus, Grafana, ELK/Elastic, Stackdriver, or OpenTelemetry
  • Ensure compliance with security and governance standards across all environments
  • Collaborate closely with engineering teams to ensure scalable, high-performance deployment architectures
  • Support AI/ML and GenAI workloads (Vertex AI pipelines, model hosting, GPU workloads, inference optimization)
  • Manage environment strategies, release pipelines, configuration management, and secrets management
  • Optimize cloud costs and recommend improvements for performance and reliability
What we offer
What we offer
  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits
Read More
Arrow Right

Distinguished Technologist, Deep Learning

Joining our HPE Hybrid Cloud team and working as part of our OpsRamp team is a c...
Location
Location
United States , San Jose
Salary
Salary:
164500.00 - 398500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of relevant experience in the industry delivering technical and business strategy at an advanced/strategist level
  • Master's, or PhD degree in Computer Science, Information Systems, Engineering, or equivalent
  • At least 4 years of hands-on expertise in defining, building, training and / or optimizing foundational deep learning models at scale in PyTorch, HF and other ML frameworks and libraries
  • Experience and/or deep understanding in various deep learning architectures like CNNs, GNNs, Transformers, Reinforcement Learning etc. is a strong advantage
  • Strong hands-on experience/understanding in pre-training, fine-tuning, distilling, aligning open-source large language models and have them complement the in-house foundational models
  • Hands-on experience developing multi-agent applications around a mixture of in-house and open-source models while leveraging latest in RAG and Prompt Engineering tooling techniques
  • Strong customer focus and obsession with improving service availability/performance and user experience/consumption using measurable SRE metrics
  • Must have a track record of working alongside other engineering teams architecting, building, and deploying mission-critical, highly distributed, large-scale SaaS applications
  • Must have strong knowledge of application failure modes, resiliency patterns, and techniques to enable robust, self-healing architecture
  • Effective technical leadership skills to influence diverse groups to move toward common goals/strategies
Job Responsibility
Job Responsibility
  • Oversee build of OpsRamp’s CoPilot for Autonomous Operations for the Hybrid Cloud
  • Understand latest in GenAI/ML for ITOM
  • Understand cloud-native architecture concepts and have knowledge of best practices for high availability, scalability, resilience, performance, and security requirements in the cloud
  • Act as a cross-functional product and technical expert for GenAI within engineering with close working relationships with customers, product management, support, and marketing supporting edge-to-cloud services offering
  • Provides consultation, design input, and feedback for product development and design reviews across multiple organizations and architectures
  • Help transition proof-of-concept implementations into R&D teams to accelerate new product delivery
  • Creates technical content such as designs, specifications, and initial software implementations
  • Guides and mentors less-experienced staff members to set an example of software systems design and development innovation and excellence, helping to grow engineers into more senior technical roles
  • Collect product feedback from field interactions to provide input into Engineering and Product Management to influence product roadmap direction
  • Maintain a high level of knowledge of OpsRamp SaaS product and product road maps, as well as that of the competition and prospective strategic partners
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Distinguished Technologist, Cloud Development (AI/ML)

Joining our HPE Hybrid Cloud team and working as part of our OpsRamp team is a c...
Location
Location
United States , San Jose
Salary
Salary:
164500.00 - 398500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of relevant experience in the industry delivering technical and business strategy at an advanced/strategist level
  • Master's, or PhD degree in Computer Science, Information Systems, Engineering, or equivalent
  • At least 4 years of hands-on expertise in defining, building, training and / or optimizing foundational deep learning models at scale in PyTorch, HF and other ML frameworks and libraries
  • Experience and/or deep understanding in various deep learning architectures like CNNs, GNNs, Transformers, Reinforcement Learning etc. is a strong advantage
  • Strong hands-on experience/understanding in pre-training, fine-tuning, distilling, aligning open-source large language models and have them complement the in-house foundational models
  • Hands-on experience developing multi-agent applications around a mixture of in-house and open-source models while leveraging latest in RAG and Prompt Engineering tooling techniques
  • Strong customer focus and obsession with improving service availability/performance and user experience/consumption using measurable SRE metrics
  • Must have a track record of working alongside other engineering teams architecting, building, and deploying mission-critical, highly distributed, large-scale SaaS applications
  • Must have strong knowledge of application failure modes, resiliency patterns, and techniques to enable robust, self-healing architecture
  • Effective technical leadership skills to influence diverse groups to move toward common goals/strategies
Job Responsibility
Job Responsibility
  • Lead strategy and innovation across OpsRamp’s Intelligent Observability portfolio
  • Champion HPE OpsRamp’s position with HPE customers and GTM partners externally and HPE internal cross-functional stakeholders
  • Drive technical strategy for emerging GenAI trends across Hybrid Observability and AIOps for cloud-scale modern applications
  • Design and introduce new products to the market
  • Provide consultation, design input, and feedback for product development and design reviews
  • Transition proof-of-concept implementations into R&D teams to accelerate new product delivery
  • Guide and mentor less-experienced staff members.
What we offer
What we offer
  • Health and wellbeing benefits
  • Career development programs
  • Diversity, inclusion, and belonging initiatives.
  • Fulltime
Read More
Arrow Right
New

Assistant Special Educational Needs Coordinator

Are you an experienced and highly motivated Assistant SENCO passionate about mak...
Location
Location
United Kingdom , Guildford, Surrey
Salary
Salary:
27713.00 - 30250.00 GBP / Year
https://www.randstad.com Logo
Randstad
Expiration Date
February 21, 2026
Flip Icon
Requirements
Requirements
  • Significant experience working with students with SEN, particularly those with autism
  • Strong working knowledge and understanding of the SEND Code of Practice
  • GCSE Maths & English (Grade 4/C or equivalent)
  • SENCO qualification or Access Arrangement Assessor qualification (preferred)
  • A formal teaching qualification (preferred)
  • Ability to track progression in attainment
  • Background in youth work
  • Behaviour management
  • Building relationships
  • Classroom management
Job Responsibility
Job Responsibility
  • Leading Annual Reviews: Chairing and managing the annual review process for Education, Health and Care Plans (EHCPs)
  • Stakeholder Communication: Building and maintaining strong relationships with parents, external professionals, and staff
  • Professional Development: Delivering impactful SEN training to staff
  • Strategic Support: Assisting the SENCO in developing and implementing whole-school strategies for special educational needs
What we offer
What we offer
  • Enhanced Pension Scheme & Life Assurance
  • Professional Growth: Excellent opportunities for professional development
  • Employee Assistance Programme (EAP)
  • Referral Bonus
  • Fulltime
Read More
Arrow Right
New

Account Executive, Business Sales

The Account Executive, Business Sales role at T-Mobile is designed for ambitious...
Location
Location
United States , West Sacramento
Salary
Salary:
71700.00 - 129500.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • High School Diploma/GED (Required)
  • Bachelor's Degree (Preferred)
  • 1+ years verifiable new customer acquisition sales experience, preferably within a commissioned environment (Preferred)
  • Outside B2B sales experience. (Preferred)
  • Task Management Ability to work well in a dynamic, fast changing environment that requires a high degree of multi-tasking (Required)
  • Customer Service Demonstrated experience delivering superior customer service and attention to detail (Required)
  • Communication Excellent interpersonal, written, and oral communication skills (Required)
  • Negotiation Effective negotiating and closing skills, including communication, emotional intelligence, and problem-solving. (Required)
  • At least 18 years of age
  • Legally authorized to work in the United States
Job Responsibility
Job Responsibility
  • Lead Generation: Generate and work leads through prospecting, cold calling, and networking under sales manager supervision
  • Customer Needs: Identify customer needs and use solution-based selling to demonstrate T-Mobile’s value. Recommend wireless solutions, including price plans, data services, handsets, and accessories
  • Deal Negotiation: Negotiate and close deals
  • Skill Development: Develop skills in prospecting, call execution, and relationship management with leadership. Participate in product training and sales meetings
  • Sales Approaches: Create effective sales approaches, solutions, and proposals
  • Sales Automation: Utilize sales force automation, manage sales funnel, and report on sales activities and forecasts
  • Customer Base: Maintain and grow the customer base within a territory model
What we offer
What we offer
  • medical, dental and vision insurance
  • flexible spending account
  • 401(k)
  • employee stock grants
  • employee stock purchase plan
  • paid time off
  • up to 12 paid holidays
  • paid parental and family leave
  • family building benefits
  • back-up care
  • Fulltime
Read More
Arrow Right
New

Cleaner

Join our housekeeping team as a cleaner for a career with a little more shine! N...
Location
Location
United Kingdom , Lower Hyde, Shanklin
Salary
Salary:
12.73 GBP / Hour
parkdeanresorts.co.uk Logo
Parkdean Resorts
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • No experience required
  • Passion
  • Positivity
  • Parkdean team spirit
Job Responsibility
Job Responsibility
  • Make holiday homes shine with top-notch cleaning
  • Stay on top of your workload to hit cleaning targets
  • Team up with the Accommodation Supervisor to wow guests and boost feedback scores
  • Use cleaning materials safely, following COSHH guidelines
What we offer
What we offer
  • Flexible shift patterns
  • Development and training opportunities
  • Employee Assistance Programme with 24/7 confidential helpline
  • 50% discount for employee and 25% discount for friends/family on holidays
  • 30% team member discount on food, drinks, and leisure activities
  • Discounts on brands (e.g., Hello Fresh, local gyms)
Read More
Arrow Right
New

Digital Product Manager

Our client, a well established retail business is currently looking to recruit a...
Location
Location
United Kingdom , London
Salary
Salary:
50000.00 GBP / Year
blu-digital.co.uk Logo
Blu Digital
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • experience working with a variety of stakeholders
  • knowledge of software development
  • knowledge of ux/cx
  • experience liaising with web developers
Job Responsibility
Job Responsibility
  • ownership of the entire process from meeting with key stakeholders to work through their roadmap
  • assessing overall performance and user experience of our sites
  • analysing website traffic and user behaviour to advise and assist on future roadmaps
Read More
Arrow Right
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.