CrawlJobs Logo

Principal Site Reliability Engineer (AI-first SRE)

groupon.com Logo

Groupon

Location Icon

Location:
Peru

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Groupon is modernizing its global platform — and reliability is at the center of that transformation. We’re looking for a Principal Site Reliability Engineer to lead the evolution from reactive maintenance to predictive, AI-driven resilience. You’ll design intelligent, self-healing systems that prevent incidents before they happen, ensuring our customers enjoy fast, secure, and reliable experiences across millions of daily interactions.

Job Responsibility:

  • Architect and maintain self-healing systems with 99.9%+ availability targets
  • Use AI/ML to automate infrastructure governance and detect configuration or IaC anti-patterns
  • Implement adaptive SLIs/SLOs that evolve automatically from real-time data
  • Build AIOps-based observability and auto-remediation pipelines
  • Apply predictive modeling to forecast failures before they impact users
  • Lead chaos, performance, and resilience testing programs
  • Map platform and service behavior to revenue impact and drive improved revenue resilience through better infrastructure performance
  • Mentor engineers and drive reliability standards across teams
  • Partner with platform, data, and product teams to ensure stability aligns with business goals
  • Support major incident response, incident review, and participate in on-call rotations

Requirements:

  • 10+ years in software/systems engineering, including 5+ years in SRE or platform reliability
  • Strong experience with GCP (preferred) or AWS, Kubernetes, and Terraform
  • Proficiency in Python or Go for automation and tooling
  • Deep understanding of observability stacks (Prometheus, Grafana, OpenTelemetry) and service meshes (Istio, Envoy)
  • Hands-on AIOps experience: anomaly detection, predictive analytics, ML-assisted operations
  • Strong communication and influencing skills — data over hierarchy

Nice to have:

  • Experience with MLOps or large-scale data infrastructure
  • Exposure to FinOps or cloud cost optimization
  • Previous leadership of global incident response or SRE transformation programs
What we offer:
  • The opportunity to work with cutting-edge technologies in a transformative environment
  • Professional growth and leadership development pathways tailored to your aspirations
  • A chance to leave a lasting impact by shaping the future of reliable and scalable systems

Additional Information:

Job Posted:
December 09, 2025

Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Principal Site Reliability Engineer (AI-first SRE)

Principal Site Reliability Engineer (AI-first SRE)

Groupon is modernizing its global platform — and reliability is at the center of...
Location
Location
Salary
Salary:
Not provided
groupon.com Logo
Groupon
Expiration Date
Until further notice
Requirements
Requirements
  • 10+ years in software/systems engineering, including 5+ years in SRE or platform reliability
  • Strong experience with GCP (preferred) or AWS, Kubernetes, and Terraform
  • Proficiency in Python or Go for automation and tooling
  • Deep understanding of observability stacks (Prometheus, Grafana, OpenTelemetry) and service meshes (Istio, Envoy)
  • Hands-on AIOps experience: anomaly detection, predictive analytics, ML-assisted operations
  • Strong communication and influencing skills — data over hierarchy
Job Responsibility
Job Responsibility
  • Architect and maintain self-healing systems with 99.9%+ availability targets
  • Use AI/ML to automate infrastructure governance and detect configuration or IaC anti-patterns
  • Implement adaptive SLIs/SLOs that evolve automatically from real-time data
  • Build AIOps-based observability and auto-remediation pipelines
  • Apply predictive modeling to forecast failures before they impact users
  • Lead chaos, performance, and resilience testing programs
  • Map platform and service behavior to revenue impact and drive improved revenue resilience through better infrastructure performance
  • Mentor engineers and drive reliability standards across teams
  • Partner with platform, data, and product teams to ensure stability aligns with business goals
  • Support major incident response, incident review, and participate in on-call rotations
What we offer
What we offer
  • The opportunity to work with cutting-edge technologies in a transformative environment
  • Professional growth and leadership development pathways tailored to your aspirations
  • A chance to leave a lasting impact by shaping the future of reliable and scalable systems
Read More
Arrow Right
New

Physical Therapist Assistant

As a Physical Therapist Assistant (PTA), you will work under the direction and s...
Location
Location
United States , Delta
Salary
Salary:
Not provided
tietalent.com Logo
TieTalent
Expiration Date
Until further notice
Requirements
Requirements
  • Current licensure or certification as a Physical Therapist Assistant (PTA) in the state of practice
  • Graduation from an accredited Physical Therapist Assistant program
  • Previous experience in long-term care, skilled nursing, or rehabilitation preferred
  • Strong communication, documentation, and organizational skills
  • Compassion and commitment to improving residents’ quality of life
Job Responsibility
Job Responsibility
  • Carry out treatment plans established by the supervising Physical Therapist
  • Assist residents with therapeutic exercises, gait training, and functional mobility activities
  • Monitor residents’ responses to therapy and promptly report changes to the Physical Therapist
  • Educate residents and families on exercises, techniques, and use of assistive devices
  • Document daily treatments, progress notes, and outcomes accurately and timely in accordance with regulatory and company standards
  • Collaborate with physical therapists, occupational therapists, speech-language pathologists, nurses, and other team members to deliver coordinated care
  • Maintain compliance with all applicable state and federal regulations
What we offer
What we offer
  • Competitive Pay & Benefits: Hourly or per-visit rate commensurate with experience, plus medical, dental, vision, generous Paid Time Off, holidays, 401(k), and more
  • Career Growth: We’re a growing company with opportunities for advancement and company-sponsored training. Tuition reimbursement and ongoing learning opportunities are available
  • Flexible Schedules: Full-time, part-time, and PRN options
  • Supportive Team Culture: Work alongside experienced therapists, nurses, and caregivers in a collaborative environment
Read More
Arrow Right
New

Director Technology Consultancy, AI

The Director Technology Consultancy – AI is responsible for leading the strategi...
Location
Location
China , Shanghai
Salary
Salary:
Not provided
adidas.com Logo
Adidas
Expiration Date
Until further notice
Requirements
Requirements
  • Strong leadership and ability to drive cross-functional alignment with excellent communication skills
  • Problem-solving and resilience in fast-paced, ambiguous environments
  • Collaborative and adaptable mindset, with strong stakeholder management and influencing skills
  • Solid understanding of enterprise IT platforms and AI technologies (e.g., machine learning, GenAI) and experience in leading AI product development and deployment, including data governance and performance measurement
  • Ability to translate business needs into technical requirements and impactful AI use cases
  • Bachelor’s or Master’s degree in Computer Science, Business Administration, or a related field (or equivalent experience)
  • Minimum 10 years in IT or digital business environments, with experience leading AI, digital transformation, or innovation projects
  • Proven track record in global project execution and contribution to strategic initiatives
Job Responsibility
Job Responsibility
  • Own the AI solution roadmap and delivery milestones across business domains. Align AI initiatives with company goals and market needs
  • Work with business leaders to identify high-impact AI opportunities. Oversee the design, development, and deployment of AI solutions that deliver measurable business value
  • Foster a culture of experimentation, continuous learning, and responsible AI adoption. Lead cross-functional teams and promote AI knowledge sharing across the organization
  • Analyze business processes and drive optimization / transformation through AI enabled solutions, ensuring alignment between business, data, and technology teams
  • Engage with internal and external stakeholders, including senior leadership, global/local AI governance teams, and technology partners to communicate priorities and share best practices
  • Understand the AI technology development trend and provide expert advice to leadership team regarding future opportunity and focus area
  • Fulltime
Read More
Arrow Right
New

Airport Program Engineer

Join a team that is enthusiastic about aviation! The Airport Programming Section...
Location
Location
United States , Madison; Wisconsin Rapids; Green Bay; Rhinelander; Milwaukee
Salary
Salary:
83304.00 - 112237.00 USD / Year
wisconsin.gov Logo
State of Wisconsin
Expiration Date
Until further notice
Requirements
Requirements
  • Graduated from an Engineering Accreditation Commission (EAC) ABET accredited college or university with a 4-year bachelor's degree (or Master's or PH.D.) in engineering OR possess a PE license (registration as a Professional Engineer in the State of Wisconsin OR a valid Professional Engineering registration and be able to obtain a Wisconsin PE registration within 3 months if PE registration is from another state)
  • At least 4 years of professional transportation engineering project experience in the construction or design phases. Experience may include participating as a team member or leading transportation projects in design/construction, providing fieldwork guidance, providing technical consultation, tracking and reporting project progress, etc.
  • Registration as a Professional Engineer in the State of Wisconsin OR a valid Professional Engineering registration and be able to obtain a Wisconsin PE registration within 3 months if PE registration is from another state (for Advanced level)
Job Responsibility
Job Responsibility
  • Develops and manages the state airport improvement program and the federal Airport Capital Improvement Program
  • Provides consultation and advice to Wisconsin county and municipal governments, planning authorities, industry representatives, other aviation organizations and WisDOT management on airport programming and development funding
  • Analyzing and preparing responses to the reauthorization of the Federal Aviation Administration and its airport development laws, regulations, programs and procedures as well as potential changes to state laws and programs relating to airport funding to derive the maximum benefit to the state’s system of airports and its aviation industry
What we offer
What we offer
  • Sign-on bonus of $3,000
  • Substantial leave time including at least 3.5 weeks of paid leave time to start
  • 9 paid holidays
  • 130 hours of sick time that roll over each year
  • Excellent and affordable health, vision, and dental benefits (health plan options start at just $49/month for single plans and $122/month for family plans after two months of employment)
  • Casual office atmosphere
  • Flexible work schedules
  • Telework options
  • Exceptional pension plan with employer match and lifetime retirement payment
  • Optional tax advantaged 457 retirement savings plan
  • Fulltime
Read More
Arrow Right
New

Quality Manager

You have an exciting opportunity to join a market leading, increasingly busy, re...
Location
Location
United Kingdom , Birmingham
Salary
Salary:
65000.00 - 75000.00 GBP / Year
kentonblack.com Logo
Kenton Black
Expiration Date
Until further notice
Requirements
Requirements
  • Demonstrable experience in site safety on residential construction or related sectors/projects
  • H&S Diploma-qualified
  • A member of IOSH
  • Experienced in residential construction or a similar sector
  • Excellent communication skills
  • Strives for personal and professional development
  • A people person able to build relationships
Job Responsibility
Job Responsibility
  • Providing SHE support multi-site
  • Coaching and mentoring site personnel to improve output
  • Working proactively to identify potential risks
  • Monitoring and reporting Health and Safety performance
What we offer
What we offer
  • Plus car/car allowance and package
  • bonus
  • healthcare
  • Fulltime
Read More
Arrow Right
New

Support Manager

IT Support Manager required for by High End Retailer based in Central London 5 d...
Location
Location
United Kingdom , London
Salary
Salary:
65000.00 GBP / Year
Social Value Portal Ltd
Expiration Date
Until further notice
Requirements
Requirements
  • Proven experience leading small technical teams (1-5 staff) in a hands-on capacity
  • Deep experience with ServiceNow or similar enterprise-grade helpdesk tools
  • Strong knowledge of iOS/iPhone management and JAMF
  • Microsoft Windows, Active Directory, and Office 365 (Exchange/SharePoint)
  • Fundamental understanding of TCP/IP networking
  • Exceptional interpersonal skills suited for a high-net-worth/VIP environment
Job Responsibility
Job Responsibility
  • Supervise a small, busy helpdesk
  • manage workloads, ticket allocation, and performance reviews
  • Provide Level 1 & 2 support, acting as the primary escalation point for complex technical incidents
  • Oversee the deployment and maintenance of the iOS/iPhone estate via JAMF/Intune
  • Manage the CMDB and helpdesk systems to produce regular statistics and trend reports
  • Maintain IT documentation, SOPs, and knowledgebase articles to ensure high service availability
What we offer
What we offer
  • Bonus scheme
  • amazing benefits
  • Fulltime
Read More
Arrow Right
New

Network Engineer, Foundation & Support

Meta is looking for a forward thinking Network Operations Engineer to support de...
Location
Location
Singapore
Salary
Salary:
Not provided
meta.com Logo
Meta
Expiration Date
Until further notice
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 7+ years of work experience operating, deploying, and designing large-scale network, optical, and/or physical layer infrastructure
  • Hands-on experience with a variety of optical/network communication equipment including optical amplifiers, coherent transponders, network/fiber/copper test gear and other WDM equipment as well as associated tools and test equipment
  • 7+ years of network operations experience while supporting large-scale service provider, datacenter, and/or enterprise network infrastructure
  • Communication experience: demonstrated skills to effectively engage with hardware vendors, service providers, and colocation facility vendors globally and effectively translate performance issues into technical engineering requirements
  • Experience in providing technical guidance to external vendors
  • Intermediate subject matter expertise in Network Services, including topology, traffic analysis, hardware platform, and architectures
  • 7+ years of experience with telecommunication networks in the terrestrial optical transport networks including DWDM, network infrastructures, optical testing, optical measurement principles, structured cabling and experience with photonic layer
  • Subject matter expertise with physical infrastructure design: rack elevations, cable types, connector types, optic types, patch panels, and facility infrastructure
  • Familiarity with Enterprise and Service Provider network hardware platforms and architectures, including Cisco and Juniper routers/switches and Nexus data center switching hardware platforms
Job Responsibility
Job Responsibility
  • Incident Response: Drive work investigating complex technical and process issues on a global scale spanning multiple reliability, security, and continuity disciplines for infra during major incidents/site events (SEVs) on edge, caching, and network infrastructure
  • Change Management: You will be integral to identifying problems and implementing effective change within security & business continuity issues that affect the network, edge, and Infrastructure at Meta
  • Operational Experience: As an operations practitioner within the ENS team you will be expected to drive operational efficiency in everything we do
  • Team Leadership: Set team goals and work globally with cross-functional teams within and outside the organization to deliver business outcomes predictability across global sites
  • Risk Management: Work with partner teams to design and implement aligned processes that identify and manage data and asset protection risks (i.e, security and privacy gaps, Business Continuity vulnerabilities), as well as operations continuity issues across the network
  • Information and Data Assurance: Ensure relevant operational process, procedure, and policy documentation is effectively managed and the data required to support operations is complete and accurate in systems
  • Automation: Be heavily involved in driving the team to analyze operational events in order to identify new automation opportunities
  • Data Measurement: As an operations practitioner supporting our network, you will be expected to drive quality into the metrics we report to assist us in focusing on the areas that give us the best return on investment
  • Communication: provide clear and effective communication around personal and team goals, progress, outcomes, and lessons learned across assigned scope
  • Travel: International and Domestic travel may be required 25% of the time and up to 50% of time depending on needs of the business
Read More
Arrow Right
New

Field Reimbursement Manager

Join Amgen’s Mission of Serving Patients. At Amgen, if you feel like you’re part...
Location
Location
United States , Houston
Salary
Salary:
155968.00 - 173578.00 USD / Year
amgen.com Logo
Amgen
Expiration Date
Until further notice
Requirements
Requirements
  • Doctorate degree AND 2 years of experience in the public or private third-party access arena or pharmaceutical industry in managed care, clinical support, and/or sales
  • Master’s degree AND 6 years of experience in the public or private third-party access arena or pharmaceutical industry in managed care, clinical support, and/or sales
  • Bachelor’s degree AND 8 years of experience in the public or private third-party access arena or pharmaceutical industry in managed care, clinical support, and/or sales
  • Associate degree AND 10 years of experience in the public or private third-party access arena or pharmaceutical industry in managed care, clinical support, and/or sales
  • Bachelor's degree in business, healthcare, or a related field
  • 6 years' experience with specialty/biologic self-injectable (pharmacy benefit) or physician-administered (buy and bill/medical benefit) products
  • Advanced knowledge of medical insurance terminology
  • Knowledge of Centers of Medicare & Medicaid Services (CMS) policies and processes with expertise in Medicare (Part B – for buy & bill products and Part D for Pharmacy products)
  • Ability to manage ambiguity and problem-solve
  • Ability to manage expenses within allocated budgets
Job Responsibility
Job Responsibility
  • Manage defined accounts within a specified geographic region for Patient Access and Reimbursement
  • Support products by executing the collaborative territory strategic plan
  • Ensure an understanding of the reimbursement process, field reimbursement services, and patient support programs
  • Work on patient-level reimbursement issue resolution, requiring knowledge and experience with patient health information (PHI)
  • Act as an extension of the HUB, providing live one-on-one coverage support
  • Offer assistance from physician order to reimbursement, supporting the entire reimbursement journey through payer prior authorization to appeals/denials requirements and forms
  • Review patient-specific information in cases where the site has specifically requested assistance resolving any issues or coverage challenges
  • Educate and update healthcare providers (HCPs) on key private and public payer coverage and changes that impact patient product access
  • Coordinate access/reimbursement issues with relevant partners, including the HUB
  • Provide information to HCPs on how the products are covered under the benefit design (Commercial, Medicare, Medicaid)
What we offer
What we offer
  • A comprehensive employee benefits package, including a Retirement and Savings Plan with generous company contributions, group medical, dental and vision coverage, life and disability insurance, and flexible spending accounts
  • A discretionary annual bonus program, or for field sales representatives, a sales-based incentive plan
  • Stock-based long-term incentives
  • Award-winning time-off plans
  • Flexible work models, including remote and hybrid work arrangements, where possible
  • Fulltime
Read More
Arrow Right