This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Sr. Integration Lead on the GM Motorsports Data Science team is responsible for the reliability and day-to-day operation of key software applications used across NASCAR, IndyCar, and other GM racing programs. This role focuses on maintaining software, troubleshooting issues, and ensuring application uptime for tools supporting race engineering, data science, and operations. You will work closely with data scientists, software engineers, race engineers, and IT to keep critical applications stable, performant, and usable in time-sensitive motorsports environments.
Job Responsibility:
Own the operational health of key motorsports applications (data pipelines, analytics tools, web services, dashboards, race-weekend tools)
Monitor application availability, performance, and error rates
respond quickly to incidents and degradations
Implement and maintain alerting, logging, and observability to detect and diagnose issues early
Serve as a primary responder for production issues, including off-hours support during critical events (e.g., race weekends, major tests)
Troubleshoot and resolve application, configuration, and data issues across the stack in coordination with development teams and infrastructure/IT
Perform root cause analysis, document findings, and drive follow-up actions to prevent recurrence
Coordinate and execute routine maintenance, upgrades, and patches for applications and supporting components
Partner with software engineering and data science teams to plan and validate releases, including smoke tests, rollback plans, and change management
Maintain and improve runbooks, deployment guides, and operational documentation
Provide tier-2/3 support for motorsports users, including data scientists, engineers, and program staff
Triage and resolve user-reported issues, escalating to development or infrastructure teams when needed
Capture and relay user feedback and recurrent pain points into backlog items and improvement plans
Identify and implement automation to reduce manual operations (scripts, tooling, self-service workflows)
Contribute to SLA/SLO definitions and help track adherence for critical motorsports applications
Partner with teams to design systems for reliability, scalability, and supportability from the outset
Work closely with data science, software engineering, DevOps, and motorsports IT to ensure alignment on environments, dependencies, and support boundaries
Communicate clearly and effectively with technical and non-technical stakeholders, especially during incidents and race events
Requirements:
Bachelor’s degree in Computer Science, Information Systems, Engineering, or related field
or equivalent practical experience
Typically 3–5+ years in an application support, SRE, DevOps, or related operations role supporting production systems
Strong experience supporting applications built with Python and/or .NET/Java, web services, and APIs
Proficiency with Linux-based environments, shell scripting, and basic networking concepts
Hands-on experience with monitoring and logging tools (e.g., Grafana, Prometheus, ELK/EFK, App Insights, Datadog or similar)
Familiarity with SQL and working with relational databases
ability to diagnose basic data and query issues
Experience with cloud platforms (Azure preferred) and containerized deployments (Docker, Kubernetes or similar)
Understanding of CI/CD pipelines and release processes
Strong problem-solving and debugging skills under time pressure
Clear, structured communication, especially in incident scenarios and with non-technical stakeholders
Demonstrated ability to own problems end-to-end and follow through on operational improvements
Nice to have:
Experience supporting data-intensive or analytics applications (dashboards, ML services, ETL/ELT pipelines)
Exposure to motorsports, automotive, or real-time operational environments where uptime is critical
Experience with ITIL or similar service-management practices (incident, problem, and change management)
Familiarity with infrastructure-as-code (e.g., Terraform, ARM/Bicep) and configuration management tools