Senior Software Engineer, AI Eval

Sentry

Location:
United States, San Francisco

Contract Type:
Not provided

Salary:

240000.00 - 280000.00 USD / Year

Job Description:

As a Senior Software Engineer on Sentry’s AI/ML team, you’ll be responsible for building the evaluation infrastructure that measures the accuracy, reliability, and real-world performance of our AI systems. This role is critical to ensuring that our debugging agents and AI-powered features behave correctly, safely, and predictably as they scale. You’ll design datasets, benchmarks, and test harnesses that turn ambiguous AI behavior into measurable signals, helping the team ship AI with confidence.

Job Responsibility:

  • Design and build robust evaluation frameworks to measure accuracy, reliability, regressions, and edge cases in AI systems
  • Create and curate high-quality datasets, golden test cases, and benchmarks grounded in real production data
  • Build automated test harnesses and metrics pipelines to continuously evaluate models, prompts, and agentic workflows
  • Partner closely with applied AI engineers and product leaders to define what “good” looks like and translate it into measurable criteria
  • Own the evaluation lifecycle for major AI initiatives, from early experimentation through production monitoring

Requirements:

  • 5+ years of professional experience with a Bachelor’s degree in computer science, machine learning, or a related field
  • Experience building testing, evaluation, or data infrastructure for complex systems (AI/ML experience strongly preferred)
  • Comfort writing production-quality code (we use Python and TypeScript)
  • Experience working with structured and unstructured datasets, labeling workflows, or data quality pipelines
  • Familiarity with modern ML systems and evaluation techniques (e.g., offline metrics, online evaluation, regression testing for models or prompts)

Nice to have:

  • Experience evaluating LLMs, agentic systems, or AI-assisted developer tools

What we offer:
  • incentive compensation
  • equity grants
  • paid time off
  • group health insurance coverage

Additional Information:

Job Posted:
January 22, 2026

Employment Type:
Full-time
Work Type:
Hybrid work
Similar Jobs for Senior Software Engineer, AI Eval

Senior Software Engineer - Studio - Java, AI

As a Senior Software Engineer, you’ll build the backend that powers AI features ...
Location:
United States, New York
Salary:
175000.00 - 240000.00 USD / Year
Clear Street
Expiration Date:
Until further notice
Requirements:
  • 7+ years of strong proficiency in enterprise Java
  • Experience designing and deploying AI/ML or LLM-backed systems in production
  • Familiarity with LLM tooling and patterns (e.g., tool calling, RAG pipelines and knowledge bases, evals, cost/latency tradeoffs, basic red-teaming)
  • Experience in supporting and running systems in a production environment
  • Comfortable working in a dynamic environment, partnering with cross-functional teams, and moving from prototype to reliable production
Job Responsibility:
  • Design, implement, and productionize reliable AI workflows to augment the Studio trading platform
  • Build tooling to monitor, tune, and evaluate models and workflows, as well as applicable guardrails to ensure outputs meet quality and regulatory requirements
  • Collaborate with technical and non-technical teams across the firm to identify high-ROI AI opportunities
  • Build rapid prototypes and translate them into production-grade systems. Utilize the latest AI-powered development tools to iterate quickly
  • Create reusable libraries, SDKs and tooling to enable AI development throughout the firm
  • Stay current on the latest in applied AI. Read papers, evaluate new models, test out new tools
  • Participate in code review and architecture design, manage deployments, and support and contribute to the success of the overall Studio platform
What we offer:
  • Competitive compensation, benefits, and perks
  • Company equity
  • 401k matching
  • Gender neutral parental leave
  • Full medical, dental and vision insurance
  • Lunch stipends
  • Fully stocked kitchens
  • Happy hours
Employment Type:
Full-time

Senior Platform Engineer, AI Evaluation

We’re looking for an AI Platform Engineer to evolve and extend our internal eval...
Location:
United States, Mountain View
Salary:
137871.00 - 172339.00 USD / Year
Khan Academy
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field
  • 5 years of software engineering experience, including 2+ years working on the evaluation of generative AI systems
  • Strong programming skills in Go, Python, SQL, and at least one data pipeline framework (e.g., Airflow, Dagster, Prefect)
  • Familiarity with the architecture of large language models and their industry-standard APIs
Job Responsibility:
  • Evolve and extend our internal evaluation framework for assessing the quality of our AI-driven experiences
  • Work closely with ML data engineers and platform developers to help internal teams adopt an eval-driven development process incorporating offline benchmark tests and online experiments
  • Gather internal requirements, get buy-in for changes, and then develop documentation and training materials
What we offer:
  • Competitive salaries
  • Ample paid time off as needed
  • 8 pre-scheduled Wellness Days in 2026
  • Remote-first culture
  • Generous parental leave
  • 401(k) + 4% matching
  • Comprehensive insurance, including medical, dental, vision, and life
Employment Type:
Full-time

Senior AI Engineer - Teams Messaging AI

Are you interested in joining one of the most exciting teams and working on the ...
Location:
United States, Mountain View
Salary:
119800.00 - 234700.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility:
  • Design, implementation, and shipping of multiple new messaging and large language model (LLM) agentic features
  • Building end-to-end user experiences that work across multiple devices and browsers
  • Writing and maintaining unit tests, large language model (LLM) evals, and automated integration or end-to-end tests
  • Building web and AI applications in enterprise and/or consumer markets
  • Collaborating with partner teams to meet engineering goals
  • Managing individual projects or feature priorities, deadlines, and deliverables
Employment Type:
Full-time

Bank Care Assistant

Prime Life are on the lookout for passionate and dedicated care professionals to...
Location:
United Kingdom, Skegness
Salary:
12.45 - 12.80 GBP / Hour
360 Resourcing Solutions
Expiration Date:
Until further notice
Requirements:
  • Genuine caring nature and desire to make a real difference
  • Patience, understanding, and respect for residents’ individual abilities
  • Understanding of dementia would be beneficial
  • Prior experience as a Care Assistant or Support worker is desirable but not essential
Job Responsibility:
  • Assisting residents with personal hygiene and dressing
  • Supporting residents to dine at mealtimes and maintain good nutrition and hydration
  • Encouraging residents to mobilise safely around the home
  • Providing friendship and companionship, and accompanying residents on social outings or appointments
  • Maintaining accurate and timely written records and completing documentation
  • Being a part of a multi-disciplinary team and engaging with other care professionals
  • Welcoming family members to the home and assisting with enquiries
What we offer:
  • Opportunities to learn and progress with support of dedicated Quality Matters team
  • Fully funded DBS
  • Comprehensive Holiday Pay scheme
  • Fantastic Refer a Friend scheme, offering up to £250 per successful candidate
  • Access to Blue Light Card savings
Employment Type:
Part-time

Customer Service Assistant (Games)

Interested in working at one of the UK’s best‑loved family theme parks? Are you ...
Location:
United Kingdom, Tamworth
Salary:
8.00 - 12.21 GBP / Hour
360 Resourcing Solutions
Expiration Date:
Until further notice
Requirements:
  • Great customer service, communication skills and a positive, outgoing personality
  • Honest, reliable and a responsible individual with strong work ethic
  • A passion for hospitality and creating joyful moments
  • Ability to thrive in a fast-paced, outdoor setting all year round
  • Able to offer good availability on weekdays and weekends (including bank holidays)
  • Able to follow instructions and comply with all company standards
Job Responsibility:
  • Operate and manage a skilled games kiosk in a high-energy, theme park environment
  • Engage with guests of all ages, ensuring a fun and memorable experience
  • Maintain a warm and welcoming atmosphere throughout the event
  • Handle transactions and prize distribution with confidence and enthusiasm
What we offer:
  • Daily Bonus Scheme
  • Flexible working hours
  • Recommend A Friend Bonus (Subject to T&C's)
  • Discounts at hundreds of top retailers
  • Unlimited access to wellbeing resources such as a virtual GP service, 24/7 employee assistance programme providing professional support and advice, and a virtual gym hosting a range of free online workouts
  • Great career possibilities which include the opportunities to travel the globe (Asia, Europe, Dubai, USA, etc.) and work in some of the world’s most amazing locations
  • 20% discount in both retail and food purchases on site
  • 50% off meals at Drayton Manor Hotel
  • Free entry into sites affiliated with Drayton Manor e.g. West Midlands Safari Park & Waterworld
  • Free Entry to Drayton Manor for you and 4 friends up to 10 times in the year!

Quality Assurance Analyst

Ensuring data integrity of balances in the General Ledger, as well as, balances ...
Location:
United States of America, New York
Salary:
110000.00 - 150000.00 USD / Year
Crédit Agricole
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree / BSc or equivalent
  • Bachelor’s in Accounting
  • Minimum of 6-10 years of experience
  • Knowledge of US Regulatory Reporting
  • English language
  • Strong knowledge of regulatory reporting requirements, specifically the FR2052a and FR Y-15 reports
  • Extensive Quality Assurance experience, including developing protocols to ensure the integrity and accuracy of regulatory reporting
  • Strong understanding of regulatory reporting requirements with a proven ability to perform detailed data validation for the User Acceptance Tests
  • Proactive identification and escalation of issues/findings
  • Strong controls mindset with ability to perform root cause analysis
Job Responsibility:
  • Support the team to perform periodic reviews on various regulatory reports
  • Assist in performing end-to-end sample testing (from trade tickets / client confirms to reporting) to ensure data accuracy, data integrity, completeness, and compliance with regulatory reporting requirements
  • Take initiative to escalate findings to the manager, and communicate with various departments (e.g., Operations, Front Office)
  • Assist in executing the Quality Assurance Reviews across various source systems and reporting streams
  • Identify findings or system issues by using data from different system applications and database platforms
  • Build key relationships across Business Lines, Compliance, Internal Audit and IT functions
  • Assist in compiling and organizing the Quarterly/Monthly Status Report
  • Maintain and distribute the monthly Issue Log to the manager, and follow up closely with concerned parties on open issues
  • Assist in the ongoing streamlining of the review processes
Employment Type:
Full-time

Bricklaying Assessor Trainer

Join Colchester Institute – Where Your Career Makes a Difference. At Colchester ...
Location:
United Kingdom, Colchester
Salary:
29783.52 - 33535.32 GBP / Year
360 Resourcing Solutions
Expiration Date:
Until further notice
Requirements:
  • GCSE grades A-C including English and Maths or equivalent
  • Level 2 qualification (or equivalent) in a Brick trade / discipline
  • Assessor award (or willingness to achieve this whilst in post)
  • Minimum 3 years industry experience within the Bricklaying trade
  • Good IT skills, including Excel and Outlook
  • Good communication skills and willingness to work within a team
  • Experience of working with and training young people
Job Responsibility:
  • Train and assess candidates towards agreed qualifications
  • Maintain high expectations of learners’ work, commitment, and behaviour, taking action to ensure the highest standards are met
  • Evaluate and improve the quality of learning and teaching within your delivery utilising and engaging with the college’s development and observation programmes
  • Maintain workshop, equipment and tools to the highest standards ensuring statutory requirements are met
  • Carry out and oversee practical community or college projects ensuring high standards of work are maintained at all times
  • Communicate effectively and professionally with all staff, external and internal stakeholders, and learners
  • Ensure workshops are maintained in a safe operational condition, that all teaching and learning related materials are prepared in a timely manner, and that appropriate stock levels are maintained
What we offer:
  • full benefits
Employment Type:
Full-time

Support worker

Are you looking for a job that has purpose, something that makes you feel like y...
Location:
United Kingdom, Banbury
Salary:
12.75 - 13.05 GBP / Hour
360 Resourcing Solutions
Expiration Date:
Until further notice
Requirements:
  • A positive, can-do, and professional attitude
  • Reliable, flexible, and able to do sleepovers
  • A UK driving licence
Job Responsibility:
  • Help individuals with varying levels of personal care
  • Help with eating and drinking, shopping, household tasks
  • Facilitate fun and fulfilling activities (music, cinema, theatre trips, days out, holidays)
  • Live and breathe our values: Caring – Respectful – Honest – Ambitious – Collaborative
What we offer:
  • 30 days annual leave (including bank holidays) for full-time staff (pro-rata for part time)
  • £68 per night for sleep-ins (breakfast included)
  • Company Pension Scheme - 5% Employer Pension Contribution
  • Flexible working hours
  • Free comprehensive ongoing training, including a unique Leadership Development Programme with the ability to progress to Assistant Support Manager within 18 months
  • Employee benefits package with Perkbox (saving you up to £800 per year)
  • Recommend a friend incentive scheme for employees
  • Wellness programs
  • Company events & social hours