CrawlJobs Logo

Research Engineer, Frontier Evals & Environments - Finance

openai.com Logo

OpenAI

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

205000.00 - 380000.00 USD / Year

Job Description:

The Frontier Evals team builds north star model evaluations to drive progress towards safe AGI/ASI. This team builds ambitious evaluations to measure and steer our models, and creates self-improvement loops to steer our training, safety, and launch decisions. Some of the team's open-sourced evaluations include SWE-bench Verified, MLE-bench, PaperBench, and SWE-Lancer, and the team built and ran frontier evaluations for GPT4o, o1, o3, GPT 4.5, ChatGPT Agent, and GPT5. If you are interested in feeling firsthand the fast progress of our models, and steering them towards good, this is the team for you.

Job Responsibility:

  • Identify important model capabilities, skills, and behaviors that are crucial to financial workflows, and design methods to quantify performance in these areas
  • Own and pursue a research agenda to identify an important model capability (especially as it relates to financial reasoning) and build evals to measure it
  • Continuously refine evaluations of frontier AI models to assess the extent of frontier capabilities

Requirements:

  • Strong engineering and statistical analysis skills (with at least 2-3 years of full-time technical experience)
  • Passionate about evals for real world applications and knowledge work
  • Detail-oriented and thorough
  • Team player / willing to do a variety of tasks to move the team forward
  • Passionate and knowledgeable about AGI/ASI measurement
  • Able to operate effectively in a dynamic and extremely fast-paced research environment as well as scope and deliver projects end-to-end

Nice to have:

  • An ability to work cross-functionally
  • Excellent communication skills
What we offer:
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Relocation support for eligible employees
  • Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided
  • Offers Equity

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Engineer, Frontier Evals & Environments - Finance

New

AI Architect

We’re hiring an AI Architect to sit at the intersection of frontier AI research,...
Location
Location
United States , San Francisco; New York
Salary
Salary:
201600.00 - 241920.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep technical background in applied AI/ML: 5–10+ years in research, engineering, solutions engineering, or technical product roles working on LLMs or multimodal systems, ideally in high-stakes, customer-facing environments
  • Hands-on experience with model improvement workflows: demonstrated experience with post-training techniques, evaluation design, benchmarking, and model quality iteration
  • Ability to work on hard, ambiguous technical problems: proven track record of partnering directly with advanced customers or research teams to scope, reason through, and execute on deep technical challenges involving frontier models
  • Strong technical fluency: you can read papers, interrogate metrics, write or review complex Python/SQL for analysis, and reason about model-data trade-offs
  • Executive presence with world-class researchers and enterprise leaders
  • excellent writing and storytelling
  • Bias to action: you ship, learn, and iterate.
Job Responsibility
Job Responsibility
  • Translate research → product: work with client side researchers on post-training, evals, safety/alignment and build the primitives, data, and tooling they need
  • Partner deeply with core customers and frontier labs: work hands-on with leading AI teams and frontier research labs to tackle hard, open-ended technical problems related to frontier model improvement, performance, and deployment
  • Shape and propose model improvement work: translate customer and research objectives into clear, technically rigorous proposals—scoping post-training, evaluation, and safety work into well-defined statements of work and execution plans
  • Translate research into production impact: collaborate with customer-side researchers on post-training, evaluations, and alignment, and help design the data, primitives, and tooling required to improve frontier models in practice
  • Own the end-to-end lifecycle: lead discovery, write crisp PRDs and technical specs, prioritize trade-offs, run experiments, ship initial solutions, and scale successful pilots into durable, repeatable offerings
  • Lead complex, high-stakes engagements: independently run technical working sessions with senior customer stakeholders
  • define success metrics
  • surface risks early
  • and drive programs to measurable outcomes
  • Partner across Scale: collaborate closely with research (agents, browser/SWE agents), platform, operations, security, and finance to deliver reliable, production-grade results for demanding customers
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • commuter stipend
  • equity based compensation.
  • Fulltime
Read More
Arrow Right
New

Engagement Associate

Legora is on a mission: to redefine how legal work gets done. From the very star...
Location
Location
United States , New York City
Salary
Salary:
125000.00 - 150000.00 USD / Year
Legora
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2+ years of professional experience in a management consulting or similar client advisory role, a customer-facing role in B2B SaaS, or within the Legal field
  • Experience working directly with clients, building relationships, and supporting outcomes in fast-paced, professional settings
  • Strong organizational and project management skills, with the ability to manage multiple client relationships and priorities concurrently
  • Clear, confident communication skills – comfortable explaining concepts, running client calls, and supporting both senior and hands-on stakeholders
  • A proactive, detail-oriented mindset with a strong sense of ownership and comfort operating in ambiguous environments
  • Curiosity and the ability to learn complex products, workflows, and industries quickly
  • A collaborative working style - you enjoy partnering with others and contributing to shared goals
  • Comfort working with metrics and tools to understand customer health, engagement, and outcomes
  • A growth mindset and interest in developing toward more senior client ownership
  • A passion for in-office collaboration – we are in-office 5 days per week in our beautiful Union Square HQ, building together
Job Responsibility
Job Responsibility
  • Support Engagement Directors / Senior Managers on Strategic and Enterprise accounts, assisting with onboarding, enablement, rollout coordination, and ongoing client support
  • Ensure client feedback is channelled internally to the correct stakeholders (Customer Enablement, Product, Marketing) and the client is kept informed of progress
  • Run product enablement sessions across the portfolio to drive consistent usage and measurable value for clients
  • Diagnose client challenges, surface risks and opportunities, and contribute to structured solutions in partnership with senior team members
  • Guide clients on best practices, workflows, and usage strategies aligned to their goals and operational needs
  • Monitor usage trends and customer health metrics using internal tools, proactively flagging risks, churn signals, and growth opportunities
  • Serve as the primary point-of-contact for smaller SMB customers requiring tailored engagement strategies
  • Create resources such as presentations (i.e. Quarterly Business Review and Rollout decks) and training materials for customer engagements
  • Contribute to the development and refinement of Legora’s Engagement playbook, templates, and scalable client-facing processes
What we offer
What we offer
  • Global collaboration: Partner with teams and clients across Stockholm, New York, London, Sydney, and more
  • Competitive package: Comprehensive salary, benefits, and tools for success
  • Meaningful work: Your efforts shape how thousands of lawyers use AI daily
  • In-person environment: Union Square NYC office designed for ambitious builders
  • U.S. employees receive medical, dental, and vision coverage, flexible paid time off plus company holidays, and a 401(k) with company match and automatic enrollment
  • Fulltime
Read More
Arrow Right
New

Team Lead, Sales Part Time

As a Team Leader at HEYDUDE, you’re at the heart of crafting unforgettable exper...
Location
Location
United States , Dublin
Salary
Salary:
19.00 - 24.00 USD / Hour
crocs.com Logo
Crocs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Must be 18 years or older
  • 2-3 years of retail experience with a preference for candidates with prior store leadership roles
  • Flexibility in the work schedule, including nights, weekends, holidays and extended hours, with regular attendance and punctuality essential functions of this position
  • Desire to succeed in a high growth, fast-paced retail environment
Job Responsibility
Job Responsibility
  • Deliver outstanding consumer experiences by applying our V.I.B.E.S service model
  • Actively champion a positive team culture by sharing ideas, feedback, and concerns, while consistently demonstrating the core values of Crocs, Inc
  • Actively assist in all store departments, including POS system management, customer service, merchandising, product placement, visual presentation, and stockroom operations, to create a seamless shopping experience
  • Lead by example on the sales floor, working closely with Store Management to achieve and exceed personal and team sales goals through effective selling strategies and consumer engagement
  • Manage day-to-day team activities within your assigned area by delegating tasks, monitoring progress, and ensuring timely follow-up, while maintaining high service standards
  • Serve as a brand ambassador by staying informed about current product collaborations, launches, and brand initiatives, and sharing this knowledge with consumers to elevate their experience and connection with HEYDUDE
  • Adhere to all HEYDUDE policies, including Asset Protection procedures, shortage prevention, inventory control, and compliance initiatives
What we offer
What we offer
  • This position is eligible to participate in a company incentive program
  • Parttime
Read More
Arrow Right
New

Senior Staff Software Engineer

GEICO is seeking an experienced Senior Staff Software Engineer to lead the archi...
Location
Location
United States , Palo Alto
Salary
Salary:
130000.00 - 260000.00 USD / Year
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of professional experience in software development
  • 10+ years of professional experience working with large enterprise or business applications, preferably Finance or Risk related
  • 5+ years of experience with Risk and Compliance systems (e.g. GRC
  • Regulatory Management
  • Model management
  • etc.) via established vendors (e.g. Auditboard
  • Archer
  • IBM
  • ServiceNow
  • etc.)
Job Responsibility
Job Responsibility
  • Lead the architecture, solution design, and implementation of vendor products or bespoke systems to support the Risk, Compliance, and Audit functions as well as work towards providing insightful analytics to proactively identify trends and issues
  • Leverage their awareness of Risk & Compliance technologies (e.g. Auditboard
  • Archer
  • OpenPages
  • ServiceNow
  • etc.) to support the implementation of vendor applications to support business requirements
  • Leverage finance system knowledge to ensure seamless integration of financial data from ERP systems, sub-ledgers and other enterprise sources to support the Risk and Compliance system requirements
  • Mentor other engineers and consistently share best practices and improve processes within and across teams
  • Understanding of DevOps concepts including Azure DevOps framework and tools to build out appropriate applications
  • Oversee system-wide technical initiatives, migrations, performance tuning, and process automation
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right
New

Associate Counsel

GEICO is looking to hire an Associate Counsel to defend lawsuits filed in Georgi...
Location
Location
United States , Macon
Salary
Salary:
118900.00 - 185525.00 USD / Year
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2 to 6+ years of experience in related field litigation experience in insurance defense and/or personal injury REQUIRED
  • Juris Doctor degree REQUIRED
  • Admission to the Georgia bar REQUIRED
  • Must be licensed in good standing to practice law in Georgia and meet and maintain licensing requirements including mandatory Continuing Legal Education (CLE) requirements where applicable
  • Must be able to travel as required, including but not limited, to attend trials, hearings, depositions, management meetings and conferences
  • Must be able to document files in a clear, concise, professional written manner, to be understood by customers, clients, co-workers and other employees of the organization
  • Must be able to follow complex instructions, resolve conflicts or facilitate conflict resolution, and have strong organization/priority setting and multi-tasking skills
  • Must be able to learn and apply large amounts of technical and procedural information
Job Responsibility
Job Responsibility
  • Researching laws and preparing legal briefs, opinions, and memoranda
  • Rendering opinions on liability, damages, and value as requested by the Claims Department
  • Preparing and handling pleadings, motions, and discovery, to include depositions/examinations before trial and examinations under oath, and defending by trial or dispositive hearing, all matters assigned, as applicable
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right
New

Restaurant & bakery bus person

At Perkins Restaurant & Bakery our employees are part of the Perkins extended fa...
Location
Location
United States , Grand Forks
Salary
Salary:
Not provided
perkinsrestaurants.com Logo
Perkins Restaurant & Bakery
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extensive standing and walking for up to 8 hours
  • Must be able to see at a distance (20) feet, at close range (12 inches), distinguish between shapes and utilize peripheral vision to avoid hazards
  • Must be able to communicate clearly
  • Must have high level of mobility/flexibility in space provided
  • Must have time management skills
  • Must be able to fit through openings 30” wide
  • Must be able to work irregular hours under heavy pressure/stress during busy times
  • Bending, reaching, walking
  • Carrying trays of food products weighing about 50 pounds for distances up to 30 feet
  • Lifting up to 50 pounds
Job Responsibility
Job Responsibility
  • Provides friendly and efficient service to guests
  • Performs all duties to maximize guest satisfaction and quality of work environment as directed by the manager on duty
  • Cleans and reset tables and maintain the Guest service areas of the restaurant according to company policies, procedures, programs and performance standards
  • Reports to work well-groomed, in clean and proper uniform and practices good personal hygiene
  • Prior to meal service, sets dining tables with dishes, silverware and glassware and condiments as required
  • Following meal service, removes soiled dishes, silverware, linens and glassware from dining tables by placing on serving tray and wipes table and chairs and walls next to tables clean with sanitizer
  • Clears crumbs onto serving tray and wipes tables clean
  • Vacuums floor directly under and around dining table
  • Maintains cleanliness of his/her stations and work areas
  • Performs side work during shift downtime including but not limited to
What we offer
What we offer
  • Medical, Dental, Vision, Wellness Program, Life Insurance, Disability Insurance
  • 401k, Health Savings Account
  • Employee Assistance Program, Employee Discount Program, Vacation/Sick Time Benefits, Travel Accident Insurance
  • Discounted Tuition, Waived Application Fee, Deferred Tuition, Complimentary Course, & More
  • Fulltime
Read More
Arrow Right
New

General Manager - Smelter

Leveraging your dynamic and seasoned experienced as a General Manager you will s...
Location
Location
Australia , Portland
Salary
Salary:
Not provided
alcoa.com Logo
Alcoa Corporation
Expiration Date
March 04, 2026
Flip Icon
Requirements
Requirements
  • Tertiary degree in process engineering, chemical engineering, metallurgy, or a related discipline is essential
  • Post-graduate business or financial qualifications are highly desirable
  • Proven experience in the leadership and management of large-scale manufacturing/production environments
  • Extensive technical knowledge of plant/manufacturing processes
  • Demonstrated leadership experience in Health & Safety systems
  • Experience in management of financial P&L and key profitability drivers
  • Solid understanding of commercial markets, demand, and capacity
  • Experience in an industrial relations environment is highly desirable
Job Responsibility
Job Responsibility
  • Ensure the effective operation of the smelter to maximize business returns, within socially and environmentally acceptable standards, and assure employee health and safety
  • Establish challenging production objectives, operating and capital budgets, and resource allocations. Lead and develop an effective management team to meet safety, cost, and efficiency targets
  • Promote health, safety, and environmental objectives across the plant. Ensure all personnel are committed to safety plans and enforce environmental standards to minimize impact on the plant and community
  • Develop a consistent approach to employee relations, build a harmonious workforce, and ensure the right team is in place. Coach and mentor high-potential individuals to strengthen Alcoa’s talent pool
  • Promote diversity within the smelter and oversee the intake of a healthy succession talent pipeline
  • Ensure the latest developments in process and technologies are available to all areas of the plant. Lead the deployment of Alcoa Systems, tools and initiatives
  • Represent the location and company in government, community, and business relationships. Partner with community leaders to develop solutions for issues of importance to Alcoa and the community
What we offer
What we offer
  • Relocation support to this idyllic coastal location
  • Career development opportunities to pursue your passion including technical development support from Alcoa’s Centres of Excellence
  • Competitive short-term and long-term performance-based rewards
  • Collaborate across diverse operational domains to shape enterprise-wide outcomes
  • Fulltime
Read More
Arrow Right
New

Team Lead, Sales

As a Team Leader at Crocs, you’re at the heart of crafting unforgettable experie...
Location
Location
United States , Commerce
Salary
Salary:
17.00 - 21.00 USD / Hour
crocs.com Logo
Crocs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Must be 18 years or older
  • 2-3 years of retail experience with a preference for candidates with prior store leadership roles
  • Flexibility in the work schedule, including nights, weekends, holidays and extended hours, with regular attendance and punctuality essential functions of this position
  • Desire to succeed in a high growth, fast-paced retail environment
  • Ability to move merchandise with appropriate equipment to and from backroom and sales floor
  • Ability to place and arrange items on all shelves and racks
  • Ability to climb and descend ladders while carrying merchandise
  • Ability to lift 30 pounds or more with assistance
  • Ability to be on your feet for a minimum of 8 hours per shift and to continuously move around all areas of the store
  • Ability to stand, walk, kneel, or balance for a duration of time
Job Responsibility
Job Responsibility
  • Deliver outstanding consumer experiences by applying our C.H.A.R.M service model
  • Actively champion a positive team culture by sharing ideas, feedback, and concerns, while consistently demonstrating the core values of Crocs, Inc.
  • Actively assist in all store departments, including POS system management, customer service, merchandising, product placement, visual presentation, and stockroom operations, to create a seamless shopping experience
  • Lead by example on the sales floor, working closely with Store Management to achieve and exceed personal and team sales goals through effective selling strategies and consumer engagement
  • Manage day-to-day team activities within your assigned area by delegating tasks, monitoring progress, and ensuring timely follow-up, while maintaining high service standards
  • Serve as a brand ambassador by staying informed about current product collaborations, launches, and brand initiatives, and sharing this knowledge with consumers to elevate their experience and connection with Crocs
  • Adhere to all Crocs policies, including Asset Protection procedures, shortage prevention, inventory control, and compliance initiatives
What we offer
What we offer
  • medical, dental, and vision coverage
  • life and AD&D
  • short and long-term disability coverage
  • paid time off
  • employee assistance
  • participation in a 401k program that includes company match
  • many other additional voluntary benefits
  • eligible to participate in a company incentive program
  • Fulltime
Read More
Arrow Right