CrawlJobs Logo

Model Behavior Architect

perplexity.ai Logo

Perplexity

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

180000.00 - 260000.00 USD / Year

Job Description:

We're looking for a Model Behavior Architect to help build Perplexity's AI products and evaluations. You'll sit within our AI team and collaborate closely with research and product teams, designing prompt and context engineering strategies to deliver high quality user experiences across multiple domains and models. This role is equal parts craft and science. You'll develop a deep understanding of our answer engine by pressure-testing model capabilities and working across our AI infrastructure (including system and tool prompts, skills, and evaluations) to create a stellar product experience for our users. You'll serve as a go-to expert on prompting, model quality, and behavioral consistency across new product features and model releases.

Job Responsibility:

  • Context Engineering: Design, test, and optimize context strategies and system prompts that shape answer engine behavior across products, features, and use cases
  • Evaluation Systems: Build automated and semi-automated evaluation pipelines that measure model quality, catch regressions, and scale across product surfaces
  • Model Launch Support: Partner with research and engineering to validate model behavior before and during rollouts, ensuring smooth transitions with no degradation
  • Research & Analysis: Identify inconsistencies and failure modes in model outputs through well-designed research projects — for both internal and production-facing systems
  • Cross-functional Collaboration: Work closely with design, product, and research teams to translate product goals into concrete model behavior requirements
  • Knowledge Sharing: Help engineers across teams build intuition for prompt design, context engineering, and evaluation best practices
  • Staying Current: Track the latest alignment, evaluation, and prompting techniques from industry and academia, and bring the best ideas back to the team

Requirements:

  • Experience designing evaluations, benchmarks, or metrics for AI systems
  • Strong written and verbal communication skills, particularly in explaining complex concepts to diverse stakeholders
  • Ability to manage multiple concurrent projects in a fast-moving environment
  • Strong experience with Perplexity or other frontier AI models in production settings
  • Demonstrated experience with Python — you'll prototype, debug, automate, and build systems at scale
  • 3+ years of experience working with LLMs in a product or research setting

Nice to have:

  • Experience with A/B testing or experimentation frameworks
  • Track record of improving AI system performance through systematic evaluation and iteration
What we offer:
  • equity
  • health
  • dental
  • vision
  • retirement
  • fitness
  • commuter and dependent care accounts

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Model Behavior Architect

Agentic AI Architect

Design, develop, and implement advanced AI systems centered around autonomous ag...
Location
Location
Poland , Warsaw
Salary
Salary:
Not provided
https://www.bosch.pl/ Logo
Robert Bosch Sp. z o.o.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Artificial Intelligence, Robotics, or a related field
  • 8+ years of experience in AI/ML architecture
  • 2+ years focused on autonomous systems or agent-based AI
  • Strong experience building and deploying GenAI applications using Microsoft Azure AI services and Google Cloud AI offerings
  • Hands-on development and orchestration of AI agents using Python, multi-agent frameworks, and vector DBs
  • Proven track record of production-grade deployments using MLOps practices on cloud-native platforms
  • Experience integrating LLM-based agents with enterprise systems via APIs, event-driven messaging, and secure identity/access controls
  • Demonstrated ability to lead cross-functional teams and scale AI solutions in mission-critical environments
  • Fluent proficiency in both English and Polish
Job Responsibility
Job Responsibility
  • Defining and evolving architectures by creating and adapting AI reference architectures for agent systems
  • Designing and orchestrating blueprints for multi-agent systems including orchestration, tool use, memory, and human feedback
  • Establishing standards and reusability by creating architecture standards and reusable components
  • Leading implementation by architecting robust, autonomous AI frameworks and guiding development of multi-agent systems
  • Productizing and deploying AI agents by overseeing their deployment and lifecycle using modern infrastructure
  • Utilizing advanced tools such as vector databases, agent orchestration frameworks, and RAG pipelines
  • Collaborating and aligning with teams to define agent behaviors and translate business needs into agent blueprints
  • Monitoring and optimizing by establishing continuous evaluation for agent performance, safety, and compliance
What we offer
What we offer
  • Annual bonus
  • Hybrid work with flexible working hours
  • Referral Bonus Program
  • Copyright costs for IT employees
  • Professional development opportunities
  • Broad access to professional trainings including language courses, conferences and webinars
  • Private medical care and life insurance
  • Cafeteria System with multiple benefits
  • Prepaid Lunch Card
  • Non-working day on the 31st of December
  • Fulltime
Read More
Arrow Right
New

Research Scientist, VLA Models - Atlas

As a VLA Research Scientist on the Atlas team, you will architect, train, and de...
Location
Location
United States , Waltham
Salary
Salary:
Not provided
bostondynamics.com Logo
Boston Dynamics
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • MS with 3+ years of experience or PhD in Machine Learning, Robotics, Computer Science, or related fields
  • Prior experience training and deploying learned policies for complex behaviors in robots or simulated characters
  • Strong background in: Behavior cloning / imitation learning
  • Strong background in: Diffusion policies, ACT, or other modern BC architectures
  • Strong background in: Large behavior models or sequence modeling
  • Strong background in: Multimodal (vision/language/action) learning
  • Experience with modern ML frameworks (PyTorch, JAX) and large-scale training workflows
  • Strong analytical and debugging skills
  • ability to write reliable, well-structured research code
Job Responsibility
Job Responsibility
  • Architect and train end-to-end VLA and Large Behavior Models for mobile manipulation on Atlas
  • Build large-scale imitation learning pipelines that learn from human demonstrations, teleoperation, and simulation data
  • Develop policies capable of few-shot generalization across diverse manipulation tasks
  • Create hierarchical behavior systems that combine learned skills into long-horizon behaviors
  • Integrate your models into Atlas’s autonomy stack in collaboration with controls and platform teams
  • Deploy, debug, and iterate your models directly on physical hardware
  • Write high-quality, maintainable Python and C++ code that fits into a large production codebase
What we offer
What we offer
  • Direct access to an advanced humanoid robot—test your models on hardware quickly and often
  • A collaborative, inclusive team that values diverse perspectives and identities
  • The opportunity to do applied VLA research with real-world impact
  • A mission-focused environment where your work will define the future of general-purpose humanoids
  • Fulltime
Read More
Arrow Right
New

Monetization Analyst

As an Analyst, you will be the strategic bridge between raw data and operational...
Location
Location
Salary
Salary:
Not provided
skelar.tech Logo
SKELAR
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2+ years of experience in Data or Operations Analytics (preferably in Fintech, SaaS, or high-growth startups)
  • Proficiency in writing complex, optimized SQL queries (CTEs, Window Functions, etc.)
  • Proven ability to tell a story with data in Tableau
  • Strong understanding of product funnels (Amplitude) and business unit economics
  • A keen interest in AI/Machine Learning and a willingness to experiment with new automation tools
  • Fluent in translating data findings into clear business recommendations
Job Responsibility
Job Responsibility
  • Foundation Building & Data Architecture: own the end-to-end data flow—from querying and structuring raw data in BigQuery to ensuring a 'single source of truth' for the entire Ops department
  • Advanced Analytics & Modeling: develop antifraud models and design complex behavioral segmentations
  • Performance Monitoring: architecting the automated KPI ecosystem and health-check dashboards for Payments, Support, and FirstLook
  • Strategic Insight Generation: conducting high-level deep-dives into payment failure trends, transaction routing optimization, and operational bottlenecks
  • AI & Automation Integration: leading the charge in implementing AI/LLM solutions
  • Actionable Implementation: working as a strategic partner to Ops Leads to ensure data models translate into direct business growth
What we offer
What we offer
  • Competitive compensation and a high-impact role
  • Full Ownership: Freedom to optimize and evolve the support function as you see fit
  • Support from the best: Access to internal professional communities (Marketing, Product, Operations) within the SKELAR network
  • A meaningful mission: Help millions of people live healthier lives every day
  • Fulltime
Read More
Arrow Right

Analog Models & Verification Engineer, Architect

Synopsys software engineers are key enablers in the world of Electronic Design A...
Location
Location
United States , Chandler
Salary
Salary:
181000.00 - 271000.00 USD / Year
synopsys.com Logo
Synopsis Engineering
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BSc, MSc or PhD in Electrical/Computer Engineering, with 7+ years of relevant industry experience
  • Advanced proficiency with Verilog, SystemVerilog (including RNM, wreal modeling, and IEEE 1800-2012 SV-DC extensions)
  • Robust understanding of analog/mixed-signal SerDes sub-blocks: TX/RX, ADC, DAC, CDR, CTLE/equalizer, VGA/amplifier, PLL, VCO, Phase Interpolator
  • Proven ability to model analog circuit impairments: offsets, gain/mismatches, jitter, noise, skew, supply noise, etc.
  • Fluency with analog schematics, SPICE-level simulation tools and waveform analysis
  • Strong scripting/programming in Python, TCL, Perl, C/C++
  • Familiarity with verification flows: regression, analog/mixed-signal co-simulation, digital verification, gate-level simulation, formal methods, and emulation
  • Experience with UVM testbenches, assertion-driven and coverage-driven verification
Job Responsibility
Job Responsibility
  • Work closely with analog circuit teams to extract all necessary details, simulate, and sign off on high-fidelity models by rigorous comparison with SPICE-level simulations and silicon data
  • Develop and refine behavioural models of the analog portions of high-speed SerDes blocks (TX/RX, ADC, DAC, CDR, CTLE/equalizer, VGA/amplifier, PLL, VCO, Phase Interpolator)
  • Ensure models accurately capture all relevant functionalities, calibration/adaptation controls, time- and mode-dependent behaviors, key performance aspects, and residual impairments (offsets, gain mismatches, jitter, noise, skew, supply noise, etc.)
  • Interface with digital design and verification teams to guarantee exhaustive model verification—ensuring all functionalities and edge-cases are included in regression and integration test plans
  • Reviewing execution against verification plans through regular meetings with multiple verification teams (analog, cosim, DV, GLS, formal, emulation)
  • Integrate behavioral models into modern verification environments (UVM, MS-MDV), utilizing assertion-based checks, analog/digital interface scoreboards, and power-aware techniques as appropriate
  • Optimize model implementations for simulation speed and accuracy
  • Drive continuous improvement and automation in the creation, maintenance, and validation of SerDes behavioral models
  • Establish and evangelize best practices and reusable frameworks for efficient, scalable RNM modeling and mixed-signal verification
  • Mentor and support teammates, sharing knowledge, methodology innovations, and documentation
What we offer
What we offer
  • Comprehensive medical and healthcare plans that work for you and your family
  • In addition to company holidays, we have ETO and FTO Programs
  • Maternity and paternity leave, parenting resources, adoption and surrogacy assistance, and more
  • Purchase Synopsys common stock at a 15% discount, with a 24 month look-back
  • Save for your future with our retirement plans that vary by region and country
  • Competitive salaries
  • Annual bonus, equity, and other discretionary bonuses
  • Fulltime
Read More
Arrow Right
New

Radar Systems Engineer

Location
Location
United States
Salary
Salary:
Not provided
aptiv.com Logo
Aptiv plc
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Gather and analyze user requirements to define clear and comprehensive use cases for automotive systems
  • Document use cases using standardized formats
  • Assist in the design and development of product system architectures
  • Contribute to the development of automotive systems by participating in all phases of the product lifecycle
  • Use tools such as DOORS to capture, document, and track system requirements
  • Participate in FMEA activities
  • Support system integration activities
  • Utilize Enterprise Architect to create and manage system architecture models
  • Use SysML to model system architectures
  • Configure, simulate, and analyze Controller Area Network (CAN) communication networks
Job Responsibility
Job Responsibility
  • Gather and analyze user requirements to define clear and comprehensive use cases for automotive systems
  • Document use cases using standardized formats to ensure alignment with stakeholder needs and project objectives
  • Assist in the design and development of product system architectures, including defining system components, interfaces, and behaviors
  • Contribute to the development of automotive systems by participating in all phases of the product lifecycle, from concept to production
  • Use tools such as DOORS to capture, document, and track system requirements and ensure alignment with project goals and customer needs
  • Participate in FMEA activities to identify potential failure modes, their effects, and the likelihood of occurrence within the system
  • Support system integration activities by assisting in the configuration, integration, and testing of system components and interfaces
  • Utilize Enterprise Architect to create and manage system architecture models, diagrams, and documentation
  • Use SysML to model system architectures, requirements, behaviors, and interactions, facilitating system-level analysis and design
  • Configure, simulate, and analyze Controller Area Network (CAN) communication networks for testing and validation purposes
  • Fulltime
Read More
Arrow Right

Product Lead, Safety Systems & Trust

At Inflection AI, our public benefit mission is to harness the power of AI to im...
Location
Location
United States , Palo Alto
Salary
Salary:
230000.00 - 300000.00 USD / Year
inflection.ai Logo
Inflection AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Technical deep-diver with a strong grasp of ML concepts including LLMs, RAG, diffusion models, and the nuances of model drift
  • 8+ years of product experience, with a track record of shipping complex products from 0 to 1 at scale, ideally in Integrity, Trust & Safety, or other high-stakes AI domains
  • Systems thinker who thrives in high-ambiguity, low-precedent environments where you define both the problem and the solution
  • Ethical architect and first-principles thinker committed to making AI safe and beneficial for humanity
  • Hands-on experience with Constitutional AI or self-correction loops in LLM chains
  • Deep expertise in adversarial analysis, including many-shot jailbreaking and prompt injection mitigation
  • Proficiency in creating safety evaluations and using telemetry to measure system performance and blind spots
  • Have a bachelor’s degree or equivalent in a related field to the offered position requirements.
Job Responsibility
Job Responsibility
  • Productize safety stack and inference engine capabilities, including low-latency, token-level safety filters and constrained decoding protocols that block harm without degrading performance
  • Partner with Research on alignment and model behavior, defining RLHF/DPO objectives and reinforcement signals while developing taxonomies for steerability and model behavior
  • Lead adversarial assessment and red-teaming initiatives, architecting automated stress-test infrastructure and evaluation frameworks for safety benchmarks
  • Serve as a strategic leader driving cross-functional teams to implement and evolve Inflection’s most critical safety initiatives
  • Build the affordance layer for trust, working with Design to ensure users understand when and why to trust AI interactions and agent decisions
  • Drive policy-as-code execution, partnering with Legal and Engineering to translate privacy, safety, and brand principles into technical specifications
  • Shape system instructions, fine-tuning datasets, and user experience guardrails to support safe and ethical AI deployment.
What we offer
What we offer
  • Diverse medical, dental and vision options
  • 401k matching program
  • Unlimited paid time off
  • Parental leave and flexibility for all parents and caregivers
  • Support of country-specific visa needs for international employees living in the Bay Area
  • Meaningful equity component.
Read More
Arrow Right

Lead Product Data Scientist

We're seeking a strategic analytics leader who can combine deep technical expert...
Location
Location
United States
Salary
Salary:
130000.00 - 170000.00 USD / Year
personifyhealth.com Logo
Personify Health
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced SQL skills for complex query design and optimization
  • Deep proficiency in Python with libraries like pandas, numpy, scikit-learn for modeling and analysis
  • Strong grounding in statistical inference, hypothesis testing, and causal modeling
  • Expertise with experimentation platforms and product analytics tools (Amplitude, Mixpanel, Heap)
  • 5-8+ years leading product analytics or data science initiatives in SaaS, healthcare, or consumer tech
  • Proven track record informing product strategy through analysis and experimentation
  • Experience defining, evolving, and scaling product KPIs and measurement frameworks
  • Demonstrated success influencing product and company-level decisions with data
  • Experience partnering with product managers, digital user flows and funnels
  • Ability to coach and raise data fluency of product managers, designers, and business leaders
Job Responsibility
Job Responsibility
  • Drive product analytics strategy (40%): Act as primary analytics partner to senior leadership, developing advanced product funnel and user behavior models that identify patterns unlocking new opportunities
  • Architect experimentation frameworks (30%): Design and evaluate A/B and multivariate tests with statistical rigor while creating predictive models for feature impact and long-term engagement
  • Lead cross-functional influence (20%): Translate analytical findings into executive-level narratives that drive confident, data-backed decisions while facilitating workshops that raise data fluency across teams
  • Ensure technical excellence (10%): Partner with Data Engineering to optimize analytics-ready pipelines while establishing best practices and reusable libraries for SQL, Python, and experimentation templates
What we offer
What we offer
  • Comprehensive medical and dental coverage through our own health solutions
  • Mental health support and wellness programs designed by experts who get it
  • Flexible work arrangements that fit your life
  • Retirement planning support
  • Basic Life and AD&D Insurance plus Short-Term and Long-Term Disability protection
  • Employee savings programs and voluntary benefits like Critical Illness and Hospital Indemnity coverage
  • Professional development opportunities and clear career progression paths
  • Mentorship from industry leaders
  • Learning budget to invest in skills
  • Unlimited PTO policy
  • Fulltime
Read More
Arrow Right
New

Software Engineer, Monetization AI/ML

We’re looking for experienced Software Engineers to help build OpenAI’s foundati...
Location
Location
United States , San Francisco; Seattle
Salary
Salary:
230000.00 - 385000.00 USD / Year
openai.com Logo
OpenAI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of experience building and scaling ML-powered systems in production environments
  • Experience working on ranking, recommendation, ads, marketplaces, or large-scale ML inference systems
  • Comfortable operating across the full stack — from model development to backend services and production deployment
  • Enjoy deeply technical 0→1 problem spaces where architecture, strategy, and implementation are still being invented
  • Strong intuition for ML and system design tradeoffs, and can reason about long-term scalability and maintainability
  • Communicate clearly, collaborate effectively across disciplines, and think holistically about system behavior
Job Responsibility
Job Responsibility
  • Architect, build, and evolve large-scale ads ranking and recommendation systems using modern ML and AI techniques
  • Design and productionize LLM- and transformer-inspired models that leverage sequential signals, long-horizon context, and sparse or delayed feedback
  • Develop model-driven decision logic and inference pipelines that operate under real-world constraints around performance, reliability, and privacy
  • Partner closely with Product, Design, and Research to define requirements and translate ambiguous product goals into scalable ML systems
  • Prototype, experiment, and rapidly iterate on new model architectures and training approaches to improve relevance, quality, and efficiency
  • Build services and infrastructure that support training, evaluation, online inference, and continuous optimization of ML models
  • Establish strong measurement, experimentation, and debugging practices to understand model behavior and system-level outcomes
  • Contribute to technical strategy and help shape the long-term evolution of OpenAI’s monetization and recommendation stack
  • Embed safety, fairness, and policy considerations directly into model design and system architecture from first principles
What we offer
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Fulltime
Read More
Arrow Right