CrawlJobs Logo

Design Engineer - AI Infrastructure

meta.com Logo

Meta

Location Icon

Location:
United States , Menlo Park

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

113000.00 - 164000.00 USD / Year

Job Description:

Want to build cutting-edge AI developer tools? The AI Infrastructure Experiences team designs the tooling and platforms that power Meta’s AI efforts, including model authoring, training, experimentation, deployment, and debugging. This work enables many of Meta’s top AI priorities, including recommendations on Instagram and Facebook, Ads ranking, research (FAIR) and more. As a Product Design Engineer, you'll have the opportunity to contribute to and influence everything from product strategy, to user experience design, to UI engineering.

Job Responsibility:

  • Works with full-stack software teams to realize designs into experiences and will validate or discard hypotheses by testing new designs and concepts with advanced prototypes
  • Develop and iterate rapid models of proposed design solutions for the purposes of communication and evaluation across cross-functional teams
  • Improve tools and drive quality and craft initiatives in an effort to up-level the look and feel of the AI infrastructure products
  • Translate static designs (both hi-fidelity and wire-frame) into end-to-end, interactive prototypes that can be used in research, critique, reviews with leadership, and public-facing product announcements
  • Present work to peers, product teams and executive leadership/stakeholders

Requirements:

  • Bachelor's degree in Design, Human-Computer Interaction, Computer Science, or a related field
  • 2+ years of experience shipping software products
  • Experience in full lifecycle of design and development -- including ideation, prototyping, development, backwards compatibility, failovers, validation, and user research
  • Understanding of design principles and the knowledge to build experiences that span mobile and desktop browsers and native platforms
  • Experience with front-end technologies and prototyping tools, including Figma, PHP, HTML, CSS, and JavaScript frameworks (e.g., React, React Native, NodeJS)
  • A solid portfolio of things you’ve designed and built across a range of platforms, including native and web-based applications

Nice to have:

  • You've built (and shipped) something - an app, an MVP, a proof of concept, etc
  • A portfolio that shows a range of technical skills and a creative use of code
What we offer:
  • bonus
  • equity
  • benefits

Additional Information:

Job Posted:
January 23, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Design Engineer - AI Infrastructure

Software Engineer, AI Infrastructure

As a Software Engineer on our AI Infrastructure team, you will help design the c...
Location
Location
United States , New York, NY; San Mateo, CA
Salary
Salary:
Not provided
fireworks.ai Logo
Fireworks AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
  • 3 years of experience in software engineering, with a focus on infrastructure or machine learning systems
  • Strong programming skills in Python, Go, or a similar language
  • Proven experience in ML infrastructure and tooling (e.g., PyTorch, MLflow, Vertex AI, SageMaker, Kubernetes, etc.)
  • Basic understanding of LLM knowledge (e.g., context length, disaggregated prefill, KV cache memory estimation, etc)
Job Responsibility
Job Responsibility
  • Contribute to the design and development of scalable backend infrastructure that supports distributed training, inference, and data pipelines
  • Build and maintain core backend services such as LLM CI/CD pipeline, control plane, and model serving systems
  • Support performance optimization, cost efficiency, and reliability improvements across compute, storage, and networking layers
  • Building frameworks and safeguards to ensure Fireworks AI has the best model quality in the industry
  • Collaborate with performance, training, and product teams to translate research and product needs into infrastructure solutions
  • Participate in code reviews, technical discussions, and continuous integration and deployment processes
What we offer
What we offer
  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure
  • Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally
  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results
  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation
  • Fulltime
Read More
Arrow Right

Senior AI Infrastructure Engineer

This role will be responsible for designing, deploying, and maintaining high-per...
Location
Location
United States , Bothell; Overland Park; Bellevue
Salary
Salary:
113600.00 - 205000.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years technical engineering experience, preferably in multiple technology focus areas
  • Expert understanding of AI/ML infrastructure components, or GPU-based systems – preferably in a high-availability, large scale environment
  • Hands-on Experience with NVIDIA DGX servers, BasePOD architectures, and advanced GPU technologies
  • Proficient in Linux/UNIX environments, including scripting/automation tools (Bash, Python, Ansible, Terraform)
  • Understanding of AI infrastructure security best practices
  • Experience with container orchestration (Kubernetes, Docker) and GPU workload management tools
  • Strong knowledge of networking (InfiniBand/Ethernet) and storage solutions in AI/ML contexts
Job Responsibility
Job Responsibility
  • Technical System Expertise: Understands system protocols, how systems operate and data flows
  • Technical Engineering Services: Drives engineering projects by active contribution to the application of engineering techniques
  • Innovation: Contributes to designs to implement new ideas which improve an existing and new system/process/service
  • Technical Writing: Writes basic documentation on how technology works
  • Technical Leadership: Collaborates with technical teams and utilizes system expertise to deliver technical solutions
  • Technology Strategy: Contributes to new and existing technology options that support business goals
What we offer
What we offer
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off
  • Paid holidays
  • Paid parental and family leave
  • Fulltime
Read More
Arrow Right

AI Research Infrastructure Engineer

Block is scaling Customer Insights into an AI-powered insights accelerator that ...
Location
Location
United States , Bay Area
Salary
Salary:
168300.00 - 297000.00 USD / Year
block.xyz Logo
Block
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in research, automation implementation, analytics, or related technical fields with hands-on workflow optimization experience
  • 3+ years implementing AI/ML solutions, with experience in automation, LLM integration, or applied AI/analytics workflows
  • Hands-on technical skills in programming languages (Python, R, SQL) for automation development, API/MCP integrations, cloud platforms, and research data pipeline creation
  • Experience with research and analytic platforms and tools (Qualtrics, Snowflake, etc) or transferable experience with analytics and automation platforms
  • Strong technical communication and translation skills with ability to make complex AI/ML concepts, data architecture decisions, and automation workflows accessible and actionable for researchers, product managers, and business stakeholders
  • Proven ability to build stakeholder confidence and alignment during technology transformation
  • Strong project management skills with ability to coordinate multiple complex automation initiatives, manage competing priorities, and deliver measurable operational efficiency gains (reduced cycle times, improved quality outcomes, increased research capacity)
Job Responsibility
Job Responsibility
  • Design, build, and deploy AI agents and agentic workflows that automate research operations from study design through insights delivery, using LLMs, prompt engineering, MCP (Model Context Protocol) integrations, and workflow orchestration integrated with existing research and analytics tech stack
  • Design, build, and maintain automated data pipelines that ingest, transform, and unify research data from diverse sources (surveys, transcripts, analytics, behavioral logs) into AI-ready repositories with RAG capabilities for instant insight access via tools like Goose
  • Architect ETL/ELT frameworks using Python, SQL or equivalent tools to ensure data consistency, traceability, and scalability
  • Develop data models and schemas for research metadata, participant data, and AI-generated insights to support efficient querying and analysis
  • Design and prototype research automation systems using AI/ML techniques, partnering with design & engineering teams to productionize solutions
  • Partner with engineering, design, and platform teams to integrate research automation systems with Block's tech stack (i.e. Goose, GitHub, etc.) and establish governance frameworks for quality, ethics, and compliance
  • Mentor team members on AI agent development, agentic system design, and research automation best practices to build organizational capabilities in intelligent automation
What we offer
What we offer
  • Remote work
  • medical insurance
  • flexible time off
  • retirement savings plans
  • modern family planning
  • Fulltime
Read More
Arrow Right

AI Research Engineer, Data Infrastructure

As a Research Engineer in Infrastructure, you will design and implement a robust...
Location
Location
United States , Palo Alto
Salary
Salary:
180000.00 - 250000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong experience in building data pipelines and ETL systems
  • Ability to design and implement systems for data collection and management from robotic fleets
  • Familiarity with architectures that span on-robot components, on-premise clusters, and cloud infrastructure
  • Experience with data labeling tools or building dataset visualization and annotation tooling
  • Proficiency in creating or applying machine learning models for dataset organization and automated labeling
Job Responsibility
Job Responsibility
  • Optimize operational efficiency of data collection across the NEO robot fleet
  • Design intelligent triggers to determine when and what data should be uploaded from the robots
  • Automate ETL pipelines to make fleet-wide data easily queryable and training-ready
  • Collaborate with external dataset providers to prepare diverse multi-modal pre-training datasets
  • Build frontend tools for visualizing and automating the labeling of large datasets
  • Develop machine learning models for automatic dataset labeling and organization
What we offer
What we offer
  • Equity
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Fulltime
Read More
Arrow Right

Senior Platform Engineer - CI/CD & AI Automation (AI-first)

Groupon is undergoing a critical platform transformation, modernizing its core d...
Location
Location
Czechia , Prague
Salary
Salary:
Not provided
groupon.com Logo
Groupon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of dedicated experience in Platform Engineering, DevOps, or Infrastructure roles
  • Deep expertise building, scaling, and migrating CI/CD systems, with strong practical experience in Jenkins and/or GitHub Actions
  • Expertise in scripting and automation (Python, Go, or Bash)
  • Solid understanding of container technologies, Kubernetes, and cloud build systems
  • Proven experience leveraging AI tooling (e.g., Claude Code, code analysis) to meaningfully increase developer output and optimize platform work
  • Excellent communication and ability to drive technical decisions across multiple platform and product teams
Job Responsibility
Job Responsibility
  • Platform Transformation: Lead the design, planning, and execution of the Jenkins-to-GitHub Actions migration across a large portfolio of microservices
  • Pipeline Engineering: Design and optimize high-performance, secure, and observable CI/CD workflows across GitHub Actions, Jenkins, and Kubernetes environments
  • AI-First Automation: Drive an AI-First workflow by leveraging tools (e.g., Copilot, code generation) to eliminate infrastructure toil, accelerate development, and analyze pipeline failures
  • Core Automation: Develop robust platform automation (e.g., Python, Go, Bash) to improve build efficiency, artifact caching, reliability, and repository hygiene
  • Security & Compliance: Harden CI/CD infrastructure with robust controls for secrets management, RBAC, audit logging, and secure runner design
  • Observability: Implement and enhance CI/CD observability using tools like Prometheus, Grafana, and OpenTelemetry to provide deep insights into performance and reliability
  • Technical Leadership: Mentor engineers and partner across Cloud, Security, and Developer Experience teams to define and evolve our end-to-end delivery platform architecture
Read More
Arrow Right

Senior Software Engineer - ML Infrastructure

We build simple yet innovative consumer products and developer APIs that shape h...
Location
Location
United States , San Francisco
Salary
Salary:
180000.00 - 270000.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of industry experience as a software engineer, with strong focus on ML/AI infrastructure or large-scale distributed systems
  • Hands-on expertise in building and operating ML platforms (e.g., feature stores, data pipelines, training/inference frameworks)
  • Proven experience delivering reliable and scalable infrastructure in production
  • Solid understanding of ML Ops concepts and tooling, as well as best practices for observability, security, and reliability
  • Strong communication skills and ability to collaborate across teams
Job Responsibility
Job Responsibility
  • Design and implement large-scale ML infrastructure, including feature stores, pipelines, deployment tooling, and inference systems
  • Drive the rollout of Plaid’s next-generation feature store to improve reliability and velocity of model development
  • Help define and evangelize an ML Ops “golden path” for secure, scalable model training, deployment, and monitoring
  • Ensure operational excellence of ML pipelines and services, including reliability, scalability, performance, and cost efficiency
  • Collaborate with ML product teams to understand requirements and deliver solutions that accelerate experimentation and iteration
  • Contribute to technical strategy and architecture discussions within the team
  • Mentor and support other engineers through code reviews, design discussions, and technical guidance
What we offer
What we offer
  • medical, dental, vision, and 401(k)
  • Fulltime
Read More
Arrow Right

Engineering Manager, AI Platform

Lead Airtable's AI Platform pod, which builds the foundational infrastructure an...
Location
Location
United States , San Francisco; New York City
Salary
Salary:
240000.00 - 339900.00 USD / Year
airtable.com Logo
Airtable
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Platform builder at heart: think in systems and abstractions
  • experience building infrastructure other teams depend on
  • Technical depth with strategic thinking
  • Systems thinker with shipping velocity
  • AI infrastructure experience: worked on ML platforms, agent frameworks, or AI infrastructure at scale
  • Quality through architecture
  • Strong technical and management growth trajectory: 5+ years experience as an engineer (previously in a staff or TL level IC position) and 1+ years as a manager, or a similar combination
Job Responsibility
Job Responsibility
  • Build the AI platform foundation: own the core agent architecture, orchestration layer, and runtime
  • Design for platform scale: create robust abstractions and APIs
  • Establish AI reliability systems: build evaluation frameworks, monitoring, and quality assurance systems
  • Drive technical strategy: partner with Staff+ engineers to define the technical roadmap
  • Enable AI democratization: build platform capabilities that make sophisticated AI accessible to all Airtable users
What we offer
What we offer
  • Benefits
  • Restricted stock units
  • Incentive compensation
  • Fulltime
Read More
Arrow Right

Software Engineer (Backend Infrastructure)

As a Backend Infrastructure Engineer on the v0 team, you will design and build t...
Location
Location
United States , San Francisco
Salary
Salary:
224000.00 - 336000.00 USD / Year
vercel.com Logo
Vercel
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep experience designing distributed systems, microservices, or high-performance backend architectures
  • Familiarity with AI/LLM-powered systems, developer tools, or platform engineering
  • Strong understanding of TypeScript, Node.js, or similar backend languages
  • Passion for the developer experience and a desire to build magical, intuitive workflows
  • You thrive in ambiguous environments and enjoy shipping in fast cycles
Job Responsibility
Job Responsibility
  • Architect, build, and maintain high-performance, scalable, and reliable backend systems for AI-driven development experiences
  • Develop and enhance AI-powered tools and services within the v0 ecosystem to improve code generation, automation, and platform integrations
  • Design APIs and services that enable seamless interactions between AI models, user input, and Vercel infrastructure
  • Monitor, diagnose, and resolve complex distributed systems issues to ensure reliability at scale
  • Stay ahead of emerging trends in AI, cloud architecture, and developer tooling, identifying opportunities to enhance the v0 platform
  • Collaborate cross-functionally with design, product, and platform engineering to ship features that delight developers
What we offer
What we offer
  • Competitive compensation package, including equity
  • Inclusive Healthcare Package
  • Learn and Grow - we provide mentorship and send you to events that help you build your network and skills
  • Flexible Time Off
  • We will provide you the gear you need to do your role, and a WFH budget for you to outfit your space as needed
  • Fulltime
Read More
Arrow Right