CrawlJobs Logo

Ai/ml Infrastructure Engineer

helpcare.ai Logo

Helpcare AI

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

150000.00 - 240000.00 USD / Year

Job Description:

Zensors is the spatial intelligence platform for the physical world. Our AI platform provides real-time insights—from airport queue times to office utilization—helping organizations make smarter operational decisions. Zensors is processing massive streams of video data 24/7 with human-level accuracy. To do this at scale, we rely on cutting-edge optimization to ensure our vision transformer and detection models run efficiently on both cloud and edge compute resources. The AI Infrastructure team at Zensors builds the engine that powers our visual sensing platform. We provide the tools to automate the lifecycle of our AI workflow, including model development, evaluation, optimization, deployment, and monitoring across thousands of video streams. As a Machine Learning Engineer in ML Runtime & Optimization, you will develop technologies to accelerate the training and inference of computer vision models that power smart spaces and cities.

Job Responsibility:

  • Optimizing Core ML Pipelines: Identifying key bottlenecks in our current video analytics pipeline and performing in-depth analysis to ensure the best possible performance on current server and edge compute architectures
  • Cross-Stack Collaboration: Collaborating closely with AI research and platform engineering teams to optimize core parallel algorithms and influence the design of our next-generation inference infrastructure
  • Model Acceleration: Applying advanced model optimization techniques—such as quantization (Int8/FP16), pruning, and layer fusion—to our Vision Transformers (ViTs) and CNNs to maximize throughput and minimize latency
  • Building Efficient Operators: Working across the entire ML framework/compiler stack (e.g., PyTorch, CUDA, TensorRT, and NVIDIA DeepStream) to write custom optimized ML operator libraries
  • Resource Efficiency: Reducing the compute cost per video stream to enable massive scalability of our SaaS product
  • Data Management: Building, improving, maintaining, and operating systems to facilitate the collection, labeling, and use of visual data for ML training

Requirements:

  • BS/MS or Ph.D. in Computer Science, Electrical Engineering, or a related discipline
  • Strong programming skills in C/C++ and Python
  • Experience with model optimization, quantization, and efficient deep learning techniques (e.g., knowledge distillation, pruning)
  • Deep understanding of GPU hardware performance, including execution models, thread hierarchy, memory/cache management, and the cost/performance trade-offs of video processing
  • Experience with profiling and benchmarking tools (e.g., Nsight Systems, Nsight Compute) to validate performance on complex architectures
  • Experience identifying and resolving compute and data flow bottlenecks, particularly in high-bandwidth video processing pipelines
  • Strong communication skills and the ability to work cross-functionally between research and infrastructure teams

Nice to have:

  • Familiarity with database systems (e.g., SQL, Neo4j)
  • Work in Computer Vision, Deep Learning, and Vision Transformers
  • Experience with video processing frameworks such as NVIDIA DeepStream, DALI, or FFmpeg
  • Familiarity with ML compilers (e.g., TVM, MLIR) or inference engines like TensorRT or ONNX Runtime
  • Knowledge of distributed training systems or cloud-scale inference serving (e.g., Triton Inference Server)

Additional Information:

Job Posted:
December 31, 2025

Employment Type:
Fulltime
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Ai/ml Infrastructure Engineer

Software Engineer, Data Infrastructure

The Data Infrastructure team at Figma builds and operates the foundational platf...
Location
Location
United States , San Francisco; New York
Salary
Salary:
149000.00 - 350000.00 USD / Year
figma.com Logo
Figma
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of Software Engineering experience, specifically in backend or infrastructure engineering
  • Experience designing and building distributed data infrastructure at scale
  • Strong expertise in batch and streaming data processing technologies such as Spark, Flink, Kafka, or Airflow/Dagster
  • A proven track record of impact-driven problem-solving in a fast-paced environment
  • A strong sense of engineering excellence, with a focus on high-quality, reliable, and performant systems
  • Excellent technical communication skills, with experience working across both technical and non-technical counterparts
  • Experience mentoring and supporting engineers, fostering a culture of learning and technical excellence
Job Responsibility
Job Responsibility
  • Design and build large-scale distributed data systems that power analytics, AI/ML, and business intelligence
  • Develop batch and streaming solutions to ensure data is reliable, efficient, and scalable across the company
  • Manage data ingestion, movement, and processing through core platforms like Snowflake, our ML Datalake, and real-time streaming systems
  • Improve data reliability, consistency, and performance, ensuring high-quality data for engineering, research, and business stakeholders
  • Collaborate with AI researchers, data scientists, product engineers, and business teams to understand data needs and build scalable solutions
  • Drive technical decisions and best practices for data ingestion, orchestration, processing, and storage
What we offer
What we offer
  • equity
  • health, dental & vision
  • retirement with company contribution
  • parental leave & reproductive or family planning support
  • mental health & wellness benefits
  • generous PTO
  • company recharge days
  • a learning & development stipend
  • a work from home stipend
  • cell phone reimbursement
  • Fulltime
Read More
Arrow Right

Founding Infrastructure Engineer

As the first dedicated Infrastructure Engineer at Reducto, you will influence ev...
Location
Location
United States , San Francisco
Salary
Salary:
150000.00 - 300000.00 USD / Year
reducto.ai Logo
Reducto
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Have 5+ years of hands-on experience in building or supporting production-grade infrastructure and reliability processes for high-throughput systems
  • Are comfortable with Python or similar languages
  • Exceptional at working across cloud platforms, container orchestration (e.g., Kubernetes), networking, and storage technologies
  • Build your own tools on the fly to diagnose, experiment, and address reliability problems
  • Bring a quantitative, hands-on approach to system operations, automation, and continuous improvement
  • Are your own worst critic—have an extremely high bar for quality and always aim for robust solutions rather than quick fixes
Job Responsibility
Job Responsibility
  • Designing, building, and maintaining highly available, scalable infrastructure to support intensive AI/ML workloads and real-time model deployments
  • Implementing robust monitoring, alerting, and observability systems to ensure system health, performance, and uptime across cloud and on-prem environments
  • Debugging, optimizing, and automating infrastructure for fast iteration and rapid deployment cycles, focusing on both reliability and developer velocity
  • Proactively identifying, investigating, and resolving incidents to minimize downtime and maintain world-class service levels for enterprise customers
  • Collaborating closely with engineers, ML specialists, and founders to shape product, infrastructure, and security strategies
What we offer
What we offer
  • Unlimited PTO
  • Free lunch daily at the office
  • Reimbursed Transportation
  • Generous health insurance covering medical, dental, and vision
  • Health and Wellness Budget up to $150/mo reimbursement
  • Parental Leave
  • Fulltime
Read More
Arrow Right

Senior AI/ML Engineer

At Sigma, we’re not just adding AI—we’re building the future of how people work ...
Location
Location
United States , San Francisco
Salary
Salary:
240000.00 - 270000.00 USD / Year
sigmacomputing.com Logo
Sigma Computing
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience building and deploying production-grade AI/ML systems
  • Deep knowledge of machine learning, deep learning, and applied AI
  • Experience across the full ML lifecycle: data curation, training, deployment, monitoring
  • A track record of building things that ship—whether it’s recommendations, search, machine translation, or something equally complex
  • Experience adapting or training foundation models (language or multimodal) for novel domains
Job Responsibility
Job Responsibility
  • Partner with product, design, and engineering teams to identify high-impact AI/ML opportunities
  • Prototype and productionize AI systems that feel intuitive but do a lot under the hood—recommendations, natural language interfaces, agentic workflows, and more
  • Develop and scale AI/ML infrastructure that powers both internal tooling and customer-facing features
  • Tackle novel UX problems at the intersection of AI, Visualizations, and apps
What we offer
What we offer
  • Equity
  • Generous health benefits
  • Flexible time off policy
  • Paid bonding time for all new parents
  • Traditional and Roth 401k
  • Commuter and FSA benefits
  • Lunch Program
  • Dog friendly office
  • Fulltime
Read More
Arrow Right

Senior Data & AI/ML Engineer - GCP Specialization Lead

We are on a bold mission to create the best software services offering in the wo...
Location
Location
United States , Menlo Park
Salary
Salary:
Not provided
techjays.com Logo
techjays
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • GCP Services: BigQuery, Dataflow, Pub/Sub, Vertex AI
  • ML Engineering: End-to-end ML pipelines using Vertex AI / Kubeflow
  • Programming: Python & SQL
  • MLOps: CI/CD for ML, Model deployment & monitoring
  • Infrastructure-as-Code: Terraform
  • Data Engineering: ETL/ELT, real-time & batch pipelines
  • AI/ML Tools: TensorFlow, scikit-learn, XGBoost
  • Min Experience: 10+ Years
Job Responsibility
Job Responsibility
  • Design and implement data architectures for real-time and batch pipelines, leveraging GCP services such as BigQuery, Dataflow, Dataproc, Pub/Sub, Vertex AI, and Cloud Storage
  • Lead the development of ML pipelines, from feature engineering to model training and deployment using Vertex AI, AI Platform, and Kubeflow Pipelines
  • Collaborate with data scientists to operationalize ML models and support MLOps practices using Cloud Functions, CI/CD, and Model Registry
  • Define and implement data governance, lineage, monitoring, and quality frameworks
  • Build and document GCP-native solutions and architectures that can be used for case studies and specialization submissions
  • Lead client-facing PoCs or MVPs to showcase AI/ML capabilities using GCP
  • Contribute to building repeatable solution accelerators in Data & AI/ML
  • Work with the leadership team to align with Google Cloud Partner Program metrics
  • Mentor engineers and data scientists toward achieving GCP certifications, especially in Data Engineering and Machine Learning
  • Organize and lead internal GCP AI/ML enablement sessions
What we offer
What we offer
  • Best in class packages
  • Paid holidays and flexible paid time away
  • Casual dress code & flexible working environment
  • Medical Insurance covering self & family up to 4 lakhs per person
Read More
Arrow Right

Staff AI/ML Engineer

At Sigma, we’re not just adding AI—we’re building the future of how people work ...
Location
Location
United States , New York City
Salary
Salary:
240000.00 - 270000.00 USD / Year
sigmacomputing.com Logo
Sigma Computing
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience building and deploying production-grade AI/ML systems
  • Deep knowledge of machine learning, deep learning, and applied AI
  • Experience across the full ML lifecycle: data curation, training, deployment, monitoring
  • A track record of building things that ship—whether it’s recommendations, search, machine translation, or something equally complex
  • Experience adapting or training foundation models (language or multimodal) for novel domains
Job Responsibility
Job Responsibility
  • Partner with product, design, and engineering teams to identify high-impact AI/ML opportunities
  • Prototype and productionize AI systems that feel intuitive but do a lot under the hood—recommendations, natural language interfaces, agentic workflows, and more
  • Develop and scale AI/ML infrastructure that powers both internal tooling and customer-facing features
  • Tackle novel UX problems at the intersection of AI, BI, and apps
What we offer
What we offer
  • Equity
  • Generous health benefits
  • Flexible time off policy
  • Paid bonding time for all new parents
  • Traditional and Roth 401k
  • Commuter and FSA benefits
  • Lunch Program
  • Dog friendly office
  • Stock options
  • Fulltime
Read More
Arrow Right

Staff AI/ML Engineer

At Sigma, we’re not just adding AI—we’re building the future of how people work ...
Location
Location
United States , San Francisco
Salary
Salary:
240000.00 - 270000.00 USD / Year
sigmacomputing.com Logo
Sigma Computing
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience building and deploying production-grade AI/ML systems
  • Deep knowledge of machine learning, deep learning, and applied AI
  • Experience across the full ML lifecycle: data curation, training, deployment, monitoring
  • A track record of building things that ship—whether it’s recommendations, search, machine translation, or something equally complex
  • Experience adapting or training foundation models (language or multimodal) for novel domains
Job Responsibility
Job Responsibility
  • Partner with product, design, and engineering teams to identify high-impact AI/ML opportunities
  • Prototype and productionize AI systems that feel intuitive but do a lot under the hood—recommendations, natural language interfaces, agentic workflows, and more
  • Develop and scale AI/ML infrastructure that powers both internal tooling and customer-facing features
  • Tackle novel UX problems at the intersection of AI, BI, and apps
What we offer
What we offer
  • Equity
  • Generous health benefits
  • Flexible time off policy
  • Paid bonding time for all new parents
  • Traditional and Roth 401k
  • Commuter and FSA benefits
  • Lunch Program
  • Dog friendly office
  • Stock options
  • Fulltime
Read More
Arrow Right

Senior AI Infrastructure Engineer

This role will be responsible for designing, deploying, and maintaining high-per...
Location
Location
United States , Bothell; Overland Park; Bellevue
Salary
Salary:
113600.00 - 205000.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years technical engineering experience, preferably in multiple technology focus areas
  • Expert understanding of AI/ML infrastructure components, or GPU-based systems – preferably in a high-availability, large scale environment
  • Hands-on Experience with NVIDIA DGX servers, BasePOD architectures, and advanced GPU technologies
  • Proficient in Linux/UNIX environments, including scripting/automation tools (Bash, Python, Ansible, Terraform)
  • Understanding of AI infrastructure security best practices
  • Experience with container orchestration (Kubernetes, Docker) and GPU workload management tools
  • Strong knowledge of networking (InfiniBand/Ethernet) and storage solutions in AI/ML contexts
Job Responsibility
Job Responsibility
  • Technical System Expertise: Understands system protocols, how systems operate and data flows
  • Technical Engineering Services: Drives engineering projects by active contribution to the application of engineering techniques
  • Innovation: Contributes to designs to implement new ideas which improve an existing and new system/process/service
  • Technical Writing: Writes basic documentation on how technology works
  • Technical Leadership: Collaborates with technical teams and utilizes system expertise to deliver technical solutions
  • Technology Strategy: Contributes to new and existing technology options that support business goals
What we offer
What we offer
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off
  • Paid holidays
  • Paid parental and family leave
  • Fulltime
Read More
Arrow Right

Principal AI/ML & Innovation Engineer

We are seeking Principal AI/ML & Innovation Engineer who will be leading initiat...
Location
Location
Puerto Rico , Aguadilla
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or master’s degree in computer science, engineering, data science, machine learning, artificial intelligence, or closely related quantitative discipline
  • Typically, 10-15 years’ experience
  • Solid understanding of fundamental AI and machine learning concepts, including supervised and unsupervised learning, deep learning, reinforcement learning, natural language processing, computer vision, and statistical modeling
  • Proficient in implementing and deploying various machine learning algorithms, such as decision trees, random forests, support vector machines, and neural networks
  • Knowledge of popular machine learning frameworks and libraries like TensorFlow, PyTorch, or sci-kit
  • Strong understanding of GitHub CoPilot, Cursor, N8N, vibe coding, Windsurf, and similar technologies
  • Experience in Cloud Infrastructure (AWS, Azure, etc)
  • Knowledge of Open Source, Linux, etc
  • Understanding of Devops, SRE
  • Expertise in deep learning techniques, architectures, and frameworks (e.g., convolutional neural networks (CNN), recurrent neural networks (RNN), generative adversarial networks (GAN), etc.)
Job Responsibility
Job Responsibility
  • Designing, developing, and deploying advanced machine learning models and algorithms
  • Leading research initiatives to explore novel approaches and technologies
  • Designing the architecture of AI systems and ensuring scalability, performance, and reliability
  • Collaborating with other teams, such as data scientists, software engineers, and product managers
  • Providing technical leadership and mentorship to junior engineers
  • Overseeing and guiding multiple design review sessions across different projects
  • Partnering with the engineering manager and team lead to establish long-term design and implementation strategies
  • Leading efforts to incorporate feedback loops and continuous improvement processes
  • Leading meetings, ensuring efficient progress tracking, issue resolution, and team coordination
  • Creating and delivering high-level presentations and reports to executive stakeholders
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.