CrawlJobs Logo

Ai network engineer

United States, Houston · Job Posted June 09, 2026
Apply Position
Job Link Share

Job Description

We are seeking an experienced AI Network Engineer to support and optimize high-performance infrastructure powering AI/ML workloads. This role focuses on designing and maintaining GPU-accelerated environments leveraging NVIDIA technologies, high-throughput networking, and low-latency architectures.

Job Responsibility

  • Design, implement, and support high-performance networks for AI/ML workloads, including GPU clusters and distributed training environments
  • Deploy and optimize NVIDIA-based infrastructure (DGX systems, HGX platforms, or GPU clusters)
  • Configure and manage high-speed networking technologies such as InfiniBand, RoCE, and 100/200/400Gb Ethernet
  • Optimize network performance for east-west traffic, low latency, and large data throughput required for AI model training
  • Integrate NVIDIA software stack (CUDA, NCCL, GPU Cloud, AI Enterprise) with networking and compute environments
  • Troubleshoot performance bottlenecks across network, storage, and GPU interconnects
  • Collaborate with AI/ML engineers to ensure infrastructure meets training and inference demands
  • Support automation and infrastructure-as-code initiatives for scalable AI environments

Requirements

  • 5+ years of experience in network engineering or infrastructure engineering
  • Hands-on experience with high-performance networking (InfiniBand, RDMA, RoCE)
  • Experience supporting GPU-based or HPC environments
  • Strong knowledge of data center networking (L2/L3, BGP, EVPN, VXLAN)
  • Familiarity with Linux systems and performance tuning
  • Experience with NVIDIA ecosystems (DGX, CUDA, NCCL, or similar)
  • Ability to diagnose low-latency and high-throughput network issues

Nice to have

  • Experience with NVIDIA AI Enterprise or DGX SuperPOD environments
  • Knowledge of AI/ML workflows and distributed training frameworks (PyTorch, TensorFlow)
  • Familiarity with Kubernetes for AI workloads
  • Experience with storage solutions supporting AI workloads (parallel file systems, NVMe over Fabrics)
  • Exposure to cloud-based GPU environments (AWS, Azure, GCP)

What we offer

  • medical, vision, dental, and life and disability insurance
  • company 401(k) plan

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Ai network engineer

8 matching positions

Network Systems Engineer - Data Center AI Network Engineer

We are the Data Center Network Services team within Cisco IT that supports netwo...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
duo.com Logo
Duo Security
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor of Engineering or Technology with a minimum of 10 years of experience in designing, deploying, operating and managing scalable DC network infrastructure (using Nexus OS)
  • Experience in technologies like Routing, Switching, Nexus, VPC, VDC, VLAN, VXLAN, BGP
  • Experience on handling incident, problem and change management
  • Familiarity with DevOps principles, comfortable with Agile practices
Job Responsibility
Job Responsibility
  • You will design, develop, test and deploy DC network capabilities within Data Center Network
  • You are engaging and comfortable collaborating with fellow engineers across multiple disciplines as well as internal clients
  • You will create innovative, high-quality capabilities enabling our clients to have the best possible experience
What we offer
What we offer
  • Our benefits are designed to support every aspect of your life: from your well-being to your time away to your family
Read More
Arrow Right

IT Network Engineer - Network AI Automation & Full Stack Infrastructure

We are seeking a highly skilled IT Network Engineer – Network AI Automation & Fu...
Location
Location
Salary
Salary:
Not provided
valvoline.com Logo
Valvoline
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of enterprise global network engineering experience
  • Bacehlors Degree in Information Technology, CIS or related field or equivalent experience
  • Strong experience with routing and switching protocols (BGP, OSPF, VLANs)
  • Experience with next-generation firewalls and SASE, preferably Palo Alto
  • Experience with Cisco or Meraki switching and wireless
  • Cloud networking experience in Azure, AWS, or GCP
  • Experience supporting global WAN or SD-WAN environments
  • Familiarity with Aviatrix, Megaport, and observability platforms such as Sumo Logic
  • Practical experience with network automation (Python, Terraform, Ansible)
Job Responsibility
Job Responsibility
  • Engineer and support global enterprise network infrastructure across routing, switching, firewalls, SASE, WAN/SD-WAN, wireless, and proxy services
  • Troubleshoot complex network issues across multiple infrastructure layers
  • Support hybrid cloud networking architectures across Azure, AWS, or GCP
  • Manage and optimize next-generation firewall policies using platforms such as Palo Alto Networks
  • Support Cisco and Meraki switching and wireless infrastructure
  • Design and maintain global WAN connectivity, including high-performance cloud connectivity such as Megaport
  • Support cloud networking solutions leveraging platforms such as Aviatrix
  • Implement network automation and Infrastructure as Code (IaC) to improve operational efficiency
  • Leverage observability platforms such as Sumo Logic to improve monitoring and visibility
  • Contribute to AI-driven network operations and automation initiatives
  • Fulltime
Read More
Arrow Right

IT Network Engineer - Network AI Automation & Full Stack Infrastructure

We are seeking a highly skilled IT Network Engineer – Network AI Automation & Fu...
Location
Location
United States
Salary
Salary:
Not provided
valvolineglobal.com Logo
Valvoline Global
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of enterprise global network engineering experience
  • Bacehlors Degree in Information Technology, CIS or related field or equivalent experience
  • Strong experience with routing and switching protocols (BGP, OSPF, VLANs)
  • Experience with next-generation firewalls and SASE, preferably Palo Alto
  • Experience with Cisco or Meraki switching and wireless
  • Cloud networking experience in Azure, AWS, or GCP
  • Experience supporting global WAN or SD-WAN environments
  • Familiarity with Aviatrix, Megaport, and observability platforms such as Sumo Logic
  • Practical experience with network automation (Python, Terraform, Ansible)
Job Responsibility
Job Responsibility
  • Engineer and support global enterprise network infrastructure across routing, switching, firewalls, SASE, WAN/SD-WAN, wireless, and proxy services
  • Troubleshoot complex network issues across multiple infrastructure layers
  • Support hybrid cloud networking architectures across Azure, AWS, or GCP
  • Manage and optimize next-generation firewall policies using platforms such as Palo Alto Networks
  • Support Cisco and Meraki switching and wireless infrastructure
  • Design and maintain global WAN connectivity, including high-performance cloud connectivity such as Megaport
  • Support cloud networking solutions leveraging platforms such as Aviatrix
  • Implement network automation and Infrastructure as Code (IaC) to improve operational efficiency
  • Leverage observability platforms such as Sumo Logic to improve monitoring and visibility
  • Contribute to AI-driven network operations and automation initiatives
  • Fulltime
Read More
Arrow Right

Data Center Network Engineer, AI Repair

Are you passionate about cutting-edge technology and its implementation on a glo...
Location
Location
United States , Sarpy County
Salary
Salary:
193000.00 - 271000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 10+ years of work experience with designing and deploying large-scale data center network infrastructure
  • Experience with data center design, structured cabling, and fiber optic network infrastructure
  • Demonstrated knowledge of NICs, optical transceivers, AOC, and DAC for high-speed interconnects
  • Demonstrated knowledge of TCP, IPv4/6, Routing Protocols, and related network services (DHCP, DNS)
  • Experience with implementing tooling and automation for network configuration and monitoring
  • Track record of solving complex problems, executing tactically, and delivering on infrastructure projects
  • Experience to work independently, stay organized, multitask, prioritize, and communicate effectively
  • 15 to 20% travel required based on project demand
Job Responsibility
Job Responsibility
  • Work cross functionally to maintain AI and DC network health while leading long term initiatives to drive for better repair and greater efficiencies
  • Contribute to organizational level strategy and establish team roadmaps and goals that align with current business priorities and organizational strategy
  • Accountable for driving improvements in technical references, NPI process, and deployment/operations documentation standards in support of continuous improvement initiatives
  • Facilitate clear communication of technical requirements, risks, and escalations to leadership and cross-functional partners
  • Integrate new networking technologies into ENS operations processes to efficiently scale Meta’s AI, Compute, and Network capabilities
  • Develop new operational support models for deploying and operating new data center infrastructure
  • Influence design of data center, network, server, and applications to ensure seamless integration
  • Publish technical reference, process, and training documentation for a global network deployment and operations teams
  • Build and nurture business relationships with key stakeholders, partners, and vendors
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Principal Software Engineer (Cloud Network and AI Security)

Location
Location
United States , Santa Clara
Salary
Salary:
147000.00 - 237500.00 USD / Year
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong expertise in programming languages such as C, Python, and Go
  • Proficiency in low-level linux systems programming technologies
  • Familiarity with cloud service architectures, including compute, networking, load-balancers, and identity management
  • Experience with cloud deployments on platforms such as Azure, AWS, and GCP
  • Experience with network virtualization technologies such as DPDK, XDP
  • Strong knowledge in network security fields such as stateful firewall, packet processing, and network ACL
  • Strong scripting skills in bash and Python
  • Familiarity with Terraform/CFT/PowerShell ARM templates
  • BS/MS degree in Computer Science, Computer Engineering, Electrical Engineering or equivalent or equivalent military experience
  • 7+ years of related engineering experience
Job Responsibility
Job Responsibility
  • Design and implement new features and integrations for virtualization features across diverse cloud environments and deployments
  • Engage in all phases of the product development cycle from concept definition, design, through implementation, and testing
  • Develop comprehensive functional specifications, evaluate task requirements and timelines, and contribute to design, development, debugging, and support processes
  • Hands-on experience with virtualization technologies, various hypervisors, system software, and networking
What we offer
What we offer
  • Restricted stock units
  • Bonus
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, AI - Global Treasury Payment Network

As the senior engineer in our new Generative AI team, you will own the end-to-en...
Location
Location
United States , San Francisco
Salary
Salary:
200000.00 - 250000.00 USD / Year
airwallex.com Logo
Airwallex
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years in backend software development, with a focus on AI
  • Experience building internal or customer-facing products with LLMs
  • Expertise in building efficient, large-scale software systems
  • Skill in writing high-quality, maintainable code
  • Experience in design and development of large-scale distributed, high concurrency, high load, high availability systems
  • Excellent communication and mentoring abilities
  • A relevant degree in Computer Science, Mathematics or related fields
Job Responsibility
Job Responsibility
  • Build AI solutions to enable growth and drive internal efficiency across the business
  • Design and implement robust API and system architecture for new AI applications
  • Collaborate closely with a cross-functional team to bring these AI solutions to life
  • Embrace the challenges of a fast-paced, high-growth environment with a focus on innovation
What we offer
What we offer
  • Offers Equity
  • Offers Bonus
  • medical, dental, and vision insurance
  • a 401(k) plan
  • short-term and long-term disability
  • basic life insurance
  • well-being benefits
  • 20 paid days of vacation
  • 12 paid days of company holidays
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, AI - Global Treasury Payment Network

The senior engineer in our new Generative AI team will own the end-to-end develo...
Location
Location
Singapore , Singapore
Salary
Salary:
180000.00 - 245000.00 SGD / Year
airwallex.com Logo
Airwallex
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years in backend software development, with a focus on AI
  • Experience building internal or customer-facing products with LLMs
  • Expertise in building efficient, large-scale software systems
  • Skill in writing high-quality, maintainable code
  • Experience in design and development of large-scale distributed, high concurrency, high load, high availability systems
  • Excellent communication and mentoring abilities
  • A relevant degree in Computer Science, Mathematics or related fields
Job Responsibility
Job Responsibility
  • Build AI solutions to enable growth and drive internal efficiency across the business
  • Design and implement robust API and system architecture for new AI applications
  • Collaborate closely with a cross-functional team to bring these AI solutions to life
  • Embrace the challenges of a fast-paced, high-growth environment with a focus on innovation
What we offer
What we offer
  • Offers Equity
  • medical, dental, and vision insurance
  • a 401(k) plan
  • short-term and long-term disability
  • basic life insurance
  • well-being benefits
  • 20 paid days of vacation
  • 12 paid days of company holidays in a calendar year
  • Fulltime
Read More
Arrow Right

Sr. Applied AI Engineer, Generative AI Applications

HBS Foundry is a new initiative at Harvard Business School that helps founders b...
Location
Location
United States , Boston
Salary
Salary:
Not provided
hbs.edu Logo
HBS
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum of seven years’ post-secondary education or relevant work experience
  • A bachelor's degree in computer science, data science, or a related field is required
  • Strong proficiency in programming languages such as Python and Java
  • Experience with AI frameworks like TensorFlow and PyTorch
  • Strong analytical and logical thinking skills to tackle complex problems
  • Ability to work effectively with cross-functional teams
  • Familiarity with cloud platforms is often beneficial
Job Responsibility
Job Responsibility
  • Lead comprehensive applications/web development for highly complex projects
  • Deliver strategic and expert coding
  • focus on overarching development strategy for a large, complex, multi-faceted application
  • May manage a number of projects simultaneously
  • Design, develop, and deploy state-of-the-art generative AI models
  • Build trust and collaboration by being present on-site and engaging directly with colleagues and various constituents
  • Develop and train AI models: Create and train machine learning models and neural networks for tasks like data analysis, natural language processing, and computer vision
  • Implement and integrate AI: Write code to integrate AI functionality into existing software and ensure seamless deployment with other systems and APIs
  • Data preprocessing: Clean, prepare, and manage large datasets to be used for training and fine-tuning models
  • Optimize performance: Design and optimize machine learning algorithms for scalability and high performance and continuously monitor and test AI systems for improvement
What we offer
What we offer
  • Generous paid time off including parental leave
  • Medical, dental, and vision health insurance coverage starting on day one
  • Retirement plans with university contributions
  • Wellbeing and mental health resources
  • Support for families and caregivers
  • Professional development opportunities including tuition assistance and reimbursement
  • Commuter benefits, discounts and campus perks
Read More
Arrow Right