This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Network and Security DevOps Engineer will be responsible for designing and securing advanced networking infrastructures for AI workloads. The role requires a deep understanding of high-speed networking protocols, experience with Dell/NVIDIA GPU infrastructure, and strong network security expertise. The ideal candidate will have a bachelor's degree in a related field and at least 5 years of experience in network engineering and security practices.
Job Responsibility:
Design, deploy, and secure high-throughput network architectures supporting RoCE v2 fabrics for AI/ML workloads
Manage Dell/NVIDIA-based GPU networking environments, including backend and frontend switch configurations (e.g., Z9432F, S5248F)
Implement and maintain network security controls across RDMA-enabled fabrics, ensuring isolation and integrity of GPU clusters
Manage DC fabrics like VXLAN-EVPN
Implementation of QoS for DC environment with very high performance (marking, Queing, WRED, smart/adaptative buffering, smart ECMP/multi-path load-balancing…etc…)
Configure and maintain network security devices including firewalls Palo Alto
Monitor and analyse traffic across 400G/800G fabrics using telemetry tools
Respond to security incidents, conduct root cause analysis, and support forensic investigations
Ensure compliance with industry security standards and regulatory requirements
Collaborate with security teams to implement network segmentation strategies
Experience on orchestration environments (Openshift/Kubernetes/Docker) focusing on GPU-aware scheduling and network policies and security enforcements
Integrate security controls and best practices into automated build and deployment processes
Monitor web services for performance, availability, and security issues using Prometheus, Grafana, ELK stack, etc
Support web application security testing (SAST/DAST) and vulnerability remediation
Develop scripts and tools to automate repetitive tasks and improve operational efficiency
Requirements:
Deep understanding of RoCE v2, InfiniBand, and high-speed networking protocols
Experience with Dell/NVIDIA GPU infrastructure and switch management (Z9432F, S5248F)
Strong background in network security technologies and RDMA-aware security practices
Familiarity with OpenShift, Kubernetes, and container networking
Strong background in network security technologies (firewall palo alto)
Good knowledge of architecture and technologies of QoS for High-performance Datacenter (marking, Queing, WRED, smart/adaptative buffering, smart ECMP/multi-path load-balancing…etc…), and of lossless fabrics for AI/ML environnement AI/ML : PFC, DCQCN, le « RoCE Adpatative Routing », NVDIA etc…
Familiarity with AI/ML pipeline security and GPU orchestration platforms (RUN:AI, NVIDIA Enterprise)
Experience scripting with Python, Bash, or PowerShell
Understanding of web services, APIs, and HTTP security concepts
Knowledge of monitoring/logging tools (Prometheus, Grafana)
Strong troubleshooting and analytical skills with a security mindset
Bachelor’s degree in Computer Science, Information Security, Network Engineering, or equivalent experience
At least 5 years of experience in network engineering and security practices
Nice to have:
Hands-on experience with Kubernetes and container networking/security
Familiarity with CI/CD pipelines and DevOps tools (Jenkins, GitLab CI, CircleCI)
Proficiency in Infrastructure as Code tools (Terraform, Ansible, CloudFormation)
Certifications such as CCNP Network
Experience with security / network segmentation
Experience working in Agile and DevSecOps environments