This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are seeking a highly skilled and adaptable Network & Security Engineer with strong Web/DevOps experience to architect, secure, and automate the advanced networking infrastructure supporting our AI Factory. This role involves managing high-bandwidth fabrics (RoCE, InfiniBand) across Dell/NVIDIA GPU clusters, ensuring secure and resilient connectivity for AI/ML workloads. You will collaborate closely with DevOps and AI platform teams to streamline deployment, monitoring, and incident response for cloud-native applications running on OpenShift and Kubernetes, while integrating cutting-edge GPU orchestration and data management solutions.
Job Responsibility:
Design, deploy, and secure high-throughput network architectures supporting RoCE v2 fabrics for AI/ML workloads
Manage Dell/NVIDIA-based GPU networking environments, including backend and frontend switch configurations