Software Engineer (Leadership), Host Networking

Meta

Location:
India, Bangalore

Contract Type:
Not provided

Salary:
Not provided

Job Description:

This Software Engineer will work on NIC and transport solutions that address the growing demands of the distributed fleet of accelerators running our AI workloads. Do you want to work on transport for large-scale AI clusters? Do you want to develop innovative solutions to our challenges and ship them into production? Then this role on our host networking teams is for you!

Job Responsibility:

  • Own the design and architecture of drivers and firmware for NICs supporting AI workloads
  • Collaborate with ASIC and HW teams and external partners to build infrastructure-scale embedded solutions
  • Mentor team members who will also build driver and firmware software
  • Work with cross-functional teams to release software to production and support it there
  • Help build the roadmap for our solutions and the team

Requirements:

  • Bachelor's degree in Computer Science/Engineering or a relevant technical field and 10+ years of experience
  • Proficiency in coding in C/C++
  • Experience building drivers and/or firmware for embedded infrastructure systems running Linux
  • Experience with RDMA/RoCE and/or the TCP stack for Linux
  • Experience with hardware bring-up

Nice to have:

  • Experience developing drivers and/or firmware for the networking stack in Linux, preferably for NICs
  • Experience with congestion control for RDMA/RoCE networks
  • Experience with simulation environments with QEMU and/or emulation environments
  • Working knowledge of collectives (XCCL) and GPUDirect for AI workloads

Additional Information:

Job Posted:
January 24, 2026

Similar Jobs for Software Engineer (Leadership), Host Networking

Staff Platform Software Engineer

EarnIn is seeking a Staff Platform Engineer to lead the strategic design, automa...
Location:
India, Bengaluru
Salary:
Not provided
EarnIn
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s or Master’s Degree in Computer Science or equivalent industry experience
  • 7+ years of experience in cloud infrastructure, managing large-scale, high-availability, customer-facing distributed systems
  • Proven experience mentoring and guiding senior engineers, driving technical decisions, and leading company-wide cloud initiatives
  • Mastery of public cloud providers, specifically AWS (EKS, DynamoDB, Aurora, Kinesis, etc.)
  • Strong expertise in containerized microservices running on Kubernetes
  • Deep knowledge of automation and configuration management tools (Terraform, Ansible)
  • Expertise in CI/CD pipelines and tools, including Jenkins, GHA, Argo CD, Spinnaker, and FluxCD or similar
  • Experience with advanced observability tools (DataDog, CloudWatch)
  • Track record of leading cost optimization / FinOps initiatives, performance tuning, and operational excellence projects
  • Proven ability to drive cross-functional initiatives with engineering, product, and business teams
Job Responsibility:
  • Serve as a key architect and thought leader in the cloud infrastructure domain, guiding the team on best practices
  • Mentor and coach senior engineers across the company in advanced cloud operations practices
  • Provide oversight of hosted Linux and Windows systems, networks, databases, and applications, identifying and solving critical performance, scalability, and stability challenges
  • Design and develop reusable components and operational strategies to enhance the scalability, performance, and monitoring of cloud systems
  • Collaborate with other senior engineers to create technical solutions that address company-wide cloud challenges
  • Lead the establishment and continuous evolution of infrastructure-as-code best practices, driving automation, self-healing, and security standards
  • Drive operational cost savings through service optimizations, autoscaling strategies, and distributed processing architectures
  • Collaborate closely with cross-functional teams, including security, engineering, and business teams, to ensure that operational strategies align with company-wide objectives
  • Provide thought leadership in company-wide initiatives such as observability, automation, and disaster recovery
  • Continuously evaluate existing tools and processes, lead efforts to socialize, present, and implement enhancements for optimal operational efficiency
What we offer:
  • healthcare
  • internet/cell phone reimbursement
  • a learning and development stipend
  • opportunities to travel to our Mountain View HQ
  • Full-time

Senior Security Engineer

The Senior Security Engineer will provide hands-on technical leadership within t...
Location:
United Kingdom, Leeds; Thame
Salary:
65000.00 - 75000.00 GBP / Year
PEXA UK
Expiration Date:
Until further notice
Requirements:
  • Proactive, can-do attitude to get things done quickly and efficiently
  • Strong collaboration and communication skills
  • Willingness to contribute ideas to the security programme
  • Demonstrable first-hand experience in achieving organisational adherence to security best practices
  • Experience in the practical protection of a remote working laptop estate and SaaS cloud solutions
  • Experience in identity and access management solutions
  • Experience in device business automation and updates
  • Experience in the security aspects of cloud web application hosting and defence measures like WAF
Job Responsibility:
  • Maintenance and Operational Security: Ensure all security solutions remain operationally effective
  • Ensure technical teams patch applications, systems, software, and hardware in a timely manner
  • Maintain and audit secure configurations for devices, applications, and cloud environments
  • Access Control and Identity Management: Conduct regular user and privileged account reviews
  • Manage and monitor Privileged Identity Management (PIM) profiles and elevated access accounts
  • Coordinate with IT and HR for onboarding/offboarding
  • Tool, Infrastructure, and Encryption Management: Maintain and optimise security infrastructure and tools
  • Oversee encryption key and certificate management
  • Work with vendors and internal teams to ensure tools remain current
  • VPN, Network & Firewall Security: Design, configure, and maintain secure VPN and Zero-Trust network solutions
What we offer:
  • Your growth: We encourage you to hit your personal and professional learning and development goals with our tailored programs and tools
  • Your wellness: We care about your holistic wellbeing
  • Your work/life blend: We want to help you create your ideal work/life blend
  • Full-time

Production Engineering Manager

Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologie...
Location:
United States, New York
Salary:
269082.00 - 297550.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree (or foreign degree equivalent) in Computer Science, Engineering, Information Systems, Analytics, Mathematics, Physics, Applied Sciences, or a related field, and 4 years of work experience in the job offered or a related occupation
  • Requires 4 years of experience in the following: Systems, networking, and troubleshooting
  • Drafting and reviewing code
  • Coding in Python, PHP/Hack, and Java
  • Software development in a Unix environment and data processing environment
  • Managing sharded workload fitting at the scale of hundreds of thousands of hosts
  • Technical leadership on large-scale, performant, multi-component systems
  • Automation of capacity management using autoscaling, continuous load testing software, and capacity modeling software
  • Performance management, roadmap planning, and long-term goal setting using OKRs
Job Responsibility:
  • Support and lead engineers working on Meta's products and services, at different layers of the stack, on challenges related to scalability, reliability, performance and efficiency of systems
  • Understand and contribute to technical architectures, capacity plans, tooling needs, automation plans, product launch plans and create comprehensive plans for prioritizing technical and resourcing challenges
  • Drive technical architecture discussions, even on subjects you haven't had direct experience working with
  • Develop lasting partnerships with product management, program management, network engineering, software engineering and other related groups to build and improve our ever-growing large-scale distributed infrastructure and product environment
  • Empower engineers to develop their careers, matching their strengths with projects tailored to their skill levels, long-term skill development, personalities, and work styles
  • Help build and enrich an inclusive work environment comprised of people from diverse backgrounds
  • Assess employee performance on an ongoing basis, address under-performance, and recognize and promote performance
  • Work closely with dedicated recruiting staff to expand the team including interviewing candidates, participating in conferences/events, and on-boarding new employees
  • Balance the need to “keep things running” with allocating time to long-term, high-impact projects
  • 25% domestic and international travel required
What we offer:
  • bonus
  • equity
  • benefits

D&A Domain Architect - Snowflake

As a partner of our company's Enabling Functions (EF), we the Enabling Functions...
Location:
Spain, Mollet del Valles, Barcelona
Salary:
Not provided
MilliporeSigma
Expiration Date:
Until further notice
Requirements:
  • University degree preferably in Information Technology, Computer Science, Finance, Business Administration, or a related field
  • 5+ years experience in data engineering, application design, analytics, and visualization within a global organization
  • Experience with Finance core business processes
  • Strong technical skills in systems architecture, cloud computing, cybersecurity, and data management
  • Proven technical leadership experience in agile software development, including leading and mentoring engineering teams
  • Highly engaged expert with in-depth knowledge in Snowflake and ideally in AWS, Palantir Foundry, or SAP Business Data Warehouse / Cloud
  • Proficiency in ETL processes, Spark, Kafka, and Python for distributed computation (preferably PySpark)
  • Familiarity with SQL, R, REST APIs and basic design/visual competencies
  • Ability to work both individually and collaboratively in global matrixed product teams
  • Ability in establishing software engineering best practices including DevOps methodologies
Job Responsibility:
  • Lead the design of cloud-native data & analytics solutions utilizing Snowflake, Palantir Foundry, and AWS
  • Guide lighthouse implementations
  • Define the target architectural vision and govern the future implementation of the Finance Data Warehouse on Snowflake and its integration into our Analytics Ecosystem
  • Collaborate with various teams to ensure that product architectures are scalable, secure, and aligned with the overall technology strategy
  • Establish best practices and standards that guide product development and ensure consistent quality across the EF Data, Analytics and AI portfolio
  • Guide and consult development teams and stakeholders in selecting and implementing suitable technology solutions
  • Monitor architecture-related metrics and KPIs to ensure continuous improvement
  • Engage actively in both internal and external people networks for sharing knowledge, mentoring colleagues, and building capabilities across the organization
  • Conduct technology scouting, support vendor RFPs, and host knowledge sharing sessions
  • Represent the team and the company at various internal and external events
What we offer:
  • Financial & Protection benefits
  • Health and Wellbeing benefits (e.g., health checkups or medical insurance)
  • Family benefits (e.g., Fertility Benefit)
  • Time Away benefits
  • Life-style benefits (e.g., flexible working, gyms, car benefits, shopping discounts)

Azure Networking Software Engineer

Microsoft Azure’s core priority is to be world's most trusted, secure, and globa...
Location:
India, Multiple Locations
Salary:
Not provided
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to engage in site-reliability engineering practices
  • Commitment to collaboration and teamwork and ability to deliver via influence
  • Demonstrated problem solving and debugging skills
  • 2+ years of experience developing software hosted in Azure, AWS, or other similar Cloud platforms
  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility:
  • Design, implement, and run highly scalable SDN services that enable networking of millions of services and VMs with timely execution and high quality
  • Responsible for ensuring that highly usable, reliable and secure services are delivered to delight our customers
  • Leverage subject-matter expertise of product features and partner with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items
  • Exhibit thought leadership in helping take the service forward with new capabilities and innovation, and improving the experience on existing capabilities
  • Effectively create clarity of status, progress and blockers affecting large projects
  • Act as a Designated Responsible Individual (DRI) and guide other engineers by developing and following the playbook, working on call to monitor the service for degradation, downtime, or interruptions, alerting stakeholders about status, and initiating actions to restore the system/product/service for simple and complex problems when appropriate
  • Full-time

Staff Site Reliability Engineer

Ever since we started in 2007, Sunrun has been at the forefront of connecting pe...
Location:
United States, Lehi
Salary:
242050.00 USD / Year
Sunrun
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Computer Information Systems, Software Engineering, or a closely related field
  • 5 years of experience as a Software Developer using Microservices hosted in Azure
  • 5 years of experience with Virtualization and cloud computing
  • 5 years of experience with Object-Oriented Design (OOD) and Object-Oriented Programming (OOP)
  • 5 years of experience building software solutions in an engineering environment using Python & Shell scripting
  • 5 years of experience with Network analysis, debugging and troubleshooting with Wireshark & Git
Job Responsibility:
  • Provide strategic leadership in designing, implementing, and managing the overall infrastructure strategy for our organization
  • Leverage cloud platforms (e.g., AWS, Azure) to design, deploy, and manage scalable infrastructure solutions
  • Spearhead the definition of advanced monitoring requirements and elevate SLAs
  • Collaborate with the engineering team and TPM to implement and enhance monitoring practices
  • Expertly convey intricate technical information to diverse stakeholders with clarity and precision
  • Provide leadership in integrating advanced SRE principles into applications and services
  • Lead the implementation of sophisticated system design measures for heightened security, performance, and resiliency
  • Develop strategic notification strategies for production outages
  • Leverage SLOs and SLIs to measure and optimize availability, latency, and response time
  • Lead and strategize emergency response efforts, conduct retrospectives with RCA, and manage on-call workloads effectively
What we offer:
  • Medical/Dental/Vision Insurance
  • Life Insurance
  • Disability Insurance
  • 401k Plan + Company Match
  • Stock Purchase Plan
  • Paid Vacations/Holidays
  • Paid Baby Bonding Leave
  • Employee Discounts
  • PowerU - 100% Funded Education Programs
  • Employee Donation Matching
  • Full-time

IT Systems Engineer

The IT Systems Engineer is responsible for the planning, design, integration, an...
Location:
United States, Vandenberg SFB
Salary:
150000.00 - 170000.00 USD / Year
Delta Solutions & Strategies
Expiration Date:
Until further notice
Requirements:
  • TS/SCI Security Clearance
  • Minimum of 7 years of progressive experience in planning, designing, and implementing enterprise-scale systems, including full lifecycle project engineering, integration, and cross-functional team leadership
  • Master’s degree in Computer Science, Information Systems, Cybersecurity, Computer Engineering, or a related IT discipline, or an equivalent combination of accredited education and experience
  • Must have, or be able to obtain, a DoD 8140
Job Responsibility:
  • Design, integrate, and sustain IT systems, services, and infrastructure aligned to mission, operational, and security requirements
  • Develop architecture diagrams, implementation strategies, and configuration standards for enterprise and mission systems
  • Integrate hardware, software, virtualization, and network components into cohesive and compliant system solutions
  • Develop detailed implementation plans, including Work Breakdown Structures (WBS), milestones, and schedules
  • Produce and maintain system engineering documentation, including CONOPS, configuration guides, SOPs, and technical baselines
  • Support generation of risk assessments, fallback procedures, and deployment/sustainment checklists
  • Research hardware, software, and cloud service components based on functional requirements, interoperability, and lifecycle compatibility
  • Generate cost estimates and Bills of Material (BOMs) to support planning, procurement, and budgeting
  • Coordinate with acquisition or supply chain personnel to source equipment, verify lead times, and conduct technical evaluations
  • Implement system hardening and security baselines in accordance with applicable STIGs and DoD security frameworks
What we offer:
  • medical
  • dental
  • vision
  • life insurance
  • 401(k)
  • PTO
  • paid holidays
  • parental, military, and jury duty paid leave
  • Full-time

Senior Consultant Infrastructure Engineer

Infrastructure Engineers help clients build and evolve systems that client organ...
Location:
Singapore, Singapore
Salary:
Not provided
Thoughtworks
Expiration Date:
Until further notice
Requirements:
  • You can contribute to the design and implementation of enterprise and/or web-scale hosting platforms and can administer application servers, web servers and databases
  • You have a deep understanding of cloud and virtualization platforms, infrastructure automation and application hosting technologies
  • You have experience working with software delivery teams, and understand DevOps philosophies, Agile methods, Infrastructure as Code and how to apply them to your work
  • You have a history of working with at least one IaaS cloud platform, and two or more application runtime platforms including physical servers, virtual servers, container clusters, serverless and databases
  • You can write scripts using at least one scripting language and are comfortable building one or more of: Linux servers, Windows servers, or container clusters
  • You have experience with continuous integration and continuous delivery tools with different tech stacks
  • You’ve previously worked with monitoring systems for availability, performance or security
  • You have an understanding of security concerns, threats and approaches for dealing with them, including infrastructure platform vulnerabilities, secrets management, network security and software supply chain security
Job Responsibility:
  • You will work within teams to launch projects through hands-on implementation, evaluate existing infrastructure and drive improvements
  • You will explore the client’s needs and collaborate on building a technical roadmap and impactful solution that will support their ambitious business goals
  • You will help shape and build Thoughtworks’ cloud and infrastructure practice through collaboration with other practitioners, business development, marketing and capabilities development teams
  • You will ensure and build the controls and processes for continuous delivery and evolution of infrastructure and applications, driving automation through all stages of the process
  • You will take a proactive role in monitoring and ensuring that technical expectations of deliverables are consistently met on projects
  • You will provide expertise and guidance in the areas of DevOps, cloud, platform and infrastructure engineering, both internally and in client sites
  • You will establish trusting and thoughtful partnerships with a client’s engineering leadership
  • You will adjust and suggest innovative solutions to current constraints and business policies
What we offer:
  • There is no one-size-fits-all career path at Thoughtworks: how you develop your career is entirely up to you
  • Your career is supported by interactive tools, numerous development programs and teammates who want to help you grow
  • Full-time