CrawlJobs Logo

Software Engineer (Leadership), Host Networking

India, Bangalore · Job Posted January 24, 2026
Apply Position
Job Link Share

Job Description

This Software Engineer will be working on NICs and Transport solution addressing growing demands of the distributed fleet of accelerators for our AI workloads. Do you want to work on transport for large scale AI clusters? Do you want to develop innovative solutions to our challenges and ship them into production? This role on our host networking teams is for you!

Job Responsibility

  • Own design and architecture of Drivers and Firmware for NICs supporting AI workloads
  • Collaborate with ASIC and HW teams, and external partners in building infrastructure scale embedded solutions
  • Mentor team members who will also work on building driver and firmware software
  • Work with cross functional teams through releasing software to production and supporting them
  • Help build roadmap for our solutions and the team

Requirements

  • Bachelor's degree in Computer Science/Engineering or relevant technical field and 10+ years of experience
  • Proficiency in coding in C/C++
  • Experience building driver and/or firmware for embedded infrastructure systems running Linux
  • Experience with RDMA/RoCE and/or TCP stack for Linux
  • Experience with Hardware Bringup

Nice to have

  • Experience developing Drivers and/or Firmware for Networking stack in Linux, preferably for NICs
  • Experience with Congestion control for RDMA/RoCE networks
  • Experience with simulation environments with Qemu and/or emulation environments
  • Working knowledge of Collectives (XCCL) and GPU direct for AI workloads

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Software Engineer (Leadership), Host Networking

8 matching positions

Software Engineer (Technical Leadership) - Host Networking

Meta is investing heavily on the Meta Cloud. As part of this push we want to bui...
Location
Location
United States , Menlo Park, CA
Salary
Salary:
219000.00 - 301000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 10+ years software development experience in industry settings or PhD with 4+ years of experience
  • 4+ Years Proven experience designing and shipping networking dataplane/control-plane systems in production (e.g., host networking, L4/L7 load balancing, NAT/conntrack, firewalls/policy, network virtualization)
  • Hands-on Kubernetes networking experience in production (e.g., CNI, Services, NetworkPolicy/policy models, service connectivity/load balancing integration)
  • Proficiency in C/C++ and at least one scripting language (Python/Shell Scripting)
  • Experience with developing and automating test suites
  • Demonstrated experience working across disciplines to align on technical decisions and deliver integrated solutions
Job Responsibility
Job Responsibility
  • Architect and lead delivery of Meta Compute’s cloud-native host networking platform, owning end-to-end technical direction from design through production rollout
  • Build/modernize Kubernetes networking features integrated with Meta’s host networking stack (e.g., dataplane integration, operability, scaling characteristics)
  • Design and implement core networking primitives: network virtualization, load balancing/service connectivity, and distributed firewalling/policy enforcement
  • Drive time-to-market execution: define milestones, de-risk key technical choices early and deliver iterative production increments
  • Lead broad cross-functional alignment across compute, storage, platform, security, and hardware/fabric partners—owning interfaces/contracts and end-to-end outcomes
  • Leverage and influence open-source ecosystems: evaluate open-source building blocks, contribute improvements where needed, and ensure Meta’s solution is maintainable and operable long-term
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Azure Networking Software Engineer

Microsoft Azure’s core priority is to be world's most trusted, secure, and globa...
Location
Location
India , Multiple Locations
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to engage in site-reliability engineering practices
  • Commitment to collaboration and teamwork and ability to deliver via influence
  • Demonstrated problem solving and debugging skills
  • 2+ years of experience developing software hosted in Azure, AWS, or other similar Cloud platforms
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Design, implement, and run highly scalable SDN services that enable networking of millions of services and VMs with timely execution and high quality
  • Responsible for ensuring that highly usable, reliable and secure services are delivered to delight our customers
  • Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items
  • Exhibit thought leadership in helping take the service forward with new capabilities and innovation, and improving the experience on existing capabilities
  • Effectively create clarity of status, progress and blockers affecting large projects
  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate
  • Fulltime
Read More
Arrow Right

Staff Platform Software Engineer

EarnIn is seeking a Staff Platform Engineer to lead the strategic design, automa...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
earnin.com Logo
EarnIn
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s Degree in Computer Science or equivalent industry experience
  • 7+ years of experience in cloud infrastructure, managing large-scale, high-availability, customer-facing distributed systems
  • Proven experience mentoring and guiding senior engineers, driving technical decisions, and leading company-wide cloud initiatives
  • Mastery of public cloud providers, specifically AWS (EKS, DynamoDB, Aurora, Kinesis, etc.)
  • Strong expertise in containerized microservices running on Kubernetes
  • Deep knowledge of automation and configuration management tools (Terraform, Ansible)
  • Expertise on CICD pipelines and tools, including Jenkins, GHA, Argo CD, Spinnaker & FluxCD or similar
  • Experience with advanced observability tools (DataDog, CloudWatch)
  • Track record of leading cost optimization / FinOps initiatives, performance tuning, and operational excellence projects
  • Proven ability to drive cross-functional initiatives with engineering, product, and business teams
Job Responsibility
Job Responsibility
  • Serve as a key architect and thought leader in the cloud infrastructure domain, guiding the team on best practices
  • Mentor and coach senior engineers across the company in advanced cloud operations practices
  • Provide oversight of hosted Linux and Windows systems, networks, databases, and applications, identifying and solving critical performance, scalability, and stability challenges
  • Design and develop reusable components and operational strategies to enhance the scalability, performance, and monitoring of cloud systems
  • Collaborate with other senior engineers to create technical solutions that address company-wide cloud challenges
  • Lead the establishment and continuous evolution of infrastructure-as-code best practices, driving automation, self-healing, and security standards
  • Drive operational cost savings through service optimizations, autoscaling strategies, and distributed processing architectures
  • Collaborate closely with cross-functional teams, including security, engineering, and business teams, to ensure that operational strategies align with company-wide objectives
  • Provide thought leadership in company-wide initiatives such as observability, automation, and disaster recovery
  • Continuously evaluate existing tools and processes, lead efforts to socialize, present, and implement enhancements for optimal operational efficiency
What we offer
What we offer
  • healthcare
  • internet/cell phone reimbursement
  • a learning and development stipend
  • opportunities to travel to our Mountain View HQ
  • Fulltime
Read More
Arrow Right
New

Vp Gtsm Clientman Tribe And Singapore Support Lead

Join Barclays as a VP GTSM Clientman Tribe and Singapore Support Lead role, wher...
Location
Location
India , Pune
Salary
Salary:
Not provided
barclays.co.uk Logo
Barclays
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 13 years of L2/L3 Application Support Experience is Must
  • ITIL v3 or above foundation certified
  • proven ability to lead end to end Run the Bank (RtB) services across complex, business critical application estates, ensuring stability, availability, and operational excellence
  • deep expertise in Incident, Problem, Change, Release, and Major Incident Management, with a demonstrated ability to mature service processes from reactive to proactive models
  • strong capability in service resilience, disaster recovery, system recovery planning, and regulatory resilience testing (including assurance, audits, and evidence based controls)
  • ability to engage credibly with senior technology leaders, CIOs, and business stakeholders, translating service risks and performance into clear, outcome focused narratives
  • experience establishing and operating robust governance models covering risk, controls, regulatory commitments, compliance, and audit readiness within a regulated enterprise environment
  • demonstrated leadership in driving observability, automation, and GenAI enabled service management to reduce toil, improve MTTR, and enhance service predictability
  • ability to lead, inspire, and transform large, geographically distributed teams, including capability uplift, talent development, succession planning, and modern skill adoption
  • roles requires working pattern to support Singapore business hours and rotational on-call availability
Job Responsibility
Job Responsibility
  • Provision of technical support for the service management function to resolve more complex issues for a specific client of group of clients
  • develop the support model and service offering to improve the service to customers and stakeholders
  • execution of preventative maintenance tasks on hardware and software and utilisation of monitoring tools/metrics to identify, prevent and address potential issues and ensure optimal performance
  • maintenance of a knowledge base containing detailed documentation of resolved cases for future reference, self-service opportunities and knowledge sharing
  • analysis of system logs, error messages and user reports to identify the root causes of hardware, software and network issues, and providing a resolution to these issues by fixing or replacing faulty hardware components, reinstalling software, or applying configuration changes
  • automation, monitoring enhancements, capacity management, resiliency, business continuity management, front office specific support and stakeholder management
  • identification and remediation or raising, through appropriate process, of potential service impacting risks and issues
  • proactively assess support activities implementing automations where appropriate to maintain stability and drive efficiency
  • actively tune monitoring tools, thresholds, and alerting to ensure issues are known when they occur
What we offer
What we offer
  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution
  • Fulltime
Read More
Arrow Right

Senior Technical Support Engineer

This is **not** a ticket-queue role. This is the job you take if you *like* bein...
Location
Location
Mexico , Mexico DF
Salary
Salary:
Not provided
bentley.com Logo
Bentley Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years in senior technical support / escalation engineering / production support for enterprise software applications (SaaS, cloud-hosted, or hybrid strongly preferred).
  • Proven record owning P1/P0 escalations and guiding issues through resolution with Engineering/Operations partners.
  • Strong, practical expertise in: Azure cloud architecture concepts (identity, networking, compute, storage, monitoring)
  • Microsoft Windows (server and client), troubleshooting, performance, and diagnostics
  • Networking fundamentals and packet-level troubleshooting methods
  • API/integration troubleshooting and familiarity with modern auth patterns (OAuth/SSO concepts)
  • Demonstrated ability to interpret complex diagnostic data (e.g., packet captures, HAR files, deep logs) and drive clear hypotheses to closure.
  • Exceptional communication: able to explain deeply technical findings to both engineers and customer stakeholders
  • skilled at de-escalation under pressure.
  • Required: at least one current, role-relevant cloud certification (or evidence you can achieve within 90 days).
Job Responsibility
Job Responsibility
  • Technical Leadership in Escalations (Customer-Facing): Act as a senior technical lead during high severity (P1 / Sev1) customer escalations
  • Join and actively participate in live escalation bridges, providing: Technical diagnosis and direction
  • Clear explanation of system behavior and failure modes
  • Credible technical input to support customer and executive conversations
  • Partner closely with Escalation Managers, who own: Customer and executive messaging
  • Communication cadence
  • Stakeholder alignment
  • while you ensure the technical narrative is accurate, coherent, and defensible
  • Help stabilize emotionally charged situations by bringing clarity, structure, and technical confidence to live discussions
  • Deep Technical Diagnostics & Resolution (Prove root cause, no guessing, no vibes): Lead complex technical investigations across: Cloud hosted and hybrid application architectures
  • Fulltime
Read More
Arrow Right

National Real Estate Programs Solutions Architect

As our Real Estate Solutions Architect, you will be responsible for designing an...
Location
Location
United States , Englewood
Salary
Salary:
49.78 - 74.05 USD / Hour
americannursingcare.com Logo
American Nursing Care
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5-8 years of experience in technology solutions engineering, technical infrastructure, or technical operations
  • 1-3 years of experience in the healthcare or medical industry
  • Bachelors Degree
  • Subject matter expertise in multiple technology areas, such as mobile solutions, open source systems, SOA, cloud computing, security and identity management, data warehousing and analytics, voice and data network architectures, storage technologies, and SaaS, IaaS, and PaaS relationships
  • Strong time management and multi-tasking skills
  • Career and technical progression from hand-on support, development, design, and architecture to portfolio-level responsibilities including standards, strategy, product, and capability roadmaps
  • Demonstrated leadership driving technology solutions from design through implementation thru a flexible staffing model which leverages cross-functional teams comprised of employees, vendors, outsource partners, and contractors across multiple locations
  • Proven ability to drive complex technical solutions which may include both legacy and converging technology use at a portfolio or enterprise level through education and partnership with stakeholders
  • Proven ability to direct technical evaluations of IT issues and products
  • Awareness and understanding of emerging trends and technologies in IT and Healthcare
Job Responsibility
Job Responsibility
  • Design and optimize innovative technology solutions that address complex challenges within our real estate operations
  • Collaborate with real estate business leaders, IT teams, and external vendors to understand requirements, evaluate existing systems, and architect scalable, secure, and integrated solutions
  • Create technical roadmaps, define system specifications, oversee implementation phases, and ensure that our real estate technology investments align with strategic business objectives
  • Act as the CommonSpirit Technology Services champion with internal and external stakeholders regarding security, compliance, feature and product roadmaps, remediation, and service level agreements
  • Mentor, and coach Solution Architects I to build their skills and readiness in technology, portfolio, and platform responsibilities
  • Lead focused proof-of-concept activities for technology assessments, communicate findings, develop portfolio roadmaps, investment prioritization, and successful implementation and adoption
  • Be known as the subject matter expert for their portfolios/platforms and actively collaborate with other Solution Architects on IT standards, innovation and integration opportunities to enable new value chains and business efficiencies
  • Act as the IT standards owner for several technology, platform, or portfolio domains and facilitate a collaborative process to establish, gain consensus, and communicate standards
  • Drive adherence to IT strategy and IT standards in design processes for 3rd party services, products acquired, internal services, integrated hosted solutions, and hardware/software vendors
What we offer
What we offer
  • Medical, prescription drug, dental, vision plans, life insurance, paid time off (full-time benefit eligible team members may receive a minimum of 14 paid time off days, including holidays annually), tuition reimbursement, retirement plan benefit(s) including, but not limited to, 401(k), 403(b), and other defined benefits offerings
  • Fulltime
Read More
Arrow Right

Senior Information Security Engineer

The Sr. Information Security Engineering job collaborates with various business ...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in Information Security or IT Technology
  • 3+ years of experience leading complex enterprise-wide integration programs and efforts as an individual contributor
  • 3+ years of engineering experience with vulnerability management tools such as Nexpose, Tenable
  • 3+ years of engineering experience with operating systems such as Linux and Windows Server
  • 2+ years of self-leadership experience
  • 2+ years of experience writing Python, GRAPH (GQL)
  • 2+ years of experience working with services in AWS, GCP, OCI, and Azure
Job Responsibility
Job Responsibility
  • Provides operations and engineering support for critical security systems and services including servers, endpoint security, computer forensics, vulnerability/penetration assessment/mitigation, and security event management
  • Leads the cost/benefit evaluation of cloud solutions compared to virtual private networks, dedicated hosting, and in-house solutions
  • Reviews technical feasibility of adopting external cloud based IT platform and infrastructure services within the organization
  • Leads the identification of portions of the organization's IT platform/infrastructure with the highest potential return for cloud deployment
  • Facilitates implementation of the organization's global strategies and initiatives to enhance Information Technology plans, operations and procedures
  • Ensures the execution of vulnerability analysis and exploitation of applications, operating systems and networks
  • Reports identified intrusion or incident paths and methods discovered through testing and evaluation procedures
  • Designs, develops and implements countermeasures, systems integration and tools specific to cyber and information operations
  • Resolves and documents complex malware and intrusion issues within the system as they occur
  • Functions as an internal information security consultant on the standards, complex issues and best practices for the organization
Read More
Arrow Right

Cloud Solution Architect - Partner Practice Development

Partner Practice Builder Lead CSA is responsible for building thriving Customer ...
Location
Location
United States , Multiple Locations
Salary
Salary:
106400.00 - 203600.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Information Technology, Engineering, Business, Liberal Arts, or related field AND 4+ years experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting
  • OR equivalent experience.
  • Microsoft is unable to sponsor a work visa for this role due to the nature of the role’s job duties.
  • 4+ years experience working in a customer-facing role (e.g., internal and/or external).
  • 4+ years experience working on technical projects.
  • Technical Certification in Cloud (e.g., Azure, Amazon Web Services, Google, security certifications).
  • Experience building a product or program from 0 to 1 with 2k+ users/customers
  • 5+ years of experience working with Partners or within a Partner ecosystem
  • 5+ years of experience working with a major cloud service provider
  • Hands-on experience with developing learning, onboarding, training collateral and/or Intellectual Property (VBDs, Tech Talks, etc.) for field audiences
Job Responsibility
Job Responsibility
  • Identify 4-5 beacon metrics to track Practice Building ROI monthly
  • Develop the Capability & Maturity Assessment for Support, Azure, ABS, and Security Practice Builders
  • Develop the CSAM playbook with clear mapping of enablement collateral (VBD, Webinar, Tech Talks, etc.) to attributes in Support Capability and Maturity Assessment
  • Develop CSAM onboarding materials for UFP and Practice Builders
  • Develop guidance to develop Practice Builder Partner Success Plan and Monthly Partner Success Delivery Reviews
  • Identify gaps in existing IP based on Partner needs and work with GSA and CSS teams to develop content
  • Maintain a comprehensive list of Practice Building IP across all modalities (Offline & Live VBDs, Labs, Tech Talks, Webinars, Learning Paths, etc.)
  • Host Quarterly Advisory Board with Partner Operations & Delivery leaders for ongoing feedback on Practice Building impact and requirements for future
  • Manage the end-to-end delivery of 1: many motion across all support offers (Live: Open Workshops, Accelerators, Webinars, Offline: ODA, Tech Talks, Learning Paths, Digital Learning)
  • Actively participate in CSAM community rhythms and embed Partner & UFP collateral in existing rhythms
  • Fulltime
Read More
Arrow Right