AI/HPC System Performance Engineer, PhD

Meta

Location:
Menlo Park, United States

Contract Type:
Not provided

Salary:
122,000 - 181,000 USD / Year

Job Description:

Meta's AI Training and Inference Infrastructure is growing exponentially to support an ever-increasing number of AI use cases. This results in a dramatic scaling challenge that our engineers deal with on a daily basis. We need to build and evolve the network infrastructure that connects myriads of training accelerators, such as GPUs. In addition, we need to ensure that the network runs smoothly and meets the stringent performance and availability requirements of RDMA workloads, which expect a lossless fabric interconnect. To improve the performance of these systems, we constantly look for opportunities across the stack: network fabric, host networking, comms lib and scheduling infrastructure.

Job Responsibility:

  • Active member of a multi-disciplinary team to develop solutions for large scale training systems
  • Responsible for the overall performance of the communication system, including performance benchmarking, monitoring and troubleshooting production issues
  • Identify potential performance issues across the stack (comms lib, RDMA transport, host networking, scheduling and network fabric), then develop and deploy innovative solutions to address them

Requirements:

  • Currently has, or is in the process of obtaining, a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • BS/MS/PhD in relevant fields (EE, CS), with 2+ years work experience
  • Experience with using communication libraries, such as MPI, NCCL, and UCX
  • Experience with developing, evaluating and debugging host networking protocols such as RDMA
  • Experience with triaging performance issues in complex scale-out distributed applications
  • Must obtain work authorization in country of employment at the time of hire and maintain ongoing work authorization during employment

Nice to have:

  • Understanding of AI training workloads and demands they exert on networks
  • Understanding of RDMA congestion control mechanisms on IB and RoCE Networks
  • Understanding of the latest artificial intelligence (AI) technologies
  • Experience with machine learning frameworks such as PyTorch and TensorFlow
  • Experience in developing systems software in languages like C++

What we offer:

  • bonus
  • equity
  • benefits

Additional Information:

Job Posted:
February 19, 2026
