CrawlJobs Logo

Ai/hpc System Performance Engineer, Phd

United States, Menlo Park 122000.00 - 181000.00 USD / Year · Job Posted February 19, 2026
Apply Position
Job Link Share

Job Description

Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing uses cases of AI. This results in a dramatic scaling challenge that our engineers have to deal with on a daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like GPUs together. In addition, we need to ensure that the network is running smoothly and meets stringent performance and availability requirements of RDMA workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network fabric and host networking, comms lib and scheduling infrastructure.

Job Responsibility

  • Active member of a multi-disciplinary team to develop solutions for large scale training systems
  • Responsible for the overall performance of the communication system, including performance benchmarking, monitoring and troubleshooting production issues
  • Identify potential performance issues across the stack: comms lib, RDMA transport, host networking, scheduling and network fabric. Develop and deploy innovative solutions to address the performance issues

Requirements

  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • BS/MS/PhD in relevant fields (EE, CS), with 2+ years work experience
  • Experience with using communication libraries, such as MPI, NCCL, and UCX
  • Experience with developing, evaluating and debugging host networking protocols such as RDMA
  • Experience with triaging performance issues in complex scale-out distributed applications
  • Must obtain work authorization in country of employment at the time of hire and maintain ongoing work authorization during employment

Nice to have

  • Understanding of AI training workloads and demands they exert on networks
  • Understanding of RDMA congestion control mechanisms on IB and RoCE Networks
  • Understanding of the latest artificial intelligence (AI) technologies
  • Experience with machine learning frameworks such as PyTorch and TensorFlow
  • Experience in developing systems software in languages like C++

What we offer

  • bonus
  • equity
  • benefits

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Ai/hpc System Performance Engineer, Phd

8 matching positions

New

Compliance Assurance Sr. Analyst

We are looking for a proactive and detail-oriented professional to join our TCCO...
Location
Location
Philippines , Taguig
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A minimum of 5 years of experience in a Technology Risk Management, Cyber Risk, Information Security, IT Audit, or a related controls-focused role.
  • Direct experience in performing compliance testing, assurance, or audits for technology and/or cybersecurity controls.
  • Strong understanding of the risks and controls related to End-User Computing (EUC) and associated data management.
  • Solid understanding of IT systems, networks, cloud services, security infrastructure, and system vulnerabilities.
  • Familiarity with regulatory requirements and enterprise risk management frameworks (e.g., COBIT, NIST, ISO 27001).
  • Demonstrated experience in the risk management lifecycle, including risk identification, assessment, mitigation, and reporting.
  • Excellent project management and organizational skills, with the ability to prioritize tasks and manage multiple initiatives effectively.
  • Strong analytical, problem-solving, and decision-making skills.
  • Bachelor's/University degree or equivalent experience.
Job Responsibility
Job Responsibility
  • Perform independent, full-scope, and targeted compliance tests on technology and cybersecurity controls to assess their design and operating effectiveness.
  • Conduct regular and comprehensive technology and cyber risk assessments, documenting findings and recommendations to reduce risk exposure.
  • Assess and test risks associated with End-User Computing (EUC), and IT enabaled Smart Solution (ITeSS) focusing on its principles, cycles and overall governance.
  • Independently evaluate technology and cyber risks within the business to ensure they are within the acceptable risk appetite, taking proactive measures to address areas of concern.
  • Build and maintain effective engagement with the 1st Line of Defense, business partners, and other key stakeholders to understand their risk profiles, strategic priorities, and challenges.
  • Provide expert guidance on technology and cyber risk mitigation strategies and control enhancements.
  • Participate in initiatives to augment technology and cyber risk management practices. Support the development and implementation of enhanced procedures and methodologies.
  • Ensure all technology and cyber risk management activities adhere to internal policies, external regulations, and industry standards.
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency.
  • Analyze data to identify trends, emerging risks, and control gaps, providing timely recommendations to mitigate risk exposure.
What we offer
What we offer
  • Programs and services for physical and mental well-being including access to telehealth options, health advocates, confidential counseling and more.
  • Expanded Paid Parental Leave Policy.
  • Programs to manage financial well-being and help plan for the future.
  • Access to an array of learning and development resources.
  • Generous paid time off packages.
  • Resources and tools to volunteer in the communities.
  • Fulltime
Read More
Arrow Right
New

Cfo

Exceptional opportunity to join this thriving business that is looking for a lon...
Location
Location
United States , Greenville
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years strong financial accounting and reporting experience required
  • Good negotiation and systems skills are essential
  • Strong supervisory experience
Job Responsibility
Job Responsibility
  • Supervising a small accounting team and acting as a hands-on leader
  • Supervise several other departments with large staffs
What we offer
What we offer
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan
Read More
Arrow Right
New

Accounting Specialist

We are looking for an Accounting Specialist to support high-volume financial ope...
Location
Location
United States , Portland
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Prior experience in accounting, billing, reconciliation, or a related financial support role
  • Background in title, escrow, or a similarly regulated financial services environment is strongly preferred
  • Solid understanding of debits, credits, postings, and general transaction flow within accounting operations
  • Strong attention to detail with the ability to detect inconsistencies and maintain a high level of accuracy
  • Proficiency with Microsoft Excel, including creating and working with pivot tables
  • Experience using accounting software
  • Willingness to learn multiple aspects of the role and contribute across a variety of accounting tasks
Job Responsibility
Job Responsibility
  • Handle daily financial transactions by entering, reviewing, and confirming data for completeness and accuracy
  • Reconcile fees and related account activity to ensure balances are correct and discrepancies are resolved promptly
  • Examine billing records and process invoicing activity in alignment with established accounting practices
  • Review financial postings and transaction details to identify errors and support accurate recordkeeping
  • Execute wire transfer activity through banking platforms while following internal controls and approval requirements
  • Update recurring financial reports using prior-day transaction data and maintain organized supporting documentation
  • Use Excel, Adobe, and accounting applications to manage and validate financial information
  • Support compliance with internal procedures by maintaining consistency across transaction processing and documentation standards
What we offer
What we offer
  • medical, vision, dental, and life and disability insurance
  • enrollment in company 401(k) plan
Read More
Arrow Right
New

Cybersecurity Analyst

Location
Location
United States , Honolulu
Salary
Salary:
125000.00 - 130000.00 USD / Year
imcva.com Logo
Innovative Management Concept
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Active CompTIA Security+CE certification
  • Pursuant to a government contract, this specific position requires U.S. Citizenship
  • Current DoD TS/SCI clearance eligibility day one and prior to entry on duty
  • At start date, must possess an active CompTIA Security+CE certification
  • 7+ years of cybersecurity experience, preferably working directly with the Army
  • 5+ years of knowledge of DoD and Army cybersecurity policy
  • Strong interpersonal and relationship-building skills
  • Strong writing skills and experience addressing senior executive leaders and General Officers
  • Ability to evaluate data to quickly identify problems, issues, and gaps
Job Responsibility
Job Responsibility
  • Oversight and accountability of the day-to-day security operations of cybersecurity tasks
  • Validate compliant security architecture through understanding and application of current policies, procedures, and standards to provide a layered approach to cybersecurity
  • Assist in the review and drafting of policies against applicable standards for regulatory compliance
  • Cross-reference and validate physical, personnel, facility, and information systems, through policies and controls IAW Army Regulations, Department of Defense (DoD) Directives and Instructions
  • Manage information security risks and report findings to the Government
  • Work with system owners to maintain current Authorities to Operate (ATO) in a manner compliant with the Federal Information Security Management Act (FISMA), DoD Risk Management Framework (RMF), and National Institute of Standards and Technology (NIST) guidance
  • Support cybersecurity requirements during Army and Joint exercises
  • Represent the customer and CG in briefings and meetings regarding the cybersecurity posture of the AOR
  • Ensure appropriate Secure Technical Implementation Guidelines (STIG) are maintained through monthly review
  • Use eMASS to validate compliance with Army RMF 2.0 standards
What we offer
What we offer
  • 401(k) with a 3% employer match
  • paid time off
  • paid holidays
  • FSA spending
  • dental
  • vision
  • health insurance
  • company-sponsored AD&D
  • life insurance
  • voluntary life
  • Fulltime
Read More
Arrow Right
New

Software Engineer (ESB System)

Develop and maintain system integrations between enterprise applications and pla...
Location
Location
Vietnam , Ho Chi Minh City
Salary
Salary:
Not provided
amaris.com Logo
Amaris Consulting
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, or related fields
  • From 3.5+ years of experience in software engineer or system integration
  • Hands-on experience with ESB/middleware/integration platforms
  • Strong experience with JAVA or Golang, Microservice architect
  • Knowledge of database architecture and design
  • Knowledge of API Specification (OpenAPI, RAML, WSDL)
  • Strong skills in the programming technologies ESB, JAVA, Golang and Microservice Architect
  • Strong Database skills (Oracle, MS SQL, MongoDB, etc.)
  • Root cause analysis skills and planning to optimize system performance
  • Experience with Web services, API Gateway
Job Responsibility
Job Responsibility
  • Develop and maintain system integrations between enterprise applications and platforms
  • Build and enhance APIs, web services, and middleware integration solutions
  • Design and implement integration flows using ESB and microservices architecture
  • Participate in integration solution design for both on-premise and cloud environments
  • Collaborate with developers, analysts, and infrastructure teams to deliver integration projects
  • Troubleshoot system integration issues and perform root cause analysis
  • Optimize system performance, scalability, and security
  • Support automation initiatives through scripting and modern development tools
  • Prepare and maintain technical documentation
  • Participate in Agile/Scrum development activities
What we offer
What we offer
  • Competitive salary and 13th-month salary
  • 14+ annual leaves per year
  • Premium healthcare insurance, starting from your probation period
  • Project reviews and yearly performance appraisals
  • Annual company trips
  • Teambuilding activities: Team lunch/dinner, events, and celebrations, sports clubs (football, yoga, badminton, etc.)
  • International team with flexible working time
  • Tailor-made career path
  • Technical workshops and training courses
  • Mobility: Opportunities to be on-site abroad in our offices in over 60+ countries
  • Fulltime
Read More
Arrow Right
New

Project Manager / Tech Lead

Amaris Consulting is looking for an experienced Tech Lead / Project Manager with...
Location
Location
Belgium , Brussels
Salary
Salary:
Not provided
amaris.com Logo
Amaris Consulting
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Engineering, Information Systems, or a related field
  • At least 8 years of experience in project management, technical leadership, or software delivery roles
  • Strong experience leading custom software development projects in Agile environments
  • Hands-on knowledge of Scrum methodologies, backlog management, and Agile delivery practices
  • Experience working in regulated environments such as pharmaceutical, healthcare, banking, or food industries
  • Strong stakeholder management and communication skills with the ability to challenge priorities when needed
  • Experience managing project financials, budgets, and resource allocation
  • Ability to identify risks proactively and drive mitigation actions effectively
  • Fluent in English, both written and spoken
Job Responsibility
Job Responsibility
  • Lead the delivery and continuous improvement of a strategic internal digital platform
  • Coordinate multidisciplinary teams including developers, business analysts, testers, and UX specialists
  • Manage project scope, planning, budget, timelines, and risk mitigation activities
  • Facilitate Agile ceremonies including sprint planning, daily stand-ups, backlog grooming, and demos
  • Collaborate closely with cross-functional IT teams such as infrastructure, DevOps, and validation teams
  • Interact with business stakeholders to gather requirements and manage expectations effectively
  • Support new project intake activities and evaluate feasibility, priorities, and delivery approach
  • Ensure proactive communication, issue escalation, and alignment with management and key stakeholders
What we offer
What we offer
  • International and multicultural work environment
  • Access to training programs and certifications
  • R&D lab to explore new technologies
  • Opportunity to propose and lead innovative ideas
  • Personalized coaching and mentoring from experienced professionals
  • Tailor-made career path aligned with your growth ambitions
  • Fulltime
Read More
Arrow Right
New

Intelligent Automation Lead Engineer

NTT DATA strives to hire exceptional, innovative and passionate individuals who ...
Location
Location
Mexico , Remote
Salary
Salary:
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of hands-on experience in UiPath development
  • Experience in multi-agent automation modeling, orchestration, and maintenance
  • Experience working in enterprise automation programs or Centers of Excellence (CoE)
  • Knowledge of intelligent document processing, AI/ML integrations, or attended/unattended automation models
  • Strong expertise in: UiPath Studio, Orchestrator, AI Center, Document Understanding, API integrations, Power Platform
  • Proven experience in Insurance domain processes (Claims, Underwriting, Policy Admin)
  • Experience integrating Generative AI / LLMs into enterprise workflows
  • Knowledge on integrating the Machine Learning is a plus
  • Knowledge of insurance core platforms (Guidewire, Duck Creek, etc.)
  • Strong understanding of secure data handling in regulated environments
Job Responsibility
Job Responsibility
  • Lead the Design, develop, and deploy end-to-end UiPath and Microsoft Power Automate solutions across insurance operations
  • Develop and maintain intelligent automation frameworks, including multi-agent models and orchestrated bot ecosystems
  • Build Agentic AI workflows capable of autonomous decision-making in underwriting, claims adjudication, and policy processing
  • Collaborate with business stakeholders to understand current processes, identify automation opportunities, and perform feasibility assessments
  • Gather, analyze, and document business and functional requirements for automation initiatives
  • Translate business requirements into technical solution designs and implementation plans
  • Ensure automation solutions are scalable, secure, and aligned with organizational standards and best practices
  • Integrate LLMs (OpenAI/Azure OpenAI) into RPA solutions for: Claims document summarization, Policy document interpretation, Risk assessment support, Customer communication automation
  • Implement Intelligent Document Processing (IDP) for FNOL, claims forms, medical reports, KYC, policy applications, and invoices
  • Develop automation for core insurance systems (eg., Guidewire, Duck Creek, Majesco, etc.) is a plus
Read More
Arrow Right
New

Brokerage Client Assistant - RCAST

Wells Fargo is seeking a Brokerage Client Assistant - RCAST as part of the Remot...
Location
Location
United States , Charlotte
Salary
Salary:
Not provided
https://www.wellsfargo.com/ Logo
Wells Fargo
Expiration Date
June 04, 2026
Flip Icon
Requirements
Requirements
  • 1+ year of Brokerage and Client Services support experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
Job Responsibility
Job Responsibility
  • Support Financial Advisory teams and their clients by providing account information or quotes, entering Financial Advisory approved security tickets and various administrative tasks
  • Identify ways to improve Remote Client Associate Support Team (RCAST) Brokerage Client Support processes and offer RCAST Brokerage Client Support work group ideas
  • Perform moderately complex administrative and operational tasks within RCAST Brokerage Client Support functional area
  • Handle telephone calls or respond to inquiries and requests for researching of reports and account related issues
  • Establish and maintain files to meet the firms regulatory requirements
  • Create, produce, and maintain reports, databases, and record keeping for the purpose of growing client relationships
  • Receive direction from supervisor and Financial Advisory functional area and escalate non-routine questions
  • Interact with Financial Advisory functional area on wider range of inquiries or requests, as well as internal and external customers
  • Fulltime
!
Read More
Arrow Right