CrawlJobs Logo

Performance Engineer, Low-Level Libraries

meta.com Logo

Meta

Location Icon

Location:
United States , Bellevue

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

257000.00 USD / Year

Job Description:

Are you committed to squeezing every drop of performance? Join Meta's Low-Level Libraries team and drive impact across our foundational infrastructure. We own the performance and Developer Experience (DevX) for critical C/C++ libraries like *folly*, *Jemalloc*, and GEMM (MLK, Aocl-Blas, etc.). Our work involves cutting-edge optimizations in domains such as memory management, concurrency, architecture-specific enablement, and AI frameworks. The result? Significant power savings and enabling new platforms at Meta.

Job Responsibility:

  • Develop and optimize C/C++ libraries for Meta services–memory allocation, thread pools and work scheduling, thread synchronization and lockless data structures, highly performant collections, async processing and I/O, RPC, etc
  • Analyze resource utilization in server applications (CPU, GPU, memory, network, etc.), identify bottlenecks, scope out opportunities for improved resource utilization, and implement improvements, such as modifying core libraries to optimize Meta server workloads, implementing efficiency improvements in production code (e.g., change core data structures), or improving server utilization
  • Work with internal customers and partners to define requirements
  • Reflect requirements in the team roadmap and plan out execution

Requirements:

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 7+ years of professional C/C++ experience
  • Knowledge of computer architecture, CPU and memory subsystem, and OS-level resource management
  • Experience using performance-profiling tools and optimizing native applications for execution-time and memory efficiency

Nice to have:

  • Experience implementing and optimizing low-level libraries, such as memory management, threading, GEMM, data compression, or string processing
  • Knowledge of modern ISAs, such as x86 and ARM
  • Experience hand-tuning code, e.g., with loop optimizations, vectorization, parallelization, HW-architecture-specific optimizations
  • Experience developing operating-system kernels
What we offer:
  • bonus
  • equity
  • benefits

Additional Information:

Job Posted:
January 26, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Performance Engineer, Low-Level Libraries

Software Engineer - Performance Tools

Join our team as a Software Engineer - Performance Tools and take the lead in il...
Location
Location
United States , San Jose
Salary
Salary:
150000.00 - 275000.00 USD / Year
etched.com Logo
Etched
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong proficiency in C++ or Rust
  • Proficiency in Python is a plus
  • Deep understanding of computer architecture (CPU, GPU, accelerators), memory hierarchies (caches, DRAM), and interconnects (especially PCIe)
  • Proven experience in low-level performance analysis, profiling, and bottleneck identification on complex hardware systems (GPUs, CPUs, FPGAs, or custom ASICs)
  • Experience with performance analysis tools (e.g., NVIDIA Nsight, AMD uProf, Intel VTune, perf, Tracy, ETW)
  • Experience working close to hardware, potentially reading performance counters or interacting directly with device drivers
Job Responsibility
Job Responsibility
  • Tool Architecture & Design: Lead the design and architecture of a comprehensive performance analysis suite, including data collection mechanisms, data processing pipelines, analysis engines, and user interfaces (CLI and/or GUI)
  • Low-Level Data Collection: Develop robust methods to capture performance data directly from our custom ML accelerator hardware (e.g., hardware performance counters, execution unit status, memory access patterns) via driver interfaces or other mechanisms
  • Host & System Tracing: Implement tracing for host-side API calls (runtime libraries, driver interactions) and system-level events (CPU activity, PCIe traffic, memory usage, network contention) related to Sohu workloads
  • Data Correlation & Synchronization: Design and implement techniques to accurately correlate performance events across the host CPU, device driver, PCIe bus, multiple accelerators, and multiple hosts, ensuring precise time synchronization
  • Performance Analysis Engine: Build analysis modules to automatically interpret collected trace and counter data, identifying key performance limiters (e.g., compute-bound, memory bandwidth-bound, latency-bound, PCIe-bound, specific hardware bottlenecks)
  • Visualization & Reporting: Develop intuitive visualizations (timelines, dependency graphs, resource utilization charts, statistical summaries) to clearly communicate performance characteristics and bottlenecks to users
  • Collaboration & Support: Work closely with hardware architects, firmware engineers, driver developers, compiler engineers, and ML application engineers to understand their needs, define tool requirements, and provide expert guidance on performance analysis and optimization using the tool
What we offer
What we offer
  • Medical, dental, and vision packages with generous premium coverage
  • $500 per month credit for waiving medical benefits
  • Housing subsidy of $2k per month for those living within walking distance of the office
  • Relocation support for those moving to San Jose (Santana Row)
  • Various wellness benefits covering fitness, mental health, and more
  • Daily lunch + dinner in our office
  • Fulltime
Read More
Arrow Right

Software Engineer II -iOS

Uber's applications are a critical part of daily life for hundreds of millions o...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or equivalent in Computer Science, Engineering, Mathematics or related field OR 3-years full-time Software Engineering work experience, WHICH INCLUDES 3-years total technical software engineering experience in one or more of the following areas: Programming language (e.g. Swift, Objective C, C)
  • Extensive iOS development experience, with deep proficiency in Swift and Objective-C.
  • Proven experience in building, maintaining, and shipping mobile SDKs or shared, reusable libraries.
  • Deep understanding of iOS internals, memory management (ARC, retain-cycles), and concurrency (GCD, OperationQueues).
  • Expertise in low-level debugging and performance optimization using tools like Instruments (Time Profiler, Leaks, Allocations, Hangs), lldb, and crash log analysis.
  • Familiarity with mobile observability, crash reporting, and performance monitoring principles and tools.
  • Experience developing for applications at massive scale (handling hundreds of millions of users) is a strong plus.
  • Ability to take ownership and independently drive complex, low-level platform projects to completion.
  • Strong focus on writing highly performant, testable, and resource-efficient code.
Job Responsibility
Job Responsibility
  • Design, develop, and maintain the core iOS observability SDKs responsible for crash reporting, hang/ANR detection, performance monitoring, and broad bug diagnostics.
  • Engineer highly performant, low-overhead code. Our SDKs run in the critical path of all Uber apps and must have a minimal impact on application performance and battery life.
  • Dive deep into complex crashes, hangs, memory issues, and performance bugs from across all Uber apps, performing root cause analysis and implementing platform-level fixes.
  • Develop new tooling and heuristics to automatically detect and classify a wide spectrum of bugs, from simple UI glitches to complex, multi-threaded race conditions.
  • Collaborate with backend engineers to build the data ingestion pipelines and dashboards that process terabytes of diagnostic data from our mobile fleet.
  • Act as a subject matter expert on iOS stability and performance, providing guidance, tooling, and best practices to feature teams across Uber.
  • Continuously innovate on our diagnostic capabilities, exploring new ways to capture and surface critical health metrics related to bugs, crashes, and performance from hundreds of millions of devices.
  • Uphold rigorous standards for code quality, testing (especially unit and integration testing for an SDK), and on-call practices for a Tier-0 service.
  • Fulltime
Read More
Arrow Right

Machine Learning Engineer - Inference

Together AI is seeking a Machine Learning Engineer to join our Inference Engine ...
Location
Location
United States , San Francisco
Salary
Salary:
160000.00 - 230000.00 USD / Year
together.ai Logo
Together AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience writing high-performance, well-tested, production-quality code
  • Proficiency with Python and PyTorch
  • Demonstrated experience in building high performance libraries and tooling
  • Excellent understanding of low-level operating systems concepts including multi-threading, memory management, networking, storage, performance, and scale
Job Responsibility
Job Responsibility
  • Design and build the production systems that power the Together AI inference engine, enabling reliability and performance at scale
  • Develop and optimize runtime inference services for large-scale AI applications
  • Collaborate with researchers, engineers, product managers, and designers to bring new features and research capabilities to the world
  • Conduct design and code reviews to ensure high standards of quality
  • Create services, tools, and developer documentation to support the inference engine
  • Implement robust and fault-tolerant systems for data ingestion and processing
What we offer
What we offer
  • competitive compensation
  • startup equity
  • health insurance
  • other competitive benefits
  • Fulltime
Read More
Arrow Right

Kernel Optimization Engineer

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. ...
Location
Location
United Arab Emirates , Dubai
Salary
Salary:
Not provided
cerebras.net Logo
Cerebras Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s, Master’s, PhD or foreign equivalents in Computer Science, Computer Engineering, Mathematics, or related fields
  • Understanding of hardware architecture concepts — must be comfortable learning the details of a new hardware architecture
  • Skilled in C++ and Python programming languages
  • Good knowledge of library and/or API development best practices
  • Strong debugging skills and knowledge of debugging complex software stack
Job Responsibility
Job Responsibility
  • Develop design specifications for new machine learning and linear algebra kernels and mapping to the Cerebras WSE System using various parallel programming algorithms
  • Develop and debug kernel library of highly optimized low level assembly instruction and C-like domain specific language routines to implement algorithms targeting the Cerebras hardware system
  • Develop and debug high-performance kernel routines in low-level assembly and a custom C-like (CSL) language, implementing algorithms optimized for the Cerebras hardware system
  • Using mathematical models and analysis to measure the software performance and inform design decisions
  • Develop and integrate unit and system testing methodologies to verify correct functionality and performance of kernel libraries
  • Study emerging trends in Machine Learning applications and help evolve Kernel library architecture to address computational challenges of the start-of-the-art Neural Networks
  • Interact with chip and system architects to optimize instruction sets, microarchitecture, and IO of next generation systems
What we offer
What we offer
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs
Read More
Arrow Right

Firmware Intern

In our firmware role, you will help develop firmware for custom ASICs designed t...
Location
Location
United States , San Jose
Salary
Salary:
Not provided
etched.com Logo
Etched
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Progress towards a Bachelor’s, Master’s, or PhD degree in Computer Science, Engineering, or a related technical field
  • Proficiency in C/C++ or Rust
  • Strong fundamentals in data structures and algorithms
  • Strong understanding of low-level software engineering
  • Strong understanding of hardware/software co-design
  • Excellent communication and collaboration skills
Job Responsibility
Job Responsibility
  • Help develop firmware for custom ASICs designed to run large transformer models efficiently
  • Work across the stack—from low-level drivers, hardware interfaces, and system initialization code to integration with runtime libraries and model-execution frameworks
  • Focus on making the hardware operate reliably and at peak performance, enabling high-throughput inference and training workloads
  • Collaborate with hardware, architecture, and software teams to bring up new silicon, validate features, and support real-world AI applications
What we offer
What we offer
  • 12-week paid internship
  • Generous housing support for those relocating
  • Daily lunch and dinner in our office
  • Direct mentorship from industry leaders and world-class engineers
  • Opportunity to work on one of the most important problems of our time
  • Fulltime
Read More
Arrow Right

Manager, Kernel Software

As a Manager, Kernel Software, you will lead a team of engineers at the intersec...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
cerebras.net Logo
Cerebras Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s, Master’s, PhD, or foreign equivalent in Computer Science, Computer Engineering, Mathematics, or a related field
  • Proven experience leading technical teams, including mentoring engineers, setting technical direction, and driving execution
  • Strong understanding of hardware architecture concepts and willingness to dive into new system architectures
  • Proficiency in C++ and Python
  • experience with low-level systems programming
  • Familiarity with library/API development best practices and performance optimization
  • Excellent debugging skills across complex, layered software stacks
Job Responsibility
Job Responsibility
  • Lead the design and development of high-performance ML and linear algebra kernels for the Cerebras WSE using parallel programming techniques
  • Guide a team building optimized low-level routines in assembly and a domain-specific C-like language
  • Use performance modeling to inform design and optimization decisions
  • Drive test development to ensure correctness and performance of kernel libraries
  • Evolve kernel architecture to support emerging ML models and workloads
  • Collaborate with hardware architects to influence future system design
  • Mentor engineers and foster a high-performing, collaborative team culture
What we offer
What we offer
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs
Read More
Arrow Right

Quantitative Engineer

Blockchain.com is connecting the world to the future of finance. As the most tru...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
blockchain.com Logo
Blockchain
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep expertise in C++ and/or Rust, with strong focus on low-level optimization and performance
  • Proven experience building fast-market, low-latency trading systems, ideally for options or derivatives
  • Strong understanding of: Options theory and derivatives markets
  • Pricing models, volatility surfaces, and risk metrics (Greeks)
  • Algorithms, data structures, and memory-efficient system design
  • Strong interest in algorithmic options trading, risk management, and quantitative analysis
  • Exceptional quantitative and analytical skills
  • Bachelor’s or Master’s degree in Computer Science, Mathematics, Statistics, or equivalent experience
  • Strong written and verbal communication skills
Job Responsibility
Job Responsibility
  • Design, build, test, and deploy high-performance options trading infrastructure in C++ and Rust
  • Develop and optimize options pricing and risk-management libraries, including Greeks and volatility-driven models
  • Build and maintain low-latency, high-throughput options trading and quoting systems
  • Identify, analyze, and optimize critical performance bottlenecks in both research and production environments
  • Partner closely with options traders to: Define trading and risk requirements
  • Deliver custom software solutions for options strategies
  • Provide timely systems support during live trading
  • Take full ownership of projects from initial design through deployment and production support
  • Build efficient, reliable, and highly available applications in a scaled, performance-based environment
What we offer
What we offer
  • Full-time salary based on experience and meaningful equity in an industry-leading company
  • Work from Anywhere Policy: You can work remotely from anywhere in the world for up to 20 days per year
  • Apple equipment
  • The opportunity to be a key player and build your career at a rapidly expanding, global technology company in an emerging field
  • Flexible work culture
  • Fulltime
Read More
Arrow Right

Senior Software Engineer - ReactJS (Next-Gen Firewall)

As a Senior Software Engineer – Front End, you will be responsible for designing...
Location
Location
Vietnam , Ho Chi Minh City
Salary
Salary:
Not provided
qualgo.net Logo
Qualgo
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 5+ years of experience in frontend software engineering
  • Deep expertise in ReactJS and modern hooks patterns
  • Strong proficiency in TypeScript (Non-negotiable for this role)
  • Experience building complex Enterprise B2B Dashboards or Management Consoles (not just static websites)
  • Experience with Data Visualization libraries (D3.js, Chart.js, ECharts, or similar)
  • Experience handling Real-time data via WebSockets or Server-Sent Events (SSE)
  • Analytical Mindset: Ability to understand complex technical data and figure out how to present it simply
  • Precision: In a security product, UI bugs can lead to security holes. Attention to detail is paramount
  • Collaboration: Ability to work with "low-level" system engineers and "high-level" product designers simultaneously
Job Responsibility
Job Responsibility
  • Design and build complex, data-heavy dashboards using ReactJS and TypeScript to monitor network traffic and security events
  • Develop "Zero-Trust" policy editors and firewall rule management interfaces that are intuitive yet powerful
  • Optimize rendering performance to handle thousands of real-time events per second (logs, alerts, packet flows) without browser lag
  • Work with Data Scientists to visualize AI detection models (e.g., distinguishing between normal traffic and an anomaly)
  • Implement advanced data visualizations (using D3.js, Recharts, or WebGL/Three.js) to represent global threat maps, network topology graphs, and attack vectors
  • Translate complex AI "confidence scores" into clear, human-readable risk indicators for network administrators
  • Architect and implement robust WebSocket connections to stream live firewall logs and threat alerts to the browser
  • Manage complex client-side state (using Redux, Zustand, or TanStack Query) to ensure the UI reflects the exact state of the network devices in real-time
  • Translate networking concepts (IP ranges, Subnets, DNS, VPN tunnels) into user-friendly UI components
  • Champion UI/UX best practices for "Critical Control Systems"—ensuring that blocking a threat or changing a firewall rule is safe, confirmed, and error-free
What we offer
What we offer
  • Meaningful work & impact: Work on products that protect users’ data and make a real difference in people’s lives
  • Competitive rewards: Great work deserves great rewards — enjoy competitive pay and recognition for contribution and impact you make
  • Growth & well-being: Fuel your growth with hands-on learning. Enjoy extra leaves, and premium healthcare for you and your family
  • People & workspace: Collaborate, create, and celebrate — a modern workspace built for teamwork, fun, and innovation
  • Young & dynamic environment: Work with stunning colleagues where creativity thrives, ideas are welcome, and every day brings new challenges and opportunities
Read More
Arrow Right