CrawlJobs Logo

Software Engineer 2 - Processing Unit for Copilot

China, Beijing · Job Posted March 22, 2026
Apply Position
Job Link Share

Job Description

We are seeking an expert GPU Engineer 2 to join our AI Infrastructure team. In this role, you will architect and optimize the core inference engine that powers our large-scale AI models. You will be responsible for pushing the boundaries of hardware performance, reducing latency, and maximizing throughput for Generative AI and Deep Learning workloads. You will work at the intersection of Deep Learning algorithms and low-level hardware, designing custom operators and building a highly efficient training/inference execution engine from the ground up.

Job Responsibility

  • Custom Operator Development: Design and implement highly optimized GPU kernels (CUDA/Triton) for critical deep learning operations (e.g., FlashAttention, GEMM, LayerNorm) to outperform standard libraries
  • Inference Engine Architecture: Contribute to the development of our high-performance inference engine, focusing on graph optimizations, operator fusion, and dynamic memory management (e.g., KV Cache optimization)
  • Performance Optimization: Deeply analyze and profile model performance using tools like Nsight Systems/Compute. Identify bottlenecks in memory bandwidth, instruction throughput, and kernel launch overheads
  • Model Acceleration: Implement advanced acceleration techniques such as Quantization (INT8, FP8, AWQ), Kernel Fusion, and continuous batching
  • Distributed Computing: Optimize communication primitives (NCCL) to enable efficient multi-GPU and multi-node inference (Tensor Parallelism, Pipeline Parallelism)
  • Hardware Adaptation: Ensure the software stack fully utilizes modern GPU architecture features (e.g., NVIDIA Hopper/Ampere Tensor Cores, Asynchronous Copy)

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Architectural Mastery: Expertise in the CUDA programming model and NVIDIA GPU architectures (specifically Ampere/Hopper)
  • Deep understanding of the memory hierarchy (Shared Memory, L2 cache, Registers), warp-level primitives, occupancy optimization, and bank conflict resolution
  • Familiarity with advanced hardware features: Tensor Cores, TMA (Tensor Memory Accelerator), and asynchronous copy
  • Proven ability to navigate and modify complex, large-scale codebases (e.g., PyTorch internals, Linux kernel)
  • Experience with build and binding ecosystems: CMake, pybind11, and CI/CD for GPU workloads
  • Performance Engineering: Mastery of NVIDIA Nsight Systems/Compute
  • Ability to mathematically reason about performance using the Roofline Model, memory bandwidth utilization, and compute throughput

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Software Engineer 2 - Processing Unit for Copilot

8 matching positions

Software Engineer- Full Stack - Java, React - Engineer

The Digital S/W Engineer Intmd Analyst is a seasoned professional role. Applies ...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2+ years software development experience, specializing in JavaScript, Java frameworks, databases like React.js, Spring Boot, Oracle and MongoDB
  • In-depth knowledge of modern application architecture principles
  • Clear understanding of Data Structures and Object-Oriented Principles using Java
  • Practical experience with Artificial Intelligence (AI) tools for enhancing development workflows
  • Proficiency in Microservices frameworks (e.g., Spring Boot), Event-Driven Services, and Cloud-Native Application Development
  • Multiple years of experience on Service Oriented and Microservices architectures, including REST and GraphQL implementations
  • Demonstrated ability to design, develop, and maintain both front-end and back-end components of robust web applications
  • Strong expertise in developing intuitive user interfaces using contemporary JavaScript frameworks (e.g., React), HTML5, and CSS
  • Solid experience in developing server-side logic and APIs using languages such as Java, Python, or similar
  • Comprehensive knowledge of SQL and PL/SQL, with a deep understanding of Relational Database Management Systems (RDBMS), particularly Oracle
Job Responsibility
Job Responsibility
  • Apply depth of knowledge and expertise to all aspects of the software development lifecycle, as well as partner continuously with stakeholders on a regular basis
  • Develop and engineer solutions within an Agile software delivery team, working to collaboratively deliver sprint goals, write code, and participate in the broader Citi technical community and team-level Agile and Scrum processes
  • Contribute to the design, documentation, and development of world-class enterprise applications leveraging the latest technologies and software design patterns
  • Leverage technical knowledge of concepts and procedures within own area and basic knowledge of other areas to resolve issues, as necessary
  • Follow and contribute to defining technical and team standards
  • Collaborate technical leadership to achieve established goals, in line with our broader technical strategy
  • AI-Driven Development: Leverage AI tools, such as GitHub Copilot, to enhance development efficiency, accelerate delivery timelines, and optimize software solutions
  • AI Landscape Awareness: Maintain awareness of the rapidly evolving AI landscape and proactively propose innovative approaches to implement product features
  • Problem Solving and Troubleshooting: Possess the expertise to analyze and effectively troubleshoot complex coding, application performance, and design challenges
  • Root Cause Analysis: Capable of conducting thorough research to identify the root causes of development and performance issues, as well as devising and implementing effective defect resolutions
  • Fulltime
Read More
Arrow Right

Senior Software Engineer - Java / Hibernate

Imagine building the analytics layer that millions of business users rely on to ...
Location
Location
United States , McLean
Salary
Salary:
Not provided
appian.com Logo
Appian Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS in Computer Science (4+ years experience) or MS in Computer Science (2+ years experience)
  • Strong Java fundamentals - Spring Framework, Hibernate/JPA, interface-first architecture, dependency injection
  • Experience building interactive web UIs with TypeScript/React (or similar component frameworks)
  • Comfort with functional programming concepts (immutability, higher-order functions, declarative UI)
  • Experience with relational databases - schema design, migrations, query optimization
  • Demonstrated ability to work across multiple languages and layers in a single feature
  • Strong testing discipline - unit, integration, and UI-level tests
  • Excellent communication skills and ability to mentor others
Job Responsibility
Job Responsibility
  • Design and implement full-stack features across Java services, SAIL interfaces, and React/TypeScript chart components
  • Build and extend AI copilot pipelines that use LLMs to generate process insights, KPI suggestions, and actionable recommendations
  • Own features from database schema (Liquibase/JPA) through API layer (Spring) to interactive UI
  • Develop rich data visualization experiences: chart configuration UIs, dashboard builders, and report designers
  • Write interface-first Java with constructor injection and proper layering
  • Contribute to SAIL system rules using functional programming patterns, Redux state management, and deferred evaluation
  • Build and maintain TypeScript/React chart widgets that render complex configurations
  • Lead feature development with spec-driven workflows, mentor engineers, and conduct thorough code reviews
  • Ensure reliability through comprehensive testing (unit expression tests, integration tests, UI tests, migration tests)
  • Collaborate on feature breakdowns, sizing, and architectural decisions with your squad
What we offer
What we offer
  • Health coverage
  • Employee Assistance Program (EAP) with free mental health support
  • Life and disability insurance
  • Employee Stock Purchase Program (ESPP)
  • Retirement/pension plan
  • Wellness dollars
  • Tuition reimbursement
  • Family-forming benefits
  • Fulltime
Read More
Arrow Right
New

Senior Android Developer

The Digital S/W Engineer Sr Analyst is a seasoned professional role. Applies in-...
Location
Location
India , Chennai
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of professional experience in native mobile application development
  • 5+ years of hands-on experience with Kotlin (strong preference for Kotlin-first development
  • Java familiarity a plus)
  • 5+ years of experience building and shipping enterprise-grade native Android applications at scale
  • Deep understanding of the Android SDK, platform lifecycle, and Google ecosystem (Jetpack, Android Runtime, etc.)
  • Strong understanding of Android internals — memory management, process lifecycle, activity/fragment back stack, and app lifecycle at a low level
  • Experience with networking layers, Retrofit, OkHttp, and custom network stack implementations
  • Proficiency with data persistence strategies (Room, SQLite, DataStore, SharedPreferences, EncryptedSharedPreferences)
  • Solid understanding of security best practices at the OS level (certificate pinning, Android Keystore, data encryption, ProGuard/R8 obfuscation)
  • Familiarity with performance profiling and optimization using Android Studio Profiler (memory leaks, CPU usage, rendering bottlenecks, battery impact)
Job Responsibility
Job Responsibility
  • Accountable for executing and driving mid-size feature application design and development efforts to completion, serving as a development lead on medium-scale projects and supporting the execution of larger efforts
  • Proficient at operating with considerable autonomy and discretion as you will significantly influence the way an application is designed and developed by providing subject specific expertise and an advanced level of understanding of application programming principles
  • Sought after due to ability to analyze and troubleshoot coding, application performance and design challenges
  • Capable of research in root cause of development and performance concerns as well as the resolution of defects
  • Have a deep understanding of the technical requirements for the solutions being built
  • Understand engineering needs including those required to build, maintain, and operate the system through all phases of its life
  • Proficient in information modeling, data structures and algorithms
  • Understand maintenance characteristics, runtime properties and dependencies that exist in support of your system’s software. These dependencies will include but not limited to: hardware, platform, operating system, performance and load requirements, as well as contingencies with upstream and downstream systems
  • Demonstrate an advanced understanding of supported main system flows and possess a comprehensive understanding of how the system and others (i.e. architecture or infrastructure) collectively integrate to contribute towards achieving business objectives. Using this expertise, assist with the development of standards for coding, testing, debugging and implementation. This will ensure application design adheres to the overall architecture blueprint
  • Participate in design discussions as a Development Lead and as such will play the part of a key decision maker in driving design decisions. This is evidenced by your knowledge of the application code details and your experience with the technology and teams engaged
  • Fulltime
Read More
Arrow Right

Lead Agentic Ai Engineer

Citi's Wholesale Technology organization is seeking an exceptional, hands-on Lea...
Location
Location
Canada , Mississauga
Salary
Salary:
120800.00 - 170800.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6–10 years of relevant experience in an AI/ML development role, Applications Development, or Systems Analysis, with a substantial and demonstrated focus on Python technologies
  • Minimum 2+ years of professional experience in software development with a focus on AI, prompt engineering, machine learning, and/or agentic AI systems
  • Proven track record as a lead developer for agentic flow design, prompt design, and testing of autonomous AI systems with deep expertise in Google ADK
  • Subject Matter Expert (SME) in at least one area of Applications Development, particularly Python application development (Django, Flask, FastAPI)
  • Python (expert-level): FastAPI, Django, Flask, asyncio, PySpark — strong fundamentals in algorithms, data structures, concurrency, and design patterns
  • Proficient in Java (Spring Boot, Spring Cloud), JavaScript/TypeScript (React, Next.js, Node.js), and SQL/data modeling
  • Experience across AWS, Azure, and GCP with Docker, Kubernetes, and CI/CD pipelines
  • Proficient in MLOps practices including model versioning, deployment, and lifecycle management
  • Strong foundation in secure API design, microservices, event-driven architecture, and distributed systems with expertise in testing, Git workflows, and performance optimization
  • Deep expertise in LLMs (OpenAI GPT, Gemini, Claude, Llama) with hands-on experience in LangChain, LangGraph, LlamaIndex, AutoGen, CrewAI, and Google ADK
Job Responsibility
Job Responsibility
  • Lead end-to-end design, development, and deployment of large-scale agentic AI solutions using Google Agent Development Kit (ADK) and frameworks such as LangChain, LangGraph
  • Architect advanced multi-agent systems (perception, reasoning, planning, execution) integrating multiple LLM providers (OpenAI, Anthropic, Google Gemini)
  • Build AI-powered capabilities using Google Gemini, Vertex AI, Agent Development Kit (ADK), Google A2UI, vector databases, RAG pipelines, semantic search, and advanced prompt and context management
  • Engineer autonomous agents incorporating planning, tool usage, memory management, and multi-step reasoning patterns
  • Develop scalable, high-performance backend services in Python (FastAPI, asyncio) with resilient APIs, event-driven designs, and microservices architectures
  • Build and maintain robust data pipelines working with SQL (Oracle, PostgreSQL) and NoSQL (MongoDB) databases
  • Implement secure REST APIs and agent interfaces with strong authentication, authorization (OAuth), and encryption best practices
  • Optimize AI agent performance, latency, and cost through prompt optimization, caching strategies, and vector index tuning
  • Provide architectural guidance for Next-Generation AI (NGAI) initiatives, ensuring adherence to CTO guidelines and platform standards
  • Develop and maintain a strategic roadmap for generative AI adoption, evaluating new models, techniques, and platforms
  • Fulltime
Read More
Arrow Right

Senior Front-end Developer (Angular/TypeScript)

You will join the cross-functional "Account" team to work on new Angular service...
Location
Location
Ukraine , Kyiv
Salary
Salary:
Not provided
3shape.com Logo
3Shape
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of software development experience
  • Proven 2+ years of experience with Angular understanding of RxJS, state management, and the latest framework features (e.g., Signals, Standalone Components)
  • Strong expertise with web-based development using HTML, CSS, TypeScript, JavaScript
  • Hands on experience with CI/CD pipelines (Docker, Azure DevOps, etc.) and an interest in migrating to GitHub Actions
  • Strong knowledge of OOP, SOLID principles and design patterns
  • English level - Upper-Intermediate and above
  • Understanding of the memory management
  • Passion for new technologies, and empathy for users
  • Ownership mindset — you drive things to completion and take responsibility for outcomes, not just tasks
  • Familiar with Scrum or another agile framework
Job Responsibility
Job Responsibility
  • Develop front-end features using Angular and Angular Material within the 3Shape ecosystem
  • Contribute to product architecture, core components, environment setup, and technical investigations
  • Collaborate with UX/UI designer and backend engineers to deliver integrated solutions
  • Follow and improve development practices, including code reviews, unit testing, and process optimization
  • Use GitHub Copilot and Claude Code for everyday work
  • Deliver high-quality, resilient products
  • Expand code coverage through unit, integration, and automated UI testing
  • Participate in Agile ceremonies and support a collaborative, proactive team culture
What we offer
What we offer
  • 24 working days of annual vacation
  • medical insurance
  • paid sick leaves and child sick leaves
  • maternity and paternity leaves
  • breakfasts and lunches in the office
  • good working conditions in a comfortable office in UNIT.City
  • a parking lot with free spaces for employees
  • occasional business trips to Western Europe
  • Fulltime
Read More
Arrow Right

Summer 2026 Software Engineering Internship

This is a 11-week paid learning experience during which you'll be able to connec...
Location
Location
United States , Overland Park
Salary
Salary:
20.00 - 40.00 USD / Hour
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree OR combination of education and experience deemed equivalent
  • Acceptable areas of study include Computer Science or Engineering (Preferred)
  • Less than 2 years Technical engineering experience. (Preferred)
  • Communication (Required)
  • Customer Service (Required)
  • Analytics (Required)
  • Technical Writing (Required)
  • Java, Spring Boot, Python, Oracle, JSON, YAML
  • Multi-threading concepts
  • JVM memory management and Garbage collection concepts
Job Responsibility
Job Responsibility
  • Develop and implement software solutions to meet customer requirements and support scalable, highly available environments
  • Maintain clear and accurate documentation for software code, system designs, and business requirements
  • Collaborate with technical teams to deliver software solutions and communicate technical ideas effectively
  • Contribute to the understanding and application of current and emerging technologies to support business goals
  • Identify and share ideas for system analysis and continuous improvement of existing processes and services
  • Also responsible for other duties/projects as assigned by business management as needed
What we offer
What we offer
  • Relocation may be provided to program participants who reside more than 50 miles from the internship location
  • Fulltime
Read More
Arrow Right

Cloud Solution Architect- AI Business Process

Our purpose is to empower organizations to transform the way they work by harnes...
Location
Location
United States , Multiple Locations
Salary
Salary:
85100.00 - 169800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Information Technology, Engineering, Business, Liberal Arts, or related field AND 2+ years experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting OR equivalent experience
  • Authorization to work in the United States that does not now or in the future require employer sponsorship
  • 4+ years experience working in a customer-facing role
  • 4+ years experience leading technical projects
  • 1+ year experience customizing, troubleshooting, and deploying Copilot Studio with one or more demonstratable production deployments
  • 4+ years of experience customizing, tuning, troubleshooting, and deploying one or more of the following cloud-based systems: Dynamics 365 CE Solutions (Customer Service, Sales, and/or Field Service) and/or similar AI-powered CRM and ERP business applications
  • Microsoft Power Platform (Model Driven Apps, Canvas Apps, Power Automate) with Dataverse and/or similar suite of low-code tools to create intelligent agents for business processes
Job Responsibility
Job Responsibility
  • Drive positive Customer Satisfaction and become a trusted advisor to customers by leveraging AI Business Process (Dynamics 365, Power Platform, and/or Copilot Studio) expertise to enable defined Customer Success Plan outcomes
  • Provide feedback and insights from customers back to the relevant Microsoft teams to enable continuous improvement
  • Develop opportunities to drive Customer Success business results by working with business and technical decision makers to ensure they understand Microsoft's AI Business Process value proposition to effectively drive usage and generate value from Microsoft technology investments via implementation advisory, performance efficiency, and organizational/operational excellence
  • Identify resolutions to issues blocking go-live of customer success projects by leveraging deep knowledge of AI Business Process solutions and technical subject matter expertise
  • Lead the technical conversations with customers to drive value from their MS investments
  • Deliver all work according to Microsoft best practices and policies and using repeatable Intellectual Property (IP)
  • Demonstrate Self Learner mindset through continuous alignment of individual skilling to team/area demands and Customer Success goals
  • Engage in relevant communities to share expertise, contribute to IP creation, prioritize IP re-use, and learn from others to help accelerate your customers transformation journey
  • Fulltime
Read More
Arrow Right

Cloud Solution Architect - AI Business Solutions

We are looking for a Cloud Solution Architect – AI Business Solutions for AI Wor...
Location
Location
United States , Multiple Locations
Salary
Salary:
85100.00 - 169800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelors Degree in Computer Science, Information Technology, Engineering, Business, or related field AND 2+ years experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting OR equivalent experience
  • Microsoft is unable to sponsor a work visa for this role due to the nature of the role’s job duties
  • Bachelors Degree in Computer Science, Information Technology, Engineering, Business, Liberal Arts, or related field AND 8+ years experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting OR Masters Degree in Computer Science, Information Technology, Engineering, Business, or related field AND 6+ years experience in cloud/infrastructure technologies, technology solutions, practice development, architecture, and/or consulting OR equivalent experience
  • 4+ years experience working in a customer-facing role (e.g., internal and/or external)
  • 4+ years experience working on technical projects
  • Technical certifications in relevant technologies or disciplines (e.g., Azure, Amazon Web Services, Google, security certifications)
  • Experience with Copilot, D635, and/or Power Platform
Job Responsibility
Job Responsibility
  • Accelerate AI transformation and Value Realization: Actively engage with business and technical decision makers to drive intent, enablement and usage of Microsoft AI solutions, securing long-term customer renewal. Galvanize technical and sales experts around additional identified opportunities to develop customer specific roadmaps that drive further growth & business value realization
  • Customer Insights: Provide feedback and insights from customers back to relevant MS teams to enable continuous improvement
  • Customer Satisfaction: Drive positive Customer Satisfaction and become a trusted advisor to customers by leveraging Microsoft’s AI Business Solutions within AI Workforce and/or AI Business Process to enable defined Customer Success Plan outcomes. You will actively listen and respectfully challenge to drive the best outcomes
  • Usage (Cloud & Support) Growth: Develop opportunities to drive Customer Success business results by working with business & technical decision makers to ensure they understand Microsoft’s AI solutions value proposition. Lead architecture design sessions, address customer questions, and provide best practices for activating solutions within the customer environment
  • Identify Opportunities: Identify resolutions to issues blocking the go-live of customer success projects by leveraging deep knowledge of either AI Workforce or AI Business Process technical subject matter expertise. Lead business & technical conversations with customers to drive value from their MS investments. Deliver all work according to MS best practices & policies and using repeatable IP
  • Learn-It-All: Demonstrate a self-learner mindset through continuous, self-directed learning in areas such as Copilot, D365, Power Platform, business value realization, and adoption and change management. Build and maintain technical readiness aligned to Customer Success Unit (CSU) priorities and corporate initiatives
  • Accelerate Customer Outcomes: Engage in relevant communities to share expertise, contribute to IP creation, prioritize IP re-use and learn from others to help accelerate your customers transformation journey
  • Fulltime
Read More
Arrow Right