CrawlJobs Logo

Principal Data Platform Engineer

Malaysia, Batu Kawan, Penang · Job Posted January 13, 2026
Apply Position
Job Link Share

Job Description

We are seeking a Principal Data Platform Engineer to define, evolve, and scale our enterprise data platform. This role is data-first and architecture-driven, with hands-on impact across ETL/ELT, big data, streaming, and cloud data platforms. You will act as a technical authority, shaping long-term platform direction, setting engineering standards, and mentoring senior engineers across teams.

Job Responsibility

  • Own and define data platform architecture, standards, and long-term technical roadmap
  • Design and oversee scalable ETL/ELT pipelines using Python across multiple data domains
  • Establish data ingestion and data access APIs using Python and FastAPI for platform consistency and reuse
  • Lead design and optimization of batch and streaming pipelines using Spark and Apache Kafka
  • Drive architecture decisions for relational databases (MySQL, Oracle), cloud data warehouses (AWS Redshift), and NoSQL systems (Elasticsearch)
  • Guide large-scale data processing using Hive, Trino, and Hadoop distributed computing
  • Define standards for object and file storage integrations (AWS S3, Dell ECS, SFTP)
  • Enable data quality, lineage, governance, and reliability at platform scale
  • Support analytics and BI enablement (Power BI, Spotfire) through well-modeled datasets
  • Contribute to lightweight internal UIs using React for data observability, configuration, or platform tooling (custom, not product UI)
  • Mentor senior and junior engineers
  • influence architecture across teams

Requirements

  • 10+ years of experience in data engineering, data platform, or large-scale distributed data systems
  • Demonstrated ownership of data platform architecture in complex environments
  • Deep expertise in Python, SQL, and ETL/ELT design
  • Strong hands-on experience with distributed data systems, including: Big data: Spark, Hive, Trino, Hadoop
  • Streaming: Apache Kafka
  • Experience designing platforms using: MySQL, Oracle, AWS Redshift
  • Elasticsearch or other NoSQL systems
  • Experience building data-focused APIs using FastAPI
  • Familiarity with AWS cloud, Docker, Kubernetes and CI/CD at production scale
  • Proven ability to influence without authority, mentor senior engineers, and drive cross-team alignment

Nice to have

  • Lakehouse technologies (Delta Lake, Iceberg)
  • Workflow orchestration (Airflow, Dagster, Prefect)
  • MLOps platforms (MLflow, Kubeflow, SageMaker)
  • Real-time processing frameworks (Flink, Kafka Streams)
  • Knowledge of data governance and compliance (GDPR, CCPA)
  • Experience building internal tooling UIs using React

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Principal Data Platform Engineer

8 matching positions

Principal Data Platform Engineer Vice President

We are seeking an exceptionally skilled and motivated Principal Data Platform En...
Location
Location
United States , Irving; Jacksonville
Salary
Salary:
125760.00 - 188640.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of relevant experience in Apps Development or systems analysis role with extensive Python and/or big data expertise
  • Python/Pyspark Mastery
  • Big Data Technologies & Platforms: Extensive experience (8+ years) architecting, designing, implementing, and managing solutions within distributed data processing platforms, specifically with the Hadoop ecosystem (preferably Cloudera distributions). Proficient in leveraging key big data components such as distributed file systems (e.g., HDFS), data warehousing solutions (e.g., Hive), data transformation frameworks (e.g., Pig), and data ingestion tools (e.g., Sqoop), alongside hands-on experience with NoSQL databases (preferably MongoDB)
  • ETL Architecture & Development: Proven ability to architect, design, and implement highly scalable data pipelines. Extensive experience leveraging industry-standard ETL tools and frameworks for efficient data extraction, transformation, and loading into various relational databases and data warehouses, coupled with a strategic vision and planning experience for migrating to cloud-native, serverless ETL solutions
  • Data Architecture & Strategic Modeling: Understanding of advanced data modeling principles and practical experience in data warehouse design and development, ensuring data integrity, scalability, security, and optimal performance
  • AI-Powered Development: Ability to leverage advanced AI tools, such as Devin, for efficient code refactoring, optimization, and identifying potential code improvements, thereby enhancing code quality and developer productivity
  • DevOps, Version Control & Containerization: CI/CD pipelines, Git, Docker and Kubernetes
  • Extensive experience system analysis and in programming of software applications
  • Experience in managing and implementing successful projects
  • Subject Matter Expert (SME) in at least one area of Applications Development
Job Responsibility
Job Responsibility
  • Partner with multiple management teams to ensure appropriate integration of functions to meet goals as well as identify and define necessary system enhancements to deploy new products and process improvements
  • Resolve variety of high impact problems/projects through in-depth evaluation of complex business processes, system processes, and industry standards
  • Provide expertise in area and advanced knowledge of applications programming and ensure application design adheres to the overall architecture blueprint
  • Utilize advanced knowledge of system flow and develop standards for coding, testing, debugging, and implementation
  • Develop comprehensive knowledge of how areas of business, such as architecture and infrastructure, integrate to accomplish business goals
  • Provide in-depth analysis with interpretive thinking to define issues and develop innovative solutions
  • Serve as advisor or coach to mid-level developers and analysts, allocating work as necessary
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency
What we offer
What we offer
  • medical, dental & vision coverage
  • 401(k)
  • life, accident, and disability insurance
  • wellness programs
  • paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
  • Fulltime
Read More
Arrow Right

Principal Platform Engineer - Data & AI

As a Principal Platform Engineer - Data & AI, you will architect and lead Data a...
Location
Location
United States , San Diego
Salary
Salary:
190000.00 - 284000.00 USD / Year
resmed.com Logo
ResMed
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS/MS in Computer Science or equivalent experience
  • 10+ years architecting AI/ML and Data Platform solutions
  • Experience building and scaling ML platforms for model training, deployment, and monitoring using large datasets
  • Expertise in modern technology stacks, APIs, microservices, and cloud Data and AI platforms
  • Excellent communication skills with ability to present to all levels of leadership
  • Demonstrated agile mindset focused on iterative improvement
Job Responsibility
Job Responsibility
  • Lead the technical evolution of data and AI/ML capabilities, ensuring incremental solution delivery and proactive risk management
  • Develop deep technical expertise across AI domains to guide cross-team initiatives and optimize productivity
  • Set and maintain high standards for data quality, platform reliability, and security while addressing scalability challenges
  • Drive strategic technical decisions that balance business needs with long-term sustainability across data and AI/ML initiatives
  • Mentor and coach data engineering teams, collaborate with engineering leadership, and provide actionable feedback for team growth
  • Effectively communicate technical concepts to both technical and non-technical stakeholders at all management levels
  • Provide thought leadership in data architecture, governance, and technology selection to shape company-wide technical direction
What we offer
What we offer
  • comprehensive medical, vision, dental, and life, AD&D, short-term and long-term disability insurance
  • sleep care management
  • Health Savings Account (HSA)
  • Flexible Spending Account (FSA)
  • commuter benefits
  • 401(k)
  • Employee Stock Purchase Plan (ESPP)
  • Employee Assistance Program (EAP)
  • tuition assistance
  • fifteen days Paid Time Off (PTO) in their first year of employment
  • Fulltime
Read More
Arrow Right

Principal Software Engineer - Data Platform

Arcesium seeks an exceptional Principal Software Engineer – Data Platform to joi...
Location
Location
United States , New York
Salary
Salary:
200000.00 - 250000.00 USD / Year
arcesium.com Logo
Arcesium
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong academic background in computer science
  • 5+ years of relevant experience as a software engineer at a top startup or technology firm
  • Expertise in at least one of: Kafka, ZeroMQ, AWS SNS/SQS, or equivalent streaming technology
  • Expertise in at least one of: Spark, Flink, Beam, or equivalent streaming data processing frameworks
  • Expertise in at least one of: Iceberg, delta lake, or other data lake table formats and supporting tooling
  • Hands-on experience in building near real-time data ETL, ELT and normalization
  • Fluency with Java, Kotlin, Scala, or any other JVM language
  • Familiarity with relational databases like Postgres, MSSQL, or Sqlite
  • Track record and appetite for building and launching new data products from scratch
  • Experience guiding and mentoring highly skilled engineers and driving large scale engineering projects across the firm
Job Responsibility
Job Responsibility
  • Build next generation technology used by some of the most sophisticated financial institutions in the world
  • Design exciting new products for our offering with best of breed distributed systems technologies
  • Drive solutions around data modeling, data pipelining & orchestration, data transport, and access control
  • Propose and review solution design across engineering teams as they onboard onto the data platform
  • Lead high-visibility engineering efforts on some of our data intensive core components
  • Take a hands-on technical leadership role in guiding a group of equally talented engineers to own and deliver high quality solutions under tight market-driven deadlines
What we offer
What we offer
  • Variable compensation in the form of a year-end bonus, guaranteed in the first year of hire
  • Medical and prescription drug coverage
  • 401k contribution matching
  • Fulltime
Read More
Arrow Right

Senior Principal Data Platform Software Engineer

We’re looking for a Sr Principal Data Platform Software Engineer (P70) to be a k...
Location
Location
Salary
Salary:
239400.00 - 312550.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years in Data Engineering, Software Engineering, or related roles, with substantial exposure to big data ecosystems
  • Demonstrated experience building and operating data platforms or large‑scale data services in production
  • Proven track record of building services from the ground up (requirements → design → implementation → deployment → ongoing ownership)
  • Hands‑on experience with AWS, GCP (e.g., compute, storage, data, and streaming services) and cloud‑native architectures
  • Practical experience with big data technologies, such as Databricks, Apache Spark, AWS EMR, Apache Flink, or StarRocks
  • Strong programming skills in one or more of: Kotlin, Scala, Java, Python
  • Experience leading cross‑team technical initiatives and influencing senior stakeholders
  • Experience mentoring Staff/Principal engineers and lifting the technical bar for a team or org
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience
Job Responsibility
Job Responsibility
  • Design, develop and own delivery of high quality big data and analytical platform solutions aiming to solve Atlassian’s needs to support millions of users with optimal cost, minimal latency and maximum reliability
  • Improve and operate large‑scale distributed data systems in the cloud (primarily AWS, with increasing integration with GCP and Kubernetes‑based microservices)
  • Drive the evolution of our high-performance analytical databases and its integrations with products, cloud infrastructures (AWS and GCP) and isolated cloud environments
  • Help define and uplift engineering and operational standards for petabyte scale data platforms, with sub‑second analytic queries and multi‑region availability (coding guidelines, code review practices, observability, incident response, SLIs/SLOs)
  • Partner across multiple product and platform teams (including Analytics, Marketplace/Ecosystem, Core Data Platform, ML Platform, Search, and Oasis/FedRAMP) to deliver company‑wide initiatives that depend on reliable, high‑quality data
  • Act as a technical mentor and multiplier, raising the bar on design quality, code quality, and operational excellence across the broader team
  • Design and implement self‑healing, resilient data platforms with strong observability, fault tolerance, and recovery characteristics
  • Own the long‑term architecture and technical direction of Atlassian’s product data platform with projects that are directly tied to Atlassian’s company-level OKRs
  • Be accountable for the reliability, cost efficiency, and strategic direction of Atlassian’s product analytical data platform
  • Partner with executives and influence senior leaders to align engineering efforts with Atlassian’s long-term business objectives
What we offer
What we offer
  • health and wellbeing resources
  • paid volunteer days
  • Fulltime
Read More
Arrow Right

Principal Data Genai Platform Engineer - Senior Vice President

Location
Location
India , Chennai
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years of relevant experience in enterprise application development, data engineering, or AI platform engineering, with a strong track record of leadership in regulated environments
  • 8+ years of experience leading multi-team Agile organizations (20+ engineers), including managing distributed and hybrid AI-assisted teams
  • Advanced expertise in Python, PySpark, and Databricks ecosystem for large-scale data processing and ELT/ETL pipelines
  • Proven experience architecting and implementing enterprise AI/GenAI platforms, including agentic AI frameworks, LLM integrations, and prompt engineering
  • Hands-on experience with AI-assisted development tools such as Devin.AI and GitHub Copilot and integrating them into engineering workflows
  • Strong experience with microservices architecture, APIs, and cloud-native deployment (Kubernetes/OpenShift)
  • Strong experience with event-driven architectures and streaming platforms (Kafka)
  • Deep understanding of data architecture, data mesh, data federation, and regulatory data requirements
  • Exceptional leadership, communication, stakeholder management, and decision-making capabilities
  • Experience with cloud platforms (AWS, Azure, GCP, Databricks) and modern data ecosystems
Job Responsibility
Job Responsibility
  • Lead multiple agile scrum teams comprising ~15+ engineers, including hybrid teams of human engineers and AI-assisted development (Devin.AI, Copilot), ensuring delivery excellence and alignment with business priorities
  • Define and execute the enterprise strategy for Python engineering, AI agent platforms, and full-stack data applications, aligned with Retail and Wealth Risk objectives
  • Serve as the senior architect and technical authority for enterprise-scale AI agents, data engineering pipelines, and microservices-based applications, ensuring scalability, resilience, and security
  • Drive the adoption and operationalization of AI Product Development Lifecycle (AI PDLC), including model governance, evaluation, deployment, monitoring, and compliance with Model Risk Management (MRM)
  • Lead development of high-volume data pipelines and data federation layers using PySpark, Databricks, Kafka, and Data Mesh architecture to support regulatory reporting (CCAR, FDIC) and risk analytics
  • Architect and oversee GenAI agent ecosystems using LLMs (Google ADK, Gemini/Flash), implementing Human-in-the-Loop (HITL) frameworks to ensure explainability, auditability, and compliance
  • Drive AI-augmented software development lifecycle, integrating tools such as Devin.AI, GitHub Copilot, and MCP platforms through advanced prompt engineering and governance guardrails
  • Lead microservices and cloud-native architecture using FastAPI/Spring Boot, Kubernetes/OpenShift, and CI/CD pipelines, ensuring high availability and performance
  • Drive engineering efficiency and standardization by reusing and repurposing enterprise-level frameworks, platforms, and tools, reducing duplication and accelerating delivery across teams
  • Ensure all engineering solutions incorporate data governance and non-functional requirements, including Data Quality (DQ), data lineage, data tracing, and auditability, aligned with enterprise governance processes and regulatory expectations
  • Fulltime
Read More
Arrow Right

Principal ML Systems Engineer, Data Platform (Autonomous Vehicles)

We are seeking a highly skilled and experienced Principal ML Systems Engineer to...
Location
Location
United States , Austin; Bellevue
Salary
Salary:
233400.00 - 339650.00 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BA or MS in Computer Science, Electrical Engineering, Mathematics, Physics, or another relevant field
  • 10+ years in building distributed data platforms using major cloud providers and open-source frameworks
  • Expert-level proficiency in Java, C++, or Python, with a proven track record of designing and implementing robust, distributed systems
  • Expertise in implementing Data Processing Frameworks (Beam, Spark) and serving layers optimized for high-throughput, low-latency delivery
  • Experience optimizing services for cost efficiency, performance & reliability
  • Experience with Micro services architecture and proven ability to manage the full operational lifecycle of systems
  • Deep understanding of full ML model lifecycle (feature engineering, training, validation, deployment, monitoring, etc.)
  • Strong passion, and understanding about self-driving technology and its potential impact on the world
  • Experience working with (100+) petabyte-scale ingestion, processing, and serving architectures
  • Experience with SQL engines / queries
Job Responsibility
Job Responsibility
  • Design & develop the next generation distributed ML data platform (Ingestion, Processing, Serving) using GCP and open-source frameworks
  • Leading the strategy of building performant and efficient multi-cloud platforms
  • Collaborate with stakeholders (ML & Data Engineers), translate needs & pain points into requirements, build self-serve capabilities and drive adoption
  • Deliver e2e technical projects owning major technical decisions and tradeoffs & contribute to the team’s strategic roadmap
  • Champion engineering & operational excellence by continuously improving systems and processes
  • Actively participate in team’s planning, code reviews and design discussions
  • Conduct technical interviews, onboard new and mentor junior engineers
What we offer
What we offer
  • Relocation benefits
  • Company vehicle evaluation program
  • medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • Fulltime
Read More
Arrow Right

Principal Software Engineer, Financial Data Platform & GenSearch Agent

At AlphaSense, GenSearch is a production, customer-facing generative AI system d...
Location
Location
United States
Salary
Salary:
246000.00 - 339000.00 USD / Year
alpha-sense.com Logo
AlphaSense
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years building and operating production systems
  • Experience operating at a Staff or Principal level
  • Proven ability to simplify complex systems and reduce architectural entropy
  • Strong background in distributed systems and performance optimization
  • Experience building AI-integrated systems (LLM orchestration, tool usage, retrieval, evaluation)
  • Comfortable working across backend systems and modern frontend frameworks (e.g., React)
  • Experienced designing APIs and durable service contracts
  • Effective at driving cross-team technical initiatives
Job Responsibility
Job Responsibility
  • Improve AI & Performance
  • Optimize orchestration between LLM workflows, structured datasets, and services
  • Introduce performance benchmarks, observability standards, and SLO discipline
  • Improve reliability and correctness of AI-backed financial responses
  • Simplify Architecture
  • Reduce fragmentation across datasets and services
  • Partner with data platform teams to define scalable dataset integration patterns
  • Remove legacy complexity while continuing to ship
  • Build & Evolve Web Products
  • Contribute directly to backend and frontend systems powering financial data experiences
  • Fulltime
Read More
Arrow Right

Principal Software Engineer--Web data platform

We are looking for a talented and experienced Principal Software Engineer to joi...
Location
Location
China , Beijing
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 7+ years of professional software development experience
  • Proficiency in at least one high-level programming language including but not limited to: C++, C#, Java, Golang or Rust
  • Good communication, collaboration and problem-solving skills
  • Fluent English speaking and writing
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Design the architecture of Crawler system, ensuring scalability, efficiency, performance and quality
  • Implement features in a distributed and scalable environment using data to guide and measure success
  • Develop and execute unit, integration, and performance tests to ensure the system is reliable, robust, and meets quality standards
  • Debug and mitigate incidents in live production environments
  • Stay up to date with the latest industry trends and technologies and proactively suggest improvements to the existing system
  • Collaborate closely with cross-functional teams, including product managers and other engineers, to align development goals with business objectives
  • Fulltime
Read More
Arrow Right