Big Data Developer

Canada, Mississauga · 120800.00 - 170800.00 USD / Year · Job Posted March 01, 2026

Job Description

The Big Data Developer is a senior-level position responsible for establishing and implementing scalable, efficient big data application systems and platforms, primarily across Hadoop/Spark and cloud environments, in coordination with the Technology team. The overall objective of this role is to lead big data systems analysis, data engineering, and applications programming activities.

Job Responsibility

  • Partner with multiple management teams to ensure appropriate integration of functions to meet goals, and to identify and define necessary platform and system enhancements to deploy new data products and process improvements
  • Design and implement scalable and efficient Hadoop architecture solutions encompassing core ecosystem components, including HDFS, YARN, MapReduce, Hive, HBase, and Spark
  • Collaborate with data engineers, data scientists, and analytics stakeholders to understand data requirements and deliver robust, reliable pipelines and analytical datasets
  • Develop Spark/PySpark solutions to support near real-time data ingestion, analytics, and reporting, ensuring high performance and reliability
  • Optimize Hadoop and Spark clusters for performance and resource utilization, including capacity planning, tuning, and job orchestration best practices
  • Maintain and monitor Hadoop infrastructure to ensure high availability, reliability, and observability; implement proactive alerting, logging, and issue resolution
  • Implement and enforce data security and governance policies (e.g., access controls, encryption, data quality, lineage, and cataloging) across big data platforms
  • Troubleshoot and resolve issues across the Hadoop ecosystem (jobs, services, resource management), driving root-cause analysis and permanent fixes
  • Provide subject-matter expertise and advanced knowledge of applications programming, ensuring that application and data solution designs adhere to the overall architecture blueprint and cloud reference patterns
  • Utilize advanced knowledge of system flow to develop standards for coding, testing, debugging, deployment, and implementation—leveraging Python, PySpark, Unix/Linux, and SQL
  • Develop comprehensive knowledge of how architecture, data platforms, and infrastructure integrate to accomplish business goals, including data modeling, ETL processes, data warehousing, and cloud-native services (AWS, Azure, Google Cloud)
  • Provide in-depth analysis with interpretive thinking to define issues and develop innovative, scalable solutions aligned with business and regulatory requirements
  • Serve as advisor or coach to mid-level developers and analysts, allocating work as necessary and uplifting engineering practices through code reviews and mentorship
  • Stay updated with the latest advancements in Hadoop/big data technologies and related areas; evaluate and introduce improvements, including AI/ML lifecycle management, MLOps, and GenAI-adjacent integrations where appropriate
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm’s reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency

Requirements

  • 6+ years of relevant experience in Big Data/Application Development or systems analysis roles, including building and operating production-grade data pipelines on Hadoop/Spark
  • Extensive experience in system analysis and in programming of big data applications and data platforms
  • Proven experience designing and managing Hadoop-based architectures, including cluster configuration, resource management (YARN), and ecosystem integration
  • Strong understanding and hands-on expertise with the Hadoop ecosystem: HDFS, YARN, MapReduce, Hive, HBase, and Spark
  • Strong hands-on and architectural knowledge of Python, PySpark, Unix/Linux, and SQL
  • Experience with data modeling, ETL processes, and data warehousing concepts and implementation
  • Experience implementing data security and governance (e.g., RBAC, encryption, data quality, data lineage, catalog)
  • Exposure to AI/ML lifecycle management, MLOps, and GenAI solution patterns and integration points
  • Experience with major cloud platforms—AWS, Azure, Google Cloud—and related big data services (e.g., EMR, HDInsight, Dataproc, Databricks)
  • Subject Matter Expert (SME) in at least one area of Big Data/Application Development (e.g., Spark performance tuning, Hive optimization, HBase administration, data security)
  • Experience in managing and implementing successful projects; demonstrated leadership and project management skills
  • Ability to adjust priorities quickly as circumstances dictate
  • Consistently demonstrates clear and concise written and verbal communication
  • Technical Proficiencies: Hadoop, HDFS, YARN, MapReduce, Hive, HBase, Spark; SQL; Data modeling; ETL; Data warehousing; AWS/Azure/Google Cloud
  • Bachelor’s degree/University degree or equivalent experience
  • Master’s degree preferred

Similar Jobs for Big Data Developer

8 matching positions

Big Data Developer

The Applications Development Intermediate Programmer Analyst is an intermediate ...
Location: India, Pune
Salary: Not provided
Company: Citi (https://www.citi.com/)
Expiration Date: Until further notice
Requirements
  • Minimum of 5 years of overall IT experience, with at least 3 years of hands-on experience in Big Data technologies
  • Proven experience working with large-scale, high-volume datasets in distributed environments
  • Strong proficiency in Hadoop ecosystem tools, including HDFS (Hadoop Distributed File System) for data storage, Hive for data querying and warehousing, and Sqoop for data ingestion from relational databases
  • Advanced knowledge of Apache Spark, including Spark Core, Spark SQL, and Spark Streaming (preferred), as well as performance tuning and optimization techniques (e.g., partitioning, caching, memory management)
  • Solid programming skills in Python and PySpark for data processing and pipeline development
  • Strong command of SQL for complex queries, data transformations, and performance tuning
  • Hands-on experience in data sourcing, ingestion, and extraction from multiple structured and unstructured data sources
Job Responsibility
  • Participate in the establishment and implementation of new or revised application systems and programs in coordination with the Technology team
  • Contribute to applications systems analysis and programming activities
  • Design, develop, and optimize large-scale data processing systems
  • Work closely with cross-functional teams to build efficient data pipelines, perform data analysis, and support business-critical financial solutions
Employment type: Full-time

PySpark Big Data Developer

The Applications Development Intermediate Programmer Analyst is an intermediate ...
Location: India, Pune
Salary: Not provided
Company: Citi (https://www.citi.com/)
Expiration Date: Until further notice
Requirements
  • 2-5 years of relevant experience in the Financial Service industry
  • Intermediate level experience in Applications Development role
  • Consistently demonstrates clear and concise written and verbal communication
  • Demonstrated problem-solving and decision-making skills
  • Ability to work under pressure and manage deadlines or unexpected changes in expectations or requirements
  • Bachelor’s degree/University degree or equivalent experience
  • Enterprise Application Development: 6-8 years in developing and managing enterprise-grade applications
  • Object-Oriented Programming (OOP): Solid foundation in OOP concepts
  • Big Data Development: Expertise in PySpark, HDFS, Hive, Sqoop, and Hadoop for Big Data environments
  • Database Technologies: Good exposure to SQL Server and Oracle databases
Job Responsibility
  • Utilize knowledge of applications development procedures and concepts, and basic knowledge of other technical areas to identify and define necessary system enhancements, including using script tools and analyzing/interpreting code
  • Consult with users, clients, and other technology groups on issues, recommend programming solutions, and install and support customer exposure systems
  • Apply fundamental knowledge of programming languages for design specifications
  • Analyze applications to identify vulnerabilities and security issues, as well as conduct testing and debugging
  • Serve as advisor or coach to new or lower level analysts
  • Identify problems, analyze information, and make evaluative judgements to recommend and implement solutions
  • Resolve issues by identifying and selecting solutions through the applications of acquired technical experience and guided by precedents
  • Has the ability to operate with a limited level of direct supervision
  • Can exercise independence of judgement and autonomy
  • Acts as SME to senior stakeholders and/or other team members
Employment type: Full-time

PySpark Big Data Developer

Market sales Technology is going through several transformational technology ini...
Location: India, Pune
Salary: Not provided
Company: Citi (https://www.citi.com/)
Expiration Date: Until further notice
Requirements
  • 4+ years of experience in Python, SQL, and PySpark
  • Proficiency in distributed data processing and big data tools and technologies: Hadoop, HDFS, YARN, Spark, Hive
  • APIs and backend development (FastAPI/Flask)
  • DevOps: Git, CI/CD basics, Linux/Unix commands
  • Design, develop, and maintain big data pipelines using PySpark
  • Experience with integration of data from multiple data sources
  • Experience with developer tools such as OpenShift, TeamCity, uDeploy, Bitbucket, and GitHub
  • Proactively contribute to stability of overall production systems and troubleshoot key components and processes
  • Keep track of latest technology trends and proactively learn new technologies driving practical and reference implementations
  • Bachelor’s degree (preferably in technology/engineering or a related field)
Job Responsibility
  • Work with a cross-functional and geographically dispersed team for quality assurance
  • Perform end-to-end feature development for the application, including requirements understanding, impact analysis, development and execution, production deployment, and maintenance
  • Work with multiple teams to develop and execute features
  • Work with teams in a collaborative style to engage partners and proactively manage activities
What we offer
  • Opportunity for professional development in an international and multicultural organization
  • Unique opportunity to participate in global investment banking projects
  • Internal and external trainings
  • Development opportunities and challenging assignments
  • Attractive and stable employment conditions
  • Social benefits (medical care, Benefit System, life insurance, pension scheme)
  • Flexible working hours
Employment type: Full-time

Hadoop / Big Data Developer

We are currently looking for a candidate for the position of Hadoop / Big Data D...
Location: Poland, Warsaw
Salary: Not provided
Company: Astek (astek.pl)
Expiration Date: Until further notice
Requirements
  • 3+ years of Java 8+ and/or Scala experience
  • Experience in working in Spark
  • Knowledge of Linux Shell Scripting
  • Knowledge of SQL
  • Knowledge of the Hadoop stack (YARN, Sqoop, Hive, Impala, MapReduce, Oozie, etc.)
  • Familiar with version control and CI/CD tools (Git, Ansible, Bamboo, Jenkins…)
Job Responsibility
  • Design, develop, and maintain scalable Big Data solutions using Hadoop ecosystem technologies
  • Build and optimize data pipelines
  • Develop high‑quality, efficient, and reusable code using technologies such as Java, SQL, Hive, Spark, and related tools
  • Work closely with business stakeholders and product owners
  • Participate actively in SAFe ceremonies, including PI planning, sprint planning, daily stand‑ups, reviews, and retrospectives
  • Optimize existing Big Data processes and queries for performance and cost efficiency
  • Collaborate with cross‑functional teams (developers, architects, QA, DevOps) to deliver end‑to‑end solutions
  • Support deployment, monitoring, and troubleshooting of Big Data applications in production environments
What we offer
  • Long-term collaboration
  • Technical training, certifications, and skills development
  • Competence Center mentoring
  • Clear career path
  • Employee benefits package (Multisport, private healthcare, life insurance)
  • Friendly working atmosphere, team-building events, and team-building meetings
Employment type: Full-time

Java Developer – Big Data Processing

We are building the next generation of large-scale recommendation and personaliz...
Location: Romania
Salary: Not provided
Company: ddroidd (ddroidd.com)
Expiration Date: Until further notice
Requirements
  • Strong hands-on experience with Java and backend engineering
  • Solid knowledge of design patterns, clean code principles, and software architecture
  • Proven experience building and operating distributed systems
  • Strong experience within the JVM ecosystem
  • Practical experience with Big Data processing, including Apache Spark and Spark SQL
  • Mandatory experience working in enterprise-level production environments
  • Experience building scalable backend platforms and microservices
  • Strong problem-solving mindset and ownership attitude
Job Responsibility
  • Design, develop, and maintain high-performance backend services using Java and Spring Boot
  • Build and optimize large-scale data processing pipelines using Apache Spark and Spark SQL
  • Contribute to the architecture and implementation of distributed systems and data-driven platforms
  • Collaborate closely with data scientists and ML engineers to productionize machine learning models
  • Develop and maintain high-scale recommendation and personalization engines
  • Ensure clean, maintainable, and well-tested code, applying best practices and design patterns
  • Improve system performance, scalability, and reliability in enterprise production environments
  • Participate in technical design discussions and contribute to long-term platform evolution
  • Support orchestration and automation of data workflows
  • Mentor engineers and promote strong engineering standards within the team
What we offer
  • Private medical insurance
  • National holidays off, even when falling on weekends
  • Loyalty leave: +1 day/year
  • Continuous professional development opportunities
  • Sports subscription programs
  • Referral bonuses for bringing in new talent
  • Meal tickets
  • Bookster subscription for reading & learning
  • Community and team-building events
  • Flexible and unlimited remote work policy
Employment type: Full-time

Lead Big Data Developer

We are looking for an experienced Lead Big Data Developer with strong expertise ...
Location: India, Haveli
Salary: Not provided
Company: Wissen (votredircom.fr)
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Computer Science, Information Technology, or a related field
  • 7-9 years of experience in data engineering, with a focus on big data technologies
  • Strong experience with AWS services, particularly EMR, S3, Redshift, Lambda, and Glue
  • Proficiency in Java programming
  • Experience with big data frameworks and tools such as Hadoop, Spark, Hive, and Pig
  • Solid understanding of data modelling, ETL processes, and data warehousing concepts
  • Experience with SQL and NoSQL databases
  • Familiarity with CI/CD pipelines and version control systems (e.g., Git)
  • Strong problem-solving skills and the ability to work independently and collaboratively in a team environment
Job Responsibility
  • Design, develop, and maintain data pipelines on AWS EMR (Elastic MapReduce) to support data processing and analytics
  • Implement data ingestion processes from various sources including APIs, databases, and flat files
  • Optimize and tune big data workflows for performance and scalability
  • Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and deliver solutions
  • Manage and monitor EMR clusters, ensuring high availability and reliability
  • Develop ETL (Extract, Transform, Load) processes to cleanse, transform, and store data in data lakes and data warehouses
  • Implement data security best practices to ensure data is protected and compliant with relevant regulations
  • Create and maintain technical documentation related to data pipelines, workflows, and infrastructure
  • Troubleshoot and resolve issues related to data processing and EMR cluster performance
Employment type: Full-time

Senior Big Data Developer

The Applications Development Senior Programmer Analyst is an intermediate level ...
Location: India, Pune
Salary: Not provided
Company: Citi (https://www.citi.com/)
Expiration Date: Until further notice
Requirements
  • 8-14 years of relevant experience
  • Experience in systems analysis and programming of software applications
  • Experience in managing and implementing successful projects
  • Working knowledge of consulting/project management techniques/methods
  • Ability to work under pressure and manage deadlines or unexpected changes in expectations or requirements
  • Bachelor’s degree/University degree or equivalent experience
  • Strong Object-Oriented Programming (OOP) concepts
  • Proficient in Python (specifically for PySpark)
  • Extensive experience with Apache Spark (PySpark), Hadoop, and related components like Hive and Sqoop
  • Skilled in writing shell scripts
Job Responsibility
  • Conduct tasks related to feasibility studies, time and cost estimates, IT planning, risk technology, applications development, model development, and establish and implement new or revised applications systems and programs to meet specific business needs or user areas
  • Monitor and control all phases of development process and analysis, design, construction, testing, and implementation as well as provide user and operational support on applications to business users
  • Utilize in-depth specialty knowledge of applications development to analyze complex problems/issues, provide evaluation of business process, system process, and industry standards, and make evaluative judgement
  • Recommend and develop security measures in post implementation analysis of business usage to ensure successful system design and functionality
  • Consult with users/clients and other technology groups on issues, recommend advanced programming solutions, and install and assist customer exposure systems
  • Ensure essential procedures are followed and help define operating standards and processes
  • Serve as advisor or coach to new or lower level analysts
  • Has the ability to operate with a limited level of direct supervision
  • Can exercise independence of judgement and autonomy
  • Acts as SME to senior stakeholders and/or other team members
Employment type: Full-time

Senior Python Big Data Developer

The Applications Development Senior Programmer Analyst is an intermediate level ...
Location: India, Pune
Salary: Not provided
Company: Citi (https://www.citi.com/)
Expiration Date: Until further notice
Requirements
  • 7-12 years of relevant experience
  • Experience in systems analysis and programming of software applications
  • Experience in managing and implementing successful projects
  • Working knowledge of consulting/project management techniques/methods
  • Ability to work under pressure and manage deadlines or unexpected changes in expectations or requirements
  • Bachelor’s degree/University degree or equivalent experience
  • Strong expertise in Big Data technologies (Spark, Hadoop, Hive, Impala, Kafka, Scala, Cloudera)
  • Design, develop, and maintain robust and scalable data pipelines using Python, SQL, PySpark, and streaming technologies like Kafka
  • Strong SQL and NoSQL experience (Oracle, MongoDB, PostgreSQL) for data extraction, reconciliation, and transformation
  • Proficiency in Python and Shell scripting for data processing and automation
Job Responsibility
  • Conduct tasks related to feasibility studies, time and cost estimates, IT planning, risk technology, applications development, model development, and establish and implement new or revised applications systems and programs to meet specific business needs or user areas
  • Monitor and control all phases of development process and analysis, design, construction, testing, and implementation as well as provide user and operational support on applications to business users
  • Utilize in-depth specialty knowledge of applications development to analyze complex problems/issues, provide evaluation of business process, system process, and industry standards, and make evaluative judgement
  • Recommend and develop security measures in post implementation analysis of business usage to ensure successful system design and functionality
  • Consult with users/clients and other technology groups on issues, recommend advanced programming solutions, and install and assist customer exposure systems
  • Ensure essential procedures are followed and help define operating standards and processes
  • Serve as advisor or coach to new or lower level analysts
  • Has the ability to operate with a limited level of direct supervision
  • Can exercise independence of judgement and autonomy
  • Acts as SME to senior stakeholders and/or other team members
Employment type: Full-time