Sr. Data Analyst - Databricks

United States, Atlanta, GA · Job Posted February 08, 2026

Job Description

Serve as an independent QA / Quality Review (QR) resource providing effective challenge over the Enterprise Data Mart build on Databricks. The role focuses on validating data mappings, transformation logic, and end-to-end data integrity prior to Lending and Retail LOB consumption. The work supports a merger between two large financial entities and is expected to run through March 2027 and beyond.

Job Responsibility

  • Review source-to-target mappings, data models, and transformation logic for completeness, accuracy, and alignment to business intent
  • Execute SQL-based validation and reconciliation across source, curated, and consumption layers
  • Perform completeness, accuracy, reasonableness, and threshold checks on critical data elements
  • Identify data gaps, anomalies, undocumented assumptions, and logic defects
  • Document findings clearly
  • Partner with data engineers and product owners to remediate issues prior to release
  • Support go-live readiness decisions for Lending and Retail data products
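
As a rough illustration of the SQL-based validation and reconciliation described above, the sketch below compares row counts, totals, and missing keys between a source and a target table. It is a minimal example under stated assumptions, not the employer's actual method: the table names (src_loans, tgt_loans), the columns (loan_id, balance), and the reconcile helper are hypothetical, and an in-memory SQLite database stands in for Databricks.

```python
import sqlite3


def reconcile(conn, source, target, key, amount):
    """Compare row counts, amount totals, and missing keys between two tables.

    Table and column names are interpolated directly into the SQL, so they
    must come from trusted configuration, never from user input.
    """
    cur = conn.cursor()
    src_count, src_sum = cur.execute(
        f"SELECT COUNT(*), COALESCE(SUM({amount}), 0) FROM {source}"
    ).fetchone()
    tgt_count, tgt_sum = cur.execute(
        f"SELECT COUNT(*), COALESCE(SUM({amount}), 0) FROM {target}"
    ).fetchone()
    # Keys present in the source but absent from the target (completeness check).
    missing = [
        row[0]
        for row in cur.execute(
            f"SELECT s.{key} FROM {source} s "
            f"LEFT JOIN {target} t ON s.{key} = t.{key} "
            f"WHERE t.{key} IS NULL ORDER BY s.{key}"
        )
    ]
    return {
        "count_diff": src_count - tgt_count,
        "amount_diff": round(src_sum - tgt_sum, 2),
        "missing_in_target": missing,
    }


# Hypothetical sample data: one loan record never reached the target table.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE src_loans (loan_id INTEGER PRIMARY KEY, balance REAL);
    CREATE TABLE tgt_loans (loan_id INTEGER PRIMARY KEY, balance REAL);
    INSERT INTO src_loans VALUES (1, 100.0), (2, 250.5), (3, 75.25);
    INSERT INTO tgt_loans VALUES (1, 100.0), (2, 250.5);
    """
)
result = reconcile(conn, "src_loans", "tgt_loans", "loan_id", "balance")
print(result)
```

A non-zero count or amount difference, or a non-empty list of missing keys, would be the kind of finding a reviewer documents and raises with data engineers before release.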

Requirements

  • Strong hands-on SQL experience (ability to independently query, reconcile, and validate large datasets)
  • Experience reviewing data mappings, transformation rules, and data pipelines (Databricks / Spark-based environments preferred)
  • Familiarity with financial services data (Lending, Retail Banking, or Enterprise data domains)
  • Ability to operate independently and challenge constructively without slowing delivery

What we offer

  • Medical
  • Vision
  • Dental
  • Life and disability insurance
  • 401(k) plan

Similar Jobs for Sr. Data Analyst - Databricks

8 matching positions

Sr Data Analyst

Resource Informatics Group, Inc. is actively seeking a skilled Sr Data Analyst t...
Location: United States, Irving
Salary: Not provided
Resource Informatics Group
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree or foreign equivalent in Computer Science, Data Science, Data Analytics, Business Analytics, or a related field and six (6) years of experience in Data Analytics
  • Alternatively, a Master’s degree or foreign equivalent in Computer Science, Data Science, Data Analytics, Business Analytics, or a related field and three (3) years of experience in Data Analytics
  • Required skills include experience in SQL, Tableau, Python, data analysis, Unix scripting, AWS, Jira, Software Development Life Cycle (SDLC), Agile, and PL/SQL
  • Expertise in Machine Learning and Data Science, along with strong problem-solving capabilities and collaboration skills, is essential
Job Responsibility
  • Developing Python scripts to automate validations for data quality and quantity
  • Creating and maintaining automation scripts using Python and Linux Shell Scripting for platform maintenance activities and health checks
  • Extracting data from Salesforce Cloud and ensuring adherence to PRD specifications
  • Reconciling data between source and target tables, ensuring data consistency, accuracy, and integrity
  • Analyzing client data files and developing procedures for preparing, transforming, and cleansing data prior to analysis
  • Generating and documenting reports using Python, SQL, and Excel, and extracting data from disparate sources using technologies like Snowflake and Redshift
  • Creating scripts in Databricks to migrate data from various sources to the data warehouse
  • Performing data cleaning, transformation, and enrichment, and applying data mining techniques for analysis
  • Designing metrics and building dashboards in Tableau to track customer behavior, KPIs, and marketing effectiveness
  • Creating PL/SQL reports and moving them to AWS Redshift
Employment Type: Full-time

Sr Data Engineer

As a Senior Data Engineer at Amgen, you will be responsible for managing and opt...
Location: India, Hyderabad
Salary: Not provided
Amgen
Expiration Date: Until further notice
Requirements
  • Bachelor’s or Master’s degree with 8-13 years of experience in Computer Science, IT or related field
  • Hands-on experience in data engineering technologies such as Databricks, PySpark, SparkSQL, Apache Spark, AWS, Python, SQL, and Scaled Agile methodologies
  • Proficiency in workflow orchestration and performance tuning for big data processing
  • Strong understanding of AWS services
  • Experience with Data Fabric, Data Mesh, or similar enterprise-wide data architectures
  • Ability to quickly learn, adapt and apply new technologies
  • Strong problem-solving and analytical skills
  • Excellent communication and teamwork skills
  • Experience with Scaled Agile Framework (SAFe), Agile delivery practices, and DevOps practices
  • Excellent analytical and troubleshooting skills
Job Responsibility
  • Design, develop, and maintain scalable ETL/ELT pipelines to support structured, semi-structured, and unstructured data processing across the enterprise, applying data engineering for Biotech or Pharma with functional knowledge of R&D
  • Implement real-time and batch data processing solutions, integrating data from multiple sources into a unified, governed data fabric architecture
  • Optimize big data processing frameworks using Apache Spark, Hadoop, or similar distributed computing technologies to ensure high availability and cost efficiency
  • Work with metadata management and data lineage tracking tools to enable enterprise-wide data discovery and governance
  • Ensure data security, compliance, and role-based access control (RBAC) across data environments
  • Optimize query performance, indexing strategies, partitioning, and caching for large-scale data sets
  • Develop CI/CD pipelines for automated data pipeline deployments, version control, and monitoring
  • Implement data virtualization techniques to provide seamless access to data across multiple storage systems
  • Collaborate with cross-functional teams, including data architects, business analysts, and DevOps teams, to align data engineering strategies with enterprise goals
  • Stay up to date with emerging data technologies and best practices, ensuring continuous improvement of Enterprise Data Fabric architectures
Employment Type: Full-time

Sr. Analyst, Marketing Delivery

Cella by Randstad Digital is partnering with a prominent North American financia...
Location: Canada, Montréal
Salary: 48.79 - 51.41 CAD / Hour
Randstad
Expiration Date: July 27, 2026
Requirements
  • SQL Expertise: 3+ years of experience programming and interpreting complex SQL queries for data extraction and analysis
  • Salesforce Marketing Cloud: 2+ years of hands-on automation and journey orchestration experience using SFMC (including Journey Builder, Email Studio, Automation Studio, and Data Extensions)
  • Regulated Industry Background: 2+ years of marketing automation or journey orchestration experience within financial services or another highly regulated industry
  • Matrixed Environments: 2+ years of experience collaborating effectively with cross-functional teams in a matrixed organization, with a proven ability to work autonomously through ambiguity
  • Technical Compliance: Strong working knowledge of CASL and Canadian digital marketing regulatory frameworks
  • Project Management: Excellent attention to detail, strong organizational habits, and experience managing multiple overlapping pipelines in a deadline-driven environment
Job Responsibility
  • Campaign Design & Execution: Create, deploy, and measure high-impact cross-channel marketing campaigns and user journeys based on strategic business goals
  • Platform Management: Build and optimize Salesforce Marketing Cloud (SFMC) journeys, managing audience entry/exit criteria, decision splits, and template setup utilizing Email Studio, Automation Studio, and Journey Builder
  • Data & Scripting: Write and execute SQL queries within Databricks to manage, filter, and segment audience data in Salesforce CDC. Utilize HTML and AMPscript to enable dynamic personalization and complex email customization
  • Quality Assurance & Troubleshooting: Oversee technical reviews, perform rigorous QA testing across various devices, and troubleshoot template rendering, subscriber lookups, and link accuracy
  • Analytics & Optimization: Provide analytical guidance to stakeholders, translate campaign results into actionable insights, and participate in optimization sessions to enhance overall marketing ROI
  • Compliance & Risk Management: Ensure all email marketing initiatives align with CASL (Canada’s Anti-Spam Legislation) and maintain a culture of regulatory control and infrastructure integrity
What we offer
  • Collaborative Innovation: Work within a North American center of excellence focused on high-quality, innovative marketing delivery
  • Skill Expansion: Deepen your expertise in advanced data environments (Databricks, complex SQL) and cutting-edge Salesforce features
  • Hybrid Flexibility: Enjoy a balanced working model situated in a vibrant downtown Toronto hub
  • Professional Growth: Collaborate with cross-functional teams in a fast-paced environment that values continuous learning and agile methodologies
Employment Type: Full-time

Sr. Data Engineer - Python Developer

Seeking a hands-on Senior Data Engineer (ETL / Python Developer) to support the ...
Location: United States, Springfield
Salary: 52.00 - 54.00 USD / Hour
Myticas Consulting
Expiration Date: Until further notice
Requirements
  • 5+ years of data engineering experience with a focus on enterprise data warehousing
  • 5+ years of hands-on ETL development using Informatica PowerCenter, Azure Data Factory, or similar tools
  • 5+ years of Python development for data engineering and automation
  • 3+ years of experience with Spark-based processing frameworks (Databricks or equivalent)
  • Strong SQL expertise and experience with relational databases (such as Teradata, Snowflake, Oracle, SQL Server)
  • Experience with source control and DevOps practices (Azure DevOps, GitHub, CI/CD)
  • Bachelor's degree or higher in Computer Science, Engineering, Analytics, or a related field
  • Strong analytical, problem-solving, and troubleshooting skills
Job Responsibility
  • Design, develop, and maintain enterprise ETL pipelines using Azure Data Factory (ADF), Informatica PowerCenter, and Python-based frameworks
  • Build and optimize scalable data processing solutions using Python, Spark, and Databricks
  • Support Medicaid analytics and federal reporting initiatives (e.g., T-MSIS, PERM, MARS, Quality of Care)
  • Develop robust data validation, reconciliation, and audit-traceable data pipelines
  • Write and optimize SQL and stored procedures across relational platforms such as Snowflake, Oracle, and SQL Server
  • Participate in cloud migration and modernization initiatives within Azure-based architectures
  • Collaborate with analysts, QA, and reporting teams to ensure data quality, accuracy, and timeliness
  • Follow data engineering best practices for performance, reliability, reusability, and security
  • Support production operations, incident resolution, and root-cause analysis
  • Participate in code reviews, source control, and CI/CD processes using Azure DevOps and GitHub
Employment Type: Full-time

Sr Data Scientists

Sr Data Scientists is located in Frisco, TX and will support teams’ mission to p...
Location: United States, Frisco
Salary: 141773.00 - 155000.00 USD / Year
T-Mobile
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Mathematics, Statistics, Economics, Computer Science, Physics, Electronic Engineering, or related, and 5 years of relevant work experience
  • Master’s degree in Mathematics, Statistics, Economics, Computer Science, Physics, Electronic Engineering, or related, and 3 years of relevant work experience
  • Experience in developing and deploying predictive models, advanced machine learning, deep learning, NLP, and generative AI solutions by applying a wide range of algorithms
  • Experience in developing solutions using Python, PySpark, SQL, and R, with libraries LangChain, LangGraph, Keras, Pandas, NumPy, SciPy, Matplotlib, and Scikit-Learn
  • Experience in working with data querying, wrangling, cleaning, and feature engineering across relational and non-relational databases: SQL, Snowflake, and Redshift in big data environments: Azure, AWS, and GCP, and leveraging Spark, Hadoop, Hive, and Kafka
  • Experience in building CI/CD pipelines, automating training and retraining workflows, deploying inference services, and monitoring ML algorithms in production environments in Databricks using tools: MLflow, and cloud-native services
  • Experience in articulating and reframing business problems, applying statistical and advanced analytics techniques in Python, R, and SQL, and leveraging SciPy, Scikit-Learn, and PySpark to generate actionable insights and recommendations
  • Experience in delivering impactful, data-driven presentations and effectively communicating machine learning and analytical concepts to technical teams, business stakeholders, and senior leadership, supported by visualizations created in Tableau, Power BI, Matplotlib, and Seaborn
  • At least 18 years of age
  • Legally authorized to work in the United States
Job Responsibility
  • Support business partners and product owners to understand business challenges, develop business cases, capture requirements, and co-create solutions that drive business change, solve those challenges, and deliver impactful business outcomes
  • Provide senior-level guidance and mentorship to the data science team, including reviewing projects, models, and code for peers and junior team members
  • Design advanced analytics to solve business problems
  • Preprocess and perform exploratory data analysis on structured and unstructured data
  • Create features based on expertise in the domain
  • Use predictive modeling techniques and statistical analysis to predict outcomes and behaviors
  • Leverage the Agile methodology to ensure alignment of data science roadmap, features, and stories to business priorities and value streams
  • Collaborate with a cross-functional team composed of other data scientists, data engineers, ML engineers, and data analysts
  • Partner with other technology partners such as architects, engineers, product managers, scrum masters, release train engineers, and agile coaches to deliver on targeted business outcomes
What we offer
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Annual bonus or periodic sales incentive or bonus
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off
  • Up to 12 paid holidays
Employment Type: Full-time

Sr Data Engineer

The Senior Data Engineer role involves leading data architecture and modernizati...
Location: India, Chennai
Salary: Not provided
NTT DATA
Expiration Date: Until further notice
Requirements
  • Strong expertise in Azure and Databricks
  • Focus on data modeling and integration
  • Azure
  • Databricks
  • Data Modeling
  • Team Leadership
  • Client Interviews
  • SQL
Job Responsibility
  • Engage heavily with business users across North America and Europe, facilitating workshops and data discovery sessions
  • Drive consensus on business rules, data definitions, and data sources, especially where regional processes differ
  • Serve as the architectural thought leader enabling teams to transition from manual, inconsistent processes to standardized, modernized workflows
  • Partner closely with business analysts, data analysts, product owners, and engineering teams across multiple geographies
  • Architect a unified master stitched data model to replace downstream reliance on Varicent for data assembly
  • Lead the re‑architecture of compensation data processing—including internal and external compensation flows—into a scalable, cloud‑native Azure environment
  • Define patterns, frameworks, and integration strategies across Azure services (Data Factory, Databricks, Data Lake, SQL, etc.)
  • Evaluate and evolve the use of rules engines/ODM/Drools to externalize and modernize embedded business logic currently locked in application code
  • Guide decisions to shift logic and data ownership into enterprise‑owned systems rather than third‑party tools
  • Analyze current‑state processes (38 in NA, 9 in Europe) and identify opportunities for re‑engineering, automation, and consolidation

Sr Data Engineer

Resource Informatics Group, Inc. is actively seeking a skilled Senior Data Engin...
Location: United States, Irving
Salary: Not provided
Resource Informatics Group
Expiration Date: Until further notice
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related fields
  • Strong expertise in data engineering and cloud-based solutions
  • 6+ years of experience in data engineering, architecture, and implementation of large-scale data solutions
  • Proficiency in designing and implementing data models, data structures, and algorithms
  • Advanced knowledge of SQL and NoSQL databases
  • Demonstrated expertise in optimizing data pipelines and improving data reliability, efficiency, and quality
  • Excellent problem-solving capabilities with a keen attention to detail
  • Strong communication and collaboration skills, with the ability to work effectively across diverse teams
  • Relevant certifications in cloud technologies (Azure, AWS, or GCP) are advantageous
  • Master’s in Data Science or Computer Science or foreign equivalent, plus 6+ years of experience, OR Bachelor’s in Computer Science, Information Technology, or Electronics and Communication Engineering or foreign equivalent
Job Responsibility
  • Develop and execute ETL processes for data extraction, transformation, and loading into warehouses and data lakes
  • Architect data warehousing solutions using Azure Synapse Analytics for efficient querying and reporting
  • Optimize query performance, data processing speed, and resource utilization within Azure environments
  • Construct seamless data pipelines across Azure services utilizing Azure Data Factory, Databricks, and SQL Server Integration Services
  • Collaborate with stakeholders, including data scientists and analysts, to understand data requirements and deliver effective solutions
  • Manage large data volumes leveraging the Hadoop ecosystem for diverse source collection and loading
  • Design, maintain, and optimize data processing jobs using Hadoop MapReduce, Spark, and Hive, with coding in Java or Python for custom applications
  • Monitor job and cluster performance using tools like Ambari and custom monitoring scripts, scaling and maintaining Hadoop clusters and Azure data services
  • Ensure adherence to data security measures and governance standards
  • Integrate cross-cloud data with AWS and GCP services
Employment Type: Full-time

Applications Development Sr Programmer Analyst

Location: Canada, Mississauga
Salary: 94300.00 - 141500.00 USD / Year
Citi
Expiration Date: Until further notice
Requirements
  • 8+ years of overall experience in large-scale application development, with recent (mandatory) experience on a platform for the secure and scalable deployment of AI agents into application contexts
  • Minimum of 5+ years of proven experience in a Python and PySpark engineering lead role focused on building enterprise-grade, high-volume ELT/ETL processes using the PySpark and Databricks ecosystem
  • Hands-on experience with agentic AI development using YAML, JSON, FastAPI or Spring Boot, Google ADK, LLM integrations, including Devin.AI or GitHub Copilot, and integrating models via platforms like MCP using advanced prompt engineering
  • Proven experience developing and automating microservice integrations to support data-intensive applications
  • Proficiency in at least one programming language commonly used for data analytics and engineering, such as Python or Scala
  • Strong SQL skills and experience with various relational databases
  • Deep understanding of data modeling, data warehousing concepts, Data Mesh architecture, and data federation
  • Excellent communication, collaboration, and problem-solving skills
  • Bachelor's degree in Computer Science, Engineering, or a related field
Job Responsibility
  • Design, develop, and maintain scalable, enterprise-grade AI agents, supporting ELT/ETL processes to handle large data volumes using the Python, FastAPI, Microservices, PySpark, Kafka and Databricks ecosystem
  • Build and deploy GenAI agents using Google's ADK and Google Flash 2.5+ LLMs to support application automation, deep insights, and workflow support with a human-in-the-loop (HIL) architecture
  • Build and maintain data federation layers for lambda and Data Mesh architectures using tools like Starburst, with a strategy for adopting AI-based use cases (e.g., machine learning, deep learning, NLP) to drive efficiency
  • Develop, deploy, and automate microservice integrations to support data-intensive applications, ensuring scalability, resilience, and maintainability using cloud-native infrastructure and OpenShift or Kubernetes architecture, including CI/CD pipelines
  • Integrate and leverage agentic AI tools (e.g., Devin.AI, GitHub Copilot) and platforms (e.g., MCP) through advanced prompt engineering to enhance development and operational efficiency
  • Ensure data quality, integrity, and security throughout the entire data lifecycle
  • Contribute to the continuous improvement of data engineering processes, standards, and best practices within the team
  • Appropriately assess risk when business decisions are made, demonstrating consideration for the firm's reputation and safeguarding Citi, its clients, and assets by driving compliance with applicable laws, rules, and regulations. Adhere to Policy, apply sound ethical judgment, and escalate, manage, and report control issues with transparency
Employment Type: Full-time