Member of Technical Staff, Data Engineering Job at Cohere

Member Of Technical Staff (Data Scientist)

Perplexity is AI for people who expect more. This role brings that same standard...

Location

United States , San Francisco

Salary:

175000.00 - 330000.00 USD / Year

Perplexity

Expiration Date

Until further notice

Requirements

6-8+ years in data science, analytics engineering, or a related role
Strong product sense
Deep SQL expertise
Pipeline experience
Enough software engineering chops to be dangerous
Genuinely excited about AI
Builder mentality
Autonomy

Job Responsibility

Accelerate the AI-native data workflow
Build AI agents that do data science
Make the warehouse AI-readable
Automate the data lifecycle
Ship AI-powered experiment analysis
Own the full lifecycle
Turn the data team into a product team

What we offer

Equity
Health
Dental
Vision
Retirement
Fitness
Commuter and dependent care accounts

Fulltime

Member of Technical Staff - Data Scientist

As a data scientist at Microsoft AI, you will be tasked with helping us to maxim...

Location

United States , Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 1+ year(s) data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
OR Master's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 3+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
OR Bachelor's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 5+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results) OR equivalent experience

Job Responsibility

Drive product insights, opportunity analysis, and track metrics to support efforts across Microsoft Copilot
Drive new ways of instrumenting and measuring impact to evaluate new feature performance through experimentation
Define metrics and build basic data pipelines to enable A/B experimentation for new features and mitigating abusive users
Hands-on analysis of large volumes of telemetry data using various algorithms and tools including your own
Articulate insights, storyboard with data and communicate to influence leadership and other key decision makers
Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
Enjoy working in a fast-paced, design-driven, product development cycle
Work collaboratively with engineers, Product Managers, and marketing to take ambiguous projects that drive user growth, engagement, and retention
Embody our Culture and Values

Fulltime

Member of Technical Staff - Data Infra - MAI Superintelligence Team

Help build the world’s most advanced multimodal dataset at Microsoft AI. We are ...

Location

United States , Mountain View

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor’s Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling or data engineering work
OR Master’s Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ year(s) experience in business analytics, data science, software development, or data engineering work
OR equivalent experience
Bachelor’s Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 8+ years experience in business analytics, data science, software development, data modeling or data engineering work
OR Master’s Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years of business analytics, data science, software development, data modeling or data engineering work experience
OR equivalent experience

Job Responsibility

Design and develop data pipelines that ingest enormous amounts of multi-modal training data (text, audio, images, video)
Own and maintain critical data infrastructures, including spark, ray, vector databases, and others
Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models
Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation
Embody our culture and values

Fulltime

Member of Technical Staff - Data Scientist

We’re looking for data scientists to help build the next generation of post-trai...

Location

Switzerland , Zürich

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Hands‑on experience with large language models, including training or applying them in production (not just prompting)
Designing and running post‑training experiments (evals, ablations, preference tuning / RLHF‑style methods)
Building and owning scalable data pipelines for training and evaluation data
Strong Python skills for ML experimentation, data processing, and analysis
Solid statistical, experimental, and general engineering fundamentals

Job Responsibility

Design evaluations of advanced model capabilities and use them to drive rapid, high-signal iteration loops
Work with vendors to produce high quality evaluation and training data
Build data pipelines to produce high quality evaluation and training data
Build data flywheels to hill-climb on model weaknesses, using data from various surfaces where our models are deployed
Ensure optimal quality, quantity and coverage of data across our post-training stages
Run post-training experiments and ablations to produce models that climb our evals
Embody our culture and values

Fulltime

Member of Technical Staff, Data Analysis and Evaluation

As a Member of Technical Staff in Data Analysis and Evaluation, you will play a ...

Location

Salary:

Not provided

Cohere

Expiration Date

Until further notice

Requirements

Extremely strong software engineering skills
Strong expertise in designing and conducting data collection tasks, including working with human annotators
Strong statistical skills and experience evaluating scientific experiments related to data collection and model performance
Experience analysing datasets with respect to their quality, biases, and suitability for training ML models
Hands-on experience training large language models (LLMs) on distributed training infrastructures
Familiarity with evaluating and improving the generalisability and robustness of ML systems
Proficiency in programming languages such as Python and ML frameworks (e.g., PyTorch, TensorFlow, JAX)
Excellent communication skills to collaborate effectively with cross-functional teams and present findings
One or more papers at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP)

Job Responsibility

Design and oversee data collection tasks, including supporting human annotators and ensuring data quality
Develop and apply statistical methods to evaluate the quality and reliability of datasets
Analyse and assess the generalisability and robustness of ML systems across diverse use cases
Collaborate with teams to improve dataset quality and model performance
Train and fine-tune large language models (LLMs) on distributed training infrastructures
Conduct experiments to evaluate model performance and identify areas for improvement

What we offer

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days!)

Fulltime

Member of Technical Staff - Data Scientist

We’re looking for data scientists to help build the next generation of post-trai...

Location

United States , Mountain View

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.

Job Responsibility

Design evaluations of advanced model capabilities and use them to drive rapid, high-signal iteration loops
Work with vendors to produce high quality evaluation and training data
Build data pipelines to produce high quality evaluation and training data
Build data flywheels to hill-climb on model weaknesses, using data from various surfaces where our models are deployed
Ensure optimal quality, quantity and coverage of data across our post-training stages
Run post-training experiments and ablations to produce models that climb our evals
Embody our culture and values.

Fulltime

Member of Technical Staff - Data Platform

If you are excited by the challenge of designing distributed systems that proces...

Location

United States , Mountain View; Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 3+ years experience in business analytics, data science, software development, data modeling, or data engineering OR Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years experience in business analytics, data science, software development, data modeling, or data engineering OR equivalent experience
Proficiency in Python, Scala, Java, or Go
Deep Distributed Systems Knowledge: Demonstrated technical understanding of massive-scale compute engines (e.g., Apache Spark, Flink, Ray, Trino, or Snowflake)
Experience architecting Lakehouse environments at scale (using Delta Lake, Iceberg, or Hudi)
Experience building internal developer platforms or "Data-as-a-Service" APIs
Strong background in streaming technologies (Kafka, Azure EventHubs, Pulsar) and stateful stream processing
Experience with container orchestration (Kubernetes) for deploying data applications
Experience enabling AI/ML workloads (Feature Stores, Vector Databases)

Job Responsibility

Core Platform Engineering: Design and build the underlying frameworks (based on Spark/Databricks) that allow internal teams to process massive datasets efficiently
Distributed Systems Architecture: Modernize our data stack by moving from batch-heavy patterns to event-driven architectures
Unstructured AI Data Pipelines: Architect high-throughput pipelines capable of processing complex, non-tabular data (documents, code repositories, chat logs) for LLM pre-training, fine-tuning and evaluations datasets
AI Feedback Loops: Engineer the high-throughput telemetry systems that capture user interactions with Copilot
Infrastructure as Code: Treat the data platform as software. Define and deploy all storage, compute, and networking resources using IaC (Bicep/Terraform)
Data Reliability Engineering: Move beyond simple "validation checks" to build automated governance and observability systems
Compute Optimization: Deep-dive into query execution plans and cluster performance. Optimize shuffle operations, partition strategies, and resource allocation

Fulltime

Member of Technical Staff - Data Infra - MAI Superintelligence Team

Help build the world’s most advanced multimodal dataset at Microsoft AI. We are ...

Location

United States , Mountain View

Salary:

163000.00 - 296400.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling, or data engineering
OR Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 8+ years experience in business analytics, data science, software development, data modeling, or data engineering
OR equivalent experience
Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 12+ years experience in business analytics, data science, software development, data modeling, or data engineering
OR Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 15+ years experience in business analytics, data science, software development, data modeling, or data engineering
OR equivalent experience
4+ years experience with data governance, data compliance and/or data security
Passionate about the role of data in large-scale AI model training
Thrive in a highly collaborative, fast-paced environment
Have a high degree of expertise and pay close attention to details

Job Responsibility

Design and develop data pipelines that ingest enormous amounts of multi-modal training data (text, audio, images, video)
Own and maintain critical data infrastructures, including spark, ray, vector databases, and others
Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models
Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation
Embody our culture and values

Fulltime

Select Country

Member of Technical Staff, Data Engineering

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?