CrawlJobs Logo

Machine Learning Platform / Backend Engineer

everseen.ai Logo

Everseen

Location Icon

Location:
Serbia; Romania , Belgrade

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

We are seeking a Machine Learning Platform/Backend Engineer to design, build, and maintain scalable infrastructure that empowers our data scientists and machine learning engineers to develop, train, benchmark, and monitor machine learning models efficiently. You will be instrumental in shaping our internal Machine Learning Platform and driving automation, reproducibility, and performance across the machine learning lifecycle.

Job Responsibility:

  • Design, build, and maintain scalable infrastructure that empowers data scientists and machine learning engineers
  • Own the design and implementation of the internal ML platform, enabling end-to-end workflow orchestration, resource management, and automation using cloud-native technologies (GCP/Azure)
  • Design and manage Kubernetes-based infrastructure for multi-tenant GPU and CPU workloads with strong isolation, quota control, and monitoring
  • Integrate and extend orchestration tools (Airflow, Kubeflow, Ray, Vertex AI, Azure ML or custom schedulers) to automate data processing, training, and deployment pipelines
  • Develop shared services for model behavior/performance tracking, data/datasets versioning, and artifact management (MLflow, DVC, or custom registries)
  • Build out documentation in relation to architecture, policies and operations runbooks
  • Share skills, knowledge, and expertise with members of the data engineering team
  • Foster a culture of collaboration and continuous learning by organizing training sessions, workshops, and knowledge-sharing sessions
  • Collaborate and drive progress with cross-functional teams to design and develop new features and functionalities
  • Ensure that the developed solutions meet project objectives and enhance user experience
  • Have influence over the technology stack and internal technical improvements, contributing to strategic decision-making
  • Based on requirements and a longer-term product and feature strategy, design and implement reusable, testable, efficient, and elegant code
  • Ensure adherence to coding standards and best practices
  • Create, maintain, and run unit tests for new and existing applications and services
  • Aim to deliver defect-free and well-tested solutions
  • Analyze and collect data from various sources such as log files, application stack traces, and thread dumps
  • Utilize data analysis to identify trends, patterns, and potential areas for improvement
  • Begin to implement changes based on data analysis
  • Create and maintain CI/CD integration using various tools
  • Automate the build, test, and deployment processes to ensure efficiency and reliability
  • Research and propose third-party software solutions to optimize system performance
  • Expand product capabilities by integrating compatible third-party solutions
  • Monitor update and tracking of third-party solutions' compatibility with Everseen stack according to internal development guidelines
  • Monitor production logs to identify and troubleshoot issues promptly
  • Ensure seamless operation and timely resolution of any anomalies to maintain system reliability
  • Responsible for creating, reviewing, and maintaining high-quality technical documentation to ensure clarity, consistency, and knowledge sharing within the development team

Requirements:

  • 4-5+ years of work experience in either ML infrastructure, MLOps, or Platform Engineering
  • Bachelors degree or equivalent focusing on the computer science field is preferred
  • Excellent communication and collaboration skills
  • Expert knowledge of Python
  • Experience with CI/CD tools (e.g., GitLab, Jenkins)
  • Hands-on experience with Kubernetes, Docker, and cloud services
  • Understanding of ML training pipelines, data lifecycle, and model serving concepts
  • Familiarity with workflow orchestration tools (e.g., Airflow, Kubeflow, Ray, Vertex AI, Azure ML)
  • A demonstrated understanding of the ML lifecycle, model versioning, and monitoring
  • Experience with ML frameworks (e.g., TensorFlow, PyTorch)
  • Experience with GPU orchestration (e.g., NVIDIA GPU Operator, MIG)
  • Experience with Infrastructure as Code (e.g., Terraform)
  • Experience with Data engineering tools (e.g., Snowflake, Databricks, BigQuery, Airbyte, Kafka)
  • Familiarity with feature stores and model registries
  • Exposure to large-scale distributed systems and performance optimisation
  • Ability to work with Linux systems, including troubleshooting skills such as log investigations, performance testing, and connectivity investigation
  • Possesses a deep understanding of technical concepts and terminology relevant to Everseen's products and services
  • Expert knowledge of advanced concepts like microservices and distributed systems
  • In-depth knowledge of Azure Kubernetes Services for container orchestration, Azure Blob Storage for data storage, and ElasticSearch for search and analytics
  • Ability to leverage cloud computing technologies and services for testing and validation purposes
  • In-depth knowledge of cloud security, scalability, and performance optimization principles
  • Excellent understanding of cloud computing technologies and services, including infrastructure as a service (IaaS), platform as a service (PaaS), and software as a service (SaaS)
  • Broad understanding of the software engineering and architecture space, including knowledge of various programming languages, frameworks, techniques, and industry trends in AI

Nice to have:

  • Interest in Learning and Growth Mindset
  • Demonstrated interest in learning and a strong desire to expand knowledge in their respective field
  • Curiosity to explore new technologies, methodologies, and best practices to enhance skills and capabilities
  • Results-oriented attitude, with a drive to achieve objectives efficiently
  • Analytical and Problem-Solving Skills
  • Possesses strong analytical and problem-solving abilities, leveraging data to inform product decisions

Additional Information:

Job Posted:
December 08, 2025

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Machine Learning Platform / Backend Engineer

Senior Machine Learning Engineering Manager, Gen AI

We're seeking a Senior Machine Learning Manager (M60) to lead a cross-functional...
Location
Location
United States
Salary
Salary:
193500.00 - 303150.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years in ML, search, or backend engineering roles, with 3+ years leading teams
  • Strong track record of shipping ML-powered or LLM-integrated user-facing products
  • Experience with RAG systems (vector search, hybrid retrieval, LLM orchestration)
  • Deep experience in either modeling (e.g., LLMs, search, NLP) or engineering (e.g., backend infra, full-stack), with the ability to lead end-to-end
  • Deep understanding of LLM ecosystems (OpenAI, Claude, Mistral, OSS), orchestration frameworks (LangChain, LlamaIndex), and vector databases (Weaviate, Pinecone, FAISS, etc.)
  • Strong product intuition and ability to translate complex tech into valuable user features
  • Familiarity with GenAI evaluation methods: hallucination detection, groundedness scoring, and human-in-the-loop feedback loops
  • Master’s or PhD in Computer Science, Machine Learning, or related field preferred—or equivalent practical experience
Job Responsibility
Job Responsibility
  • Lead the vision, design, and execution of LLM-powered AI products, leveraging advance AI modeling (e.g. SLM post-training/fine-tuning), RAG architectures and hybrid ranking system
  • Define system architecture across retrievers, rankers, orchestration layers, prompt templates, and feedback mechanisms
  • Work closely with product and design teams to ensure delightful, fast, and grounded user experiences
  • Build and manage a cross-disciplinary team including ML engineers, backend/frontend engineers, and applied scientists
  • Foster a culture of E2E ownership — empowering the team to move from prototype to production quickly and iteratively
  • Mentor individuals to grow in both technical depth and product acumen
  • Shape the technical roadmap and long-term strategy for GenAI search across Atlassian’s product suite
  • Partner with platform and infra teams to scale inference, evaluate performance, and integrate usage signals for continuous improvement
  • Champion data quality, grounding, and responsible AI practices in all deployed features
What we offer
What we offer
  • health and wellbeing resources
  • paid volunteer days
  • Fulltime
Read More
Arrow Right

Manager, Machine Learning - Community Support Engineering

The Community Support Platform (CSP) at Airbnb is a critical system that drives ...
Location
Location
United States
Salary
Salary:
204000.00 - 255000.00 USD / Year
airbnb.com Logo
Airbnb
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Expertise in various machine learning and AI methodologies, including LLMs and non-LLMs, tailored for user-facing products
  • Proven experience in leading teams that develop large-scale ML models and systems to improve online user experiences
  • Strong leadership skills with a track record of nurturing an innovative and collaborative team environment
  • Exceptional verbal and written communication abilities, with a keen eye for detail
  • Demonstrated capability to work effectively with stakeholders at all organizational levels, both internally and externally
  • Skilled in navigating and resolving ambiguous challenges through proactive and strategic approaches
  • PhD, or Master's degree in Computer Science, Mathematics, Statistics, or related technical field
  • 10+ years of experience in building and shipping AI models and products, including 2+ years of experience with LLMs
  • 5+ years managing machine learning teams that deliver large impact
  • Expert knowledge of machine learning algorithms and techniques
Job Responsibility
Job Responsibility
  • Lead and mentor a dynamic team of highly skilled applied scientists and machine learning engineers in the research, design and optimization of AI models and services
  • Develop and refine the overarching strategy for the ML and AI aspects of our community support products, focusing on scalability, quality, safety, performance, and reliability
  • Foster rapid development cycles without sacrificing quality, collaborating closely with platform, backend, and frontend engineers to engineer robust ML models and systems that enhance community support initiatives
  • Evaluate technical trade-offs in key decisions, ensuring optimal outcomes through data-backed strategies
  • Conduct thorough design and architecture reviews to continually elevate our standards of technical excellence
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Employee Travel Credits
  • Fulltime
Read More
Arrow Right

Senior Machine Learning Engineer (Health)

WHOOP is an advanced health and fitness wearable, on a mission to unlock human p...
Location
Location
United States , Boston
Salary
Salary:
150000.00 - 210000.00 USD / Year
whoop.com Logo
Whoop
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science, Data Science, Applied Mathematics, or a related field. Master’s preferred
  • 5+ years of professional experience as a Machine Learning Engineer or Software Engineer with focus on ML systems
  • Proven expertise working with time series data (wearable, physiological, or high-frequency sensor data strongly preferred)
  • Experience designing and deploying ML inference systems at scale: both real-time streaming and large-scale batch pipelines
  • Strong coding skills in Python (scientific stack) and SQL, with a track record of writing clean, production-quality code
  • Strong communication skills to collaborate across engineering, research, and product teams
  • Proven experience deploying and maintaining ML systems on cloud platforms (AWS or GCP)
  • Working familiarity with MLOps best practices: model versioning, CI/CD for ML, observability, and monitoring for inference systems
  • Ability to reason about and design for performance trade-offs (latency vs. throughput vs. cost) when building ML inference systems
  • Strong understanding of backend service development (APIs and service reliability) as it applies to serving ML models at scale
Job Responsibility
Job Responsibility
  • Create, improve, and maintain production services that provide analysis for health features in collaboration with Data Scientists and MLOps Engineers
  • Collaborate with Data Engineers to improve ML data pipelines, tooling, and validation systems that support robust model performance
  • Work alongside data scientists to translate research prototypes into production ML systems optimized for scale, latency, and cost efficiency
  • Collaborate with researchers and product teams to align model development with health insights and member impact
  • Participate in on-call rotations for data science services, ensuring uptime and performance in production environments
What we offer
What we offer
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Machine Learning Engineer II - Training

As a Machine Learning Engineer II on our Training team, you will develop algorit...
Location
Location
United States , Boston
Salary
Salary:
125000.00 - 170000.00 USD / Year
whoop.com Logo
Whoop
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Mathematics, Statistics, Computer Science, or a related field
  • 2+ years of ML engineering, applied research, or a similar role
  • 2+ years experience applying advanced mathematical and statistical techniques
  • Experience deploying and maintaining production ML systems on cloud platforms (e.g., Kubernetes, AWS, GCP)
  • Familiarity with MLOps best practices and the ability to collaborate effectively with infrastructure teams on Docker, CI/CD workflows, model versioning, and observability tools
  • Proficiency in scientific Python and SQL
  • Excellent verbal and written communication skills
Job Responsibility
Job Responsibility
  • Design, train, and optimize machine learning algorithms for movement, exercise and training applications across diverse backend platforms
  • Collaborate closely with data scientists, ML Ops and software engineering teams to ensure reliable deployment, observability, and robust integration with the WHOOP ecosystem
  • Contribute to technical roadmap development and architectural decision-making for projects that you are involved in
  • Work closely with a team of data scientists in developing algorithms that power member-facing features
  • Work with Data Engineers to improve data pipelining, tooling for machine learning, and systems for quality and validation
  • Periodically serve as the on-call data scientist to respond in real time to incidents affecting production services
What we offer
What we offer
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Senior Staff Machine Learning Engineer

Help design our AI platform and develop our next generation of machine learning ...
Location
Location
United States , San Francisco
Salary
Salary:
216500.00 - 324500.00 USD / Year
gofundme.com Logo
GoFundMe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 9+ years of hands-on experience in machine learning engineering, AI development, software engineering, or related fields
  • Experience emphasizing secure, large-scale, distributed system design, AI/ML pipeline development, and implementation
  • Extensive experience designing, developing, and operating scalable backend systems
  • Experience applying software engineering best practices such as domain-driven design, event-driven architectures, and microservices
  • Deep expertise in agentic workflows, AI evaluation solutions, prompt management, and secure AI development and testing practices
  • Strong knowledge of relational and document-based databases, data storage paradigms, and efficient RESTful API design
  • Experience establishing robust CI/CD pipelines, automated testing (unit and integration), and deployment practices
  • Strong leadership skills, including effective planning and management of complex projects, mentoring of team members, and fostering a collaborative, high-performing engineering culture
  • Excellent communicator, able to articulate complex technical concepts clearly to both technical and non-technical stakeholders
  • Bachelor's degree in Computer Science, Software Engineering, or a related technical field (preferred)
Job Responsibility
Job Responsibility
  • Design and implement AI platforms to enable scalable and secure access to LLMs from multiple model providers for diverse use cases
  • Design and implement agentic workflows, agentic tool ecosystems, and LLM prompt management solutions
  • Design, build, and optimize scalable model training, fine tuning, and inference pipelines, ensuring robust integration with production systems
  • Influence technical strategy and approach to developing embedding stores, vector databases, and other reusable assets
  • Lead initiatives to streamline ML and AI workflows, improve operational efficiency, and establish standardized procedures to achieve consistent, high-quality results across our AI systems
  • Design and develop backend services and RESTful APIs using Python and FastAPI, integrating seamlessly with ML pipelines and services
  • Take operational responsibility for team-owned services, including performance monitoring, optimization, troubleshooting, and participation in an on-call rotation
  • Collaborate with both technical and non-technical colleagues, including data and applied scientists, software engineers, product managers, and business stakeholders, to deliver reliable and scalable ML-driven products
  • Coach and mentor fellow ML engineers, promoting a culture of collaboration, continuous improvement, and engineering excellence within the team
  • Employ a diverse set of tools and platforms including Python, AWS, Databricks, Docker, Kubernetes, FastAPI, Terraform, Snowflake, Coralogix, and GitHub to build, deploy, and maintain scalable, highly available machine learning infrastructure
What we offer
What we offer
  • Competitive pay
  • Comprehensive healthcare benefits
  • Financial assistance for things like hybrid work, family planning
  • Generous parental leave
  • Flexible time-off policies
  • Mental health and wellness resources
  • Learning, development, and recognition programs
  • Fulltime
Read More
Arrow Right

Senior Product Engineer, AI Platform

We’re looking for Senior Product Engineers to join the AI Infrastructure team to...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
intercom.com Logo
Intercom
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience shipping high-quality products, preferably backend
  • Deep knowledge of a high-level programming language (for example, Python, Ruby etc.)
  • Ability to deep dive into any part of the tech stack
  • Bias towards progress over perfection
  • Excited about progress in the AI space
Job Responsibility
Job Responsibility
  • Build the systems that power Intercom’s flagship AI products
  • Work alongside our ML engineers and ML Scientists to bring proof-of-concept code to production
  • Partner with ML engineers and ML scientists to build the underlying platform
  • Contribute to all phases of software development including ideation, prototyping, and implementation and testing
  • Play an active role in the hiring, mentoring, and career development of other engineers
  • Raise the bar for technical standards, performance, reliability, and operational excellence
What we offer
What we offer
  • Competitive salary, annual bonus and equity
  • Regular compensation reviews
  • Generous paid time off above statutory minimum
  • Hybrid working
  • MacBooks are our standard, but we also offer Windows for certain roles when needed
  • Fun events for Intercomrades, friends, and family
  • Fulltime
Read More
Arrow Right

Senior Product Engineer - AI Platform

We’re looking for Senior Product Engineers to join the AI Infrastructure team to...
Location
Location
Ireland , Dublin
Salary
Salary:
Not provided
intercom.com Logo
Intercom
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience shipping high-quality products, preferably backend
  • Deep knowledge of a high-level programming language (for example, Python, Ruby etc.)
  • Strong willingness to fight for good outcomes
  • Ability to deep dive into any part of the tech stack
  • Bias towards progress over perfection
  • Excited about progress in the AI space
Job Responsibility
Job Responsibility
  • Build the systems that power Intercom’s flagship AI products
  • Work alongside our ML engineers and ML Scientists to bring proof-of-concept code to production
  • Partner with ML engineers and ML scientists to build the underlying platform
  • Contribute to all phases of software development including ideation, prototyping, and implementation and testing
  • Play an active role in the hiring, mentoring, and career development of other engineers
  • Raise the bar for technical standards, performance, reliability, and operational excellence
What we offer
What we offer
  • Competitive salary and equity in a fast-growing start-up
  • We serve lunch every weekday, plus a variety of snack foods and a fully stocked kitchen
  • Regular compensation reviews
  • Pension scheme & match up to 4%
  • Life assurance
  • Comprehensive health and dental insurance for you and your dependents
  • Flexible paid time off policy
  • Paid maternity leave
  • 6 weeks paternity leave for fathers
  • Cycle-to-Work Scheme
  • Fulltime
Read More
Arrow Right

Senior AI Product Engineer, Backend

Our Backend Engineering team builds all of the highly scalable distributed servi...
Location
Location
United States
Salary
Salary:
125000.00 - 225000.00 USD / Year
arize.com Logo
Arize
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience working with high-performance backend systems
  • Strong experience writing Go, Python, TypeScript/Node, Java, or similar server programming languages
  • Enthusiasm and interest in the AI and LLM ecosystem, with a desire to learn and stay updated on emerging technologies
  • Previous work building and operating highly complex SaaS platforms/systems
  • Knowledge of working with public clouds & container orchestration - AWS, GCP, Azure, Kubernetes, etc.
Job Responsibility
Job Responsibility
  • Write maintainable, scalable, and performant backend code primarily in Go, Java, and Python, with opportunities to work in TypeScript
  • Build high-volume and highly available analytics systems
  • Design and build APIs specific to our customers’ Machine Learning and LLM workflows
  • Prototype, optimize, and maintain scalable backend services that power the Arize core platform
  • Extend, and contribute back to, open source OLAP databases and distributed message queue frameworks
  • Develop and integrate collection tools for robust monitoring of ML and LLM pipelines
  • Research and implement cutting-edge visualization & dimensionality reduction algorithms in a distributed environment
  • Collaborate with our product, design, and directly with customer engineering teams to enhance and expand our product offerings
  • Contribute to the build our own in-house AI Agents
What we offer
What we offer
  • competitive equity package
  • comprehensive benefits package, including medical, dental, vision
  • a 401(k) plan
  • unlimited paid time off
  • a generous parental leave plan
  • additional support for mental health and wellness
  • WFH monthly stipend to pay for co-working spaces
  • Fulltime
Read More
Arrow Right