CrawlJobs Logo

Software Development Engineer II – Machine Learning Operations

Serbia, Belgrade · Job Posted December 08, 2025
Apply Position
Job Link Share

Job Description

We are seeking a Full-Stack Engineer to be a key member of the Everseen ML Operations team. As part of that team, you will own the design and implementation of the front-end and back-end components of the Everseen internal ML platform, supporting the AI researchers requirements for dataset management and video/image annotation tools. You will be instrumental in shaping our internal Machine Learning Platform and driving automation, reproducibility, and performance across the machine learning lifecycle.

Job Responsibility

  • Design and develop new features and functionalities
  • Ensure that the developed solutions meet project objectives and enhance user experience
  • Design and implement reusable, testable, efficient, and elegant code based on requirements
  • Ensure adherence to coding standards and best practices
  • Create, maintain, and run unit tests for both new and existing applications and services
  • Aim to deliver defect-free and well-tested solutions
  • Analyze and collect data from various sources such as log files, application stack traces, and thread dumps
  • Utilize data analysis to identify trends, patterns, and potential areas for improvement
  • Create and maintain CI/CD integration using various tools
  • Automate the build, test, and deployment processes to ensure efficiency and reliability
  • Evaluate and integrate third-party software solutions to optimize system performance
  • Expand product capabilities by integrating compatible third-party solutions
  • Update and track third-party solutions' compatibility with Everseen stack according to internal development guidelines
  • Monitor production logs to identify and troubleshoot issues promptly
  • Ensure seamless operation and timely resolution of any anomalies to maintain system reliability
  • Responsible for creating, maintaining, and updating technical documentation to ensure code, systems, and processes are clearly understood and easily accessible by team members and stakeholders

Requirements

  • 2-3 years of work experience in a relevant role and global SaaS company
  • Experience in ML infrastructure, MLOps, or Platform Engineering
  • Strong programming skills, with experience in Front-End development, in React and Angular
  • Understanding ML lifecycle, model versioning, and monitoring
  • Experience with back-end frameworks on top of NodeJS ( NestJS )
  • Hands-on experience with Kubernetes, Docker, and cloud services
  • Experience with CI/CD tools (e.g., GitLab, Jenkins)
  • Excellent communication and collaboration skills
  • Experience with Infrastructure as Code (e.g., Terraform)
  • Possesses a comprehensive understanding of technical concepts and terminology relevant to Everseen's products and services
  • Ability to work with Linux systems, including troubleshooting skills such as log investigations, performance testing, and connectivity investigation
  • Knowledge of advanced concepts like microservices and distributed systems
  • Advanced knowledge of a public cloud provider services, including Kubernetes services for container orchestration, Cloud data storage, testing processes
  • Good understanding of cloud security, scalability, and performance optimization principles
  • Demonstrated interest in learning and a strong desire to expand knowledge
  • Curiosity to explore new technologies, methodologies, and best practices
  • Results-oriented attitude
  • Possesses strong analytical and problem-solving abilities, leveraging data to inform product decisions

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Software Development Engineer II – Machine Learning Operations

8 matching positions

Machine Learning Operations Engineer II

Kensho is S&P Global’s hub for AI innovation and transformation. With expertise ...
Location
Location
United States , Cambridge; New York
Salary
Salary:
130000.00 - 175000.00 USD / Year
kensho.com Logo
Kensho Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2+ years of experience in ML infra, ML Ops, ML Engineering or some similar skillset
  • Experience managing distributed systems with Kubernetes
  • Cloud Platform (AWS) understanding
  • Python proficiency
  • Familiarity with distributed computing frameworks and workflow orchestration (ie. Ray, Airflow)
  • Familiarity with software engineering best practices in an ML context
  • Some basic understanding of ML concepts, LLMs and agents
  • Ability to debug distributed systems across infrastructure, networking and application layers
  • Excellent communication skills to drive adoption of new tools and best practices across multiple teams
  • Someone who’s very curious, driven, low-ego and eager to learn across a range of engineering disciplines
Job Responsibility
Job Responsibility
  • Iterate on Kensho’s ML processes to develop tools, services, and frameworks that make every stage of the ML workflow robust, auditable, and usable
  • Work closely with ML engineers to understand their unique processes, identify pain points, and form effective solutions
  • Empower engineers with the stable tooling necessary to rapidly experiment and actualize their research into demonstrable prototypes and mature products
  • Provide resources and training for ML teams on best practices, enabling them to efficiently productionize their work to be leveraged by high-value products and services
  • Evaluate, select and champion open source and third-party solutions, driving their adoption across teams and integrating into Kensho’s existing platform ecosystem
  • Ship scalable, efficient, and automated processes for model fine-tuning and reinforcement learning and for the evaluation of LLMs/Agents
  • Improve LLM and Agentic observability to help monitor agentic applications in production, detecting performance, decay and drift issues
  • Stay at the frontier by actively tracking emerging tools and frameworks, promote best practices and strengthen the technical expertise of the team with your unique skill set
What we offer
What we offer
  • Medical, Dental, and Vision insurance
  • 100% company paid premiums
  • Unlimited Paid Time Off
  • 26 weeks of 100% paid Parental Leave (paternity and maternity)
  • 401(k) plan with 6% employer matching
  • Generous company matching on donations to non-profit charities
  • Up to $20,000 tuition assistance toward degree programs, plus up to $4,000/year for ongoing professional education such as industry conferences
  • Plentiful snacks, drinks, and regularly catered lunches
  • Dog-friendly office (CAM office)
  • Bike sharing program memberships
  • Fulltime
Read More
Arrow Right

Machine Learning Engineer II

Uber's Marketplace is at the core of the business. The Earner Incentive team in ...
Location
Location
United States , San Francisco; Sunnyvale
Salary
Salary:
171000.00 - 190000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or M.S. degree in Computer Science, Statistics, Mathematics, Machine Learning, Operations Research, or a related technical field, or equivalent practical experience
  • 2+ years of experience in software engineering with an emphasis on data-driven methodologies, deep learning, and online experimentation
  • Solid understanding of machine learning and statistical techniques, including deep learning (e.g., multi-task learning), tree-based models, and experimentation
  • Proficiency in at least one production-grade language (Python, Scala, Java, or Go) and familiarity with common ML frameworks (e.g., PyTorch, TensorFlow, or scikit-learn)
  • Solid software engineering fundamentals, including the ability to write clean, maintainable production code, conduct thorough code reviews, and implement testing best practices
  • Experience operating and monitoring ML models in a production setting, with a basic understanding of MLOps workflows
  • Strong learning mindset, proactive ownership, and effective communication skills
  • ability to collaborate effectively within cross-functional teams
Job Responsibility
Job Responsibility
  • Build, productionize, and maintain ML solutions and data pipelines for the large-scale systems that power Uber's driver incentives
  • Implement and iterate on advanced ML and optimization techniques to improve marketplace efficiency and reliability, directly impacting the earning opportunities of millions of drivers
  • Translate business requirements into actionable technical tasks and practical, production-ready code, navigating technical trade-offs to ensure system reliability
  • Develop a deep understanding of incentives, pricing, and marketplace dynamics to build systems that align with operational needs and business goals
  • Contribute to high engineering standards by participating in design and code reviews, maintaining robust testing, and ensuring the stability of production ML systems
  • Partner closely with engineers, product managers, and scientists to ensure the successful delivery of high-impact solutions to marketplace problems
  • Own technical workstreams from development through production rollout, ensuring consistent execution and measurable impact on your immediate team's goals
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • Eligible to participate in a 401(k) plan
  • Various benefits
  • Fulltime
Read More
Arrow Right

Machine Learning Engineer II

The Marketplace Signals team at Uber is responsible for building and optimizing ...
Location
Location
United States , San Francisco
Salary
Salary:
171000.00 - 190000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • B.S. in Statistics, Mathematics, Computer Science, or Machine Learning
  • 2 years of experience in software engineering with an emphasis on data-driven methodologies, deep learning, and online experimentation
  • Strong problem-solving skills, with expertise in ML methodologies
  • Experience in applying ML, statistics, or optimization techniques to solve large-scale real-world problems (e.g. ads tech, recommender systems)
  • Experience in ML frameworks (e.g. Tensorflow, Pytorch, or JAX) and complex data pipelines
  • programming languages such as Python, Spark SQL, Presto, Go, Java
Job Responsibility
Job Responsibility
  • Develop and optimize ML models to enhance key marketplace signals (e.g., ETA predictions, supply availability metrics, demand forecasts)
  • Collaborate with cross-functional teams (Pricing, Matching, Driver Incentives, etc.) to ensure marketplace signals are effectively utilized
  • Improve operational efficiency by building a centralized, scalable system for marketplace signals that serves multiple use cases
  • Leverage cutting-edge ML techniques (deep learning, probabilistic modeling, reinforcement learning, etc.) to continuously refine marketplace signals
What we offer
What we offer
  • Bonus program
  • Equity award
  • Other types of compensation
  • 401(k) plan
  • Various benefits
  • Fulltime
Read More
Arrow Right

Senior Machine Learning Engineer II

As a ML Engineer at Axon, you will contribute to architecting and implementing t...
Location
Location
United States , Seattle
Salary
Salary:
156750.00 - 250800.00 USD / Year
axon.com Logo
Axon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science, Engineering, Physics, Mathematics or an equivalent highly technical field
  • 10+ years of software engineering experience and a proven track record of successfully architecting and maintaining large-scale distributed platforms
  • Experience with AI on chips, on device model deployment and management
  • Proficiency in python, C++, familiarity with ML frameworks such as TensorFlow, or PyTorch
  • Advanced knowledge and hands-on experience with on chips development
  • Excellent problem solving skills and ability to dive into system architecture, design, performance metrics, code, test plans, project plans, deployments and operations
  • Comfort communicating and interacting with scientists, engineers and product managers
Job Responsibility
Job Responsibility
  • Architect and develop secure, privacy-preserving, on device solutions to enable the continuous improvement of existing AI models
  • Collaborate with scientists in architecting and implementing state-of-the-art edge distributed training techniques
  • Implement on device monitoring solutions used for continuous model improvement
  • Implement innovative model compression solutions to enable AI at the edge
  • Impact the team by bringing your own expertise and deep knowledge of the state-of-the-art to introduce new techniques leading to tangible impact in terms of model fairness, performance, and platform scalability
What we offer
What we offer
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Employee Resource Groups (ERGs)
  • Snacks in our offices
  • Fulltime
Read More
Arrow Right

Software Engineer Ii, Behavior Planning Ml Platform

Aurora’s mission is to deliver the benefits of self-driving technology safely, q...
Location
Location
United States , Pittsburgh
Salary
Salary:
126000.00 - 201000.00 USD / Year
aurora.tech Logo
Aurora Innovation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS or higher degree in Computer Science/Engineering or related fields. > 6 months of experience
  • Strong programming skills in C++ or Python, ideally both
  • Experience with machine learning frameworks (PyTorch or TensorFlow)
  • Solid foundation in computer science fundamentals - especially operating system concepts including concurrency, memory management and process scheduling.
Job Responsibility
Job Responsibility
  • Develop large scale pipelines for data extraction, model training and model evaluation
  • Build and optimize onboard ML infrastructure used to deploy models and run inference onboard the vehicle
  • Collaborate closely with motion planning, systems engineering, and other autonomy groups to define and develop critical ML workflow requirements.
Read More
Arrow Right

Back-End Software Engineer II

Make a difference protecting government assets! The Machine Learning Engine team...
Location
Location
United States , Redmond
Salary
Salary:
100600.00 - 199000.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Citizenship & Citizenship Verification: This role will require access to information that is controlled for export under export control regulations, potentially under the U.S. International Traffic in Arms Regulations or Export Administration Regulations, the EU Dual Use Regulation, and/or other export control regulations.  As a condition of employment, the successful candidate will be required to provide either proof of their country of citizenship or proof of their U.S. permanent residency or other protected status (e.g., under 8 U.S.C. 1324b(a)(3)) for assessment of eligibility to access the export-controlled information. To meet this legal requirement, and as a condition of employment, the successful candidate's citizenship will be verified with a valid passport. Lawful permanent residents, refugees, and asylees may verify status using other documents, where applicable
  • Citizenship & Citizenship Verification: This position requires verification of citizenship due to citizenship-based legal restrictions. Specifically, this position supports United States federal, state, and/or local government agency customers and is subject to certain citizenship-based restrictions where required or permitted by applicable law. To meet this legal requirement, and as a condition of employment, the successful candidate's US citizenship will be verified with a valid passport.
Job Responsibility
Job Responsibility
  • Leverage TDD and mocking to speed up our engineering OODA loop, and use telemetry and monitoring to speed up our customer pain points OODA loop
  • Architects and Implements software systems to solve a variety of problems
  • Works with appropriate stakeholders to determine user requirements for a set of features
  • Contributes to the identification of dependencies, and the development of design documents for a product area with little oversight
  • Creates and implements code for a product, service, or feature, reusing code as applicable
  • Contributes to efforts to break down larger work items into smaller work items and provides estimation
  • Acts as a Designated Responsible Individual (DRI) working on-call to monitor system/product feature/service for degradation, downtime, or interruptions and gains approval to restore system/product/service for simple problems
  • Curates deployment processes and scripts
  • Remains current in skills by investing time and effort into staying abreast of current developments that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale
  • Embody our culture and values
  • Fulltime
Read More
Arrow Right

Software Engineer II

We are seeking a passionate and technically skilled Software Engineer II to join...
Location
Location
United States , Redmond
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Preferred Qualifications: Bachelor’s or Master’s degree in Computer Science, Statistics, Mathematics, or related field
  • 2+ years of experience in data science, analytics, or applied machine learning
  • Proficiency in Python, SQL, and ML frameworks (e.g., Scikit-learn, TensorFlow, PyTorch)
  • Experience with cloud platforms (Azure preferred) and big data technologies
  • Understanding of statistical modeling, predictive analytics, and experimentation design
  • Excellent communication and stakeholder management skills
  • Demonstrated experience leveraging AI tools and technologies to enhance engineering effectiveness, coupled with a strong curiosity and commitment to continuous learning in the field of Artificial Intelligence
Job Responsibility
Job Responsibility
  • Design and implement advanced analytics solutions to support commerce data platform initiatives including analytics based on Machine Learning Models
  • Design skill should include scale, extensibility, performance, re-training for the ML models
  • Partner with engineering and product teams to define data requirements and ensure high-quality data pipelines
  • Conduct exploratory data analysis, feature engineering, and model evaluation using structured and unstructured datasets
  • Ensure the models built are operable, scalable, extensible and performant
  • Develop dashboards, visualizations, and storytelling artifacts to communicate insights to stakeholders
  • Lead experimentation efforts to evaluate new features, forecasting, data quality and anomaly detection systems
  • Build extensible solutions on LLM models to improve productivity of engineers across the commerce organization
What we offer
What we offer
  • Certain roles may be eligible for benefits and other compensation
  • Fulltime
Read More
Arrow Right

Staff II Software Engineer AI/ML Ops

We're looking for a Lead Data Engineer to design, build, and optimize data pipel...
Location
Location
United States , Pleasanton
Salary
Salary:
245000.00 - 307000.00 USD / Year
blackline.com Logo
BlackLine
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong programming skills in languages such as Python, Java, or Scala
  • Expertise in ML frameworks (TensorFlow, PyTorch, scikit-learn) and orchestration tools (Airflow, Kubeflow, Vertex AI, MLflow)
  • Proven experience operating production pipelines for ML and LLM-based systems across cloud ecosystems (GCP, AWS, Azure)
  • Deep familiarity with LangChain, LangGraph, ADK or similar agentic system runtime management
  • Strong competencies in CI/CD, IaC, and DevSecOps pipelines integrating testing, compliance, and deployment automation
  • Hands-on with observability stacks (Prometheus, Grafana, Newrelic) for model and agent performance tracking
  • Understanding of governance frameworks for Responsible AI, auditability, and cost metering across training and inference workloads
  • Proficiency in containerization technologies (e.g., Docker, Kubernetes)
  • Proficient in scripting languages (e.g., Bash, python) for automation
  • Experience with workflow orchestration tools (e.g., Apache Airflow)
Job Responsibility
Job Responsibility
  • Lead data pipeline development: Build and maintain PySpark ETL pipelines with high data quality and performance
  • Manage integrations: Establish robust connections to client data sources via APIs and tools like FiveTran, Plaid, and BlackLine's own internal connector ecosystem
  • Ensure reliability: Monitor pipeline performance, automate testing, and validate data accuracy
  • Optimize for scale: Implement performance improvements (e.g., CDC mechanisms, indexing strategies) for large-scale datasets
  • Collaborate & innovate: Work with business stakeholders to refine data requirements and integrate cutting-edge AI and big data technologies
  • Partner with data science, security, and product teams to set evaluation and governance standards (Guardrails, Bias, Drift, Latency SLAs)
  • Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments
  • Lead incident response and reliability strategies for ML/AI systems
  • Collaborate with development teams to integrate AI solutions into existing workflows and applications
  • Ensure seamless integration with different platforms and technologies
What we offer
What we offer
  • Short-term and long-term incentive programs
  • Robust offering of benefit and wellness plans
  • Fulltime
Read More
Arrow Right