Software Development Engineer II – Machine Learning Operations

Everseen

Location:
Serbia, Belgrade

Contract Type:
Not provided

Salary:

Not provided

Job Description:

We are seeking a Full-Stack Engineer to be a key member of the Everseen ML Operations team. As part of that team, you will own the design and implementation of the front-end and back-end components of the Everseen internal ML platform, supporting the AI researchers' requirements for dataset management and video/image annotation tools. You will be instrumental in shaping our internal Machine Learning Platform and driving automation, reproducibility, and performance across the machine learning lifecycle.

Job Responsibility:

  • Design and develop new features and functionalities
  • Ensure that the developed solutions meet project objectives and enhance user experience
  • Design and implement reusable, testable, efficient, and elegant code based on requirements
  • Ensure adherence to coding standards and best practices
  • Create, maintain, and run unit tests for both new and existing applications and services
  • Aim to deliver defect-free and well-tested solutions
  • Analyze and collect data from various sources such as log files, application stack traces, and thread dumps
  • Utilize data analysis to identify trends, patterns, and potential areas for improvement
  • Create and maintain CI/CD integration using various tools
  • Automate the build, test, and deployment processes to ensure efficiency and reliability
  • Evaluate and integrate third-party software solutions to optimize system performance
  • Expand product capabilities by integrating compatible third-party solutions
  • Update and track third-party solutions' compatibility with the Everseen stack according to internal development guidelines
  • Monitor production logs to identify and troubleshoot issues promptly
  • Ensure seamless operation and timely resolution of any anomalies to maintain system reliability
  • Create, maintain, and update technical documentation so that code, systems, and processes are clearly understood and easily accessible to team members and stakeholders

Requirements:

  • 2-3 years of work experience in a relevant role at a global SaaS company
  • Experience in ML infrastructure, MLOps, or Platform Engineering
  • Strong programming skills, with front-end development experience in React and Angular
  • Understanding of the ML lifecycle, model versioning, and monitoring
  • Experience with back-end frameworks on top of Node.js (NestJS)
  • Hands-on experience with Kubernetes, Docker, and cloud services
  • Experience with CI/CD tools (e.g., GitLab, Jenkins)
  • Excellent communication and collaboration skills
  • Experience with Infrastructure as Code (e.g., Terraform)
  • Comprehensive understanding of technical concepts and terminology relevant to Everseen's products and services
  • Ability to work with Linux systems, including troubleshooting skills such as log investigations, performance testing, and connectivity investigation
  • Knowledge of advanced concepts like microservices and distributed systems
  • Advanced knowledge of a public cloud provider's services, including Kubernetes services for container orchestration, cloud data storage, and testing processes
  • Good understanding of cloud security, scalability, and performance optimization principles
  • Demonstrated interest in learning and a strong desire to expand knowledge
  • Curiosity to explore new technologies, methodologies, and best practices
  • Results-oriented attitude
  • Strong analytical and problem-solving abilities, leveraging data to inform product decisions

Additional Information:

Job Posted:
December 08, 2025

Employment Type:
Fulltime
Work Type:
Hybrid work

Similar Jobs for Software Development Engineer II – Machine Learning Operations

Senior Staff Engineer, Applied AI

GEICO is seeking a Senior Staff Engineer, Applied AI to provide technical archit...
Location:
United States, Chevy Chase, MD; Palo Alto, CA
Salary:
130000.00 - 260000.00 USD / Year
Geico
Expiration Date:
Until further notice
Requirements:
  • 8 or more years of professional software engineering or applied machine learning experience
  • 2 or more years working with Generative AI or LLM-based systems in production
  • Proven track record of architecting and delivering complex AI/ML capabilities that span multiple teams and have measurable business impact
  • Deep hands-on expertise with Python and modern AI frameworks including LangChain, LangGraph, LangSmith, LlamaIndex, Hugging Face, OpenAI/Anthropic APIs, and emerging agentic frameworks
  • Demonstrated experience building and deploying production RAG (Retrieval-Augmented Generation) systems including document ingestion, chunking strategies, vector search, and context retrieval
  • Demonstrated experience designing and operating production AI systems including multi-agent architectures, intelligent automation, and workflow orchestration
  • Strong understanding of agent architectures, workflow orchestration, retrieval-augmented generation (RAG), vector databases, knowledge graphs, and semantic reasoning
  • Familiarity with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) for building interoperable AI systems
  • Experience ensuring platform scalability, cross-domain coherence, and alignment with AI platform capabilities and strategy
  • Strong expertise in distributed systems, microservices architecture, service design, performance optimization, and reliability engineering
Job Responsibility:
  • Specify architectures and system decompositions for AI/ML capabilities that involve significant integrations and cross-team collaboration across multiple product areas
  • Provide technical architecture and leadership for medium to large, complex, cross-functional AI initiatives with visibility at the tech VP level
  • Architect and lead implementation of advanced Generative AI solutions including agent-based systems, intelligent automation, document intelligence, and decision support systems that span multiple business domains
  • Design and implement sophisticated agentic workflows that orchestrate multiple AI agents, tools, APIs, reasoning steps, and business logic to automate complex enterprise processes at scale
  • Question status quo with an eye for simpler designs and more secure approaches, influencing tech VPs to set direction for multiple teams
  • Build systems and platforms that meet the highest standards for scalability, resilience, performance, availability, security, and compliance
  • Identify and scope opportunities for automating business processes using AI across multiple product areas and business domains
  • Advance the state-of-the-art in applied AI by integrating knowledge graphs, vector reasoning, retrieval architectures, and multi-agent systems to solve complex business problems
  • Drive innovation by exploring new models, frameworks, reasoning techniques, and AI architectures and applying them strategically to high-impact business challenges
  • Run rigorous experimentation programs including hypothesis definition, A/B testing, measurement frameworks, and iterative improvement across production AI systems
What we offer:
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility: we provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment Type:
Fulltime

SDET II - AI Applications

As a Software Development Engineer in Test (SDET) II, you will play a critical r...
Location:
United States, Mendon
Salary:
Not provided
Autonomous Solutions
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in Computer Science, Software Engineering, or a related field
  • 3-5 years of experience in software testing, validation engineering, machine learning engineering, or autonomous systems development
  • Strong understanding of AI behavior, model evaluation, data pipelines, and real-time system interactions
  • Hands-on experience with automated testing frameworks, simulation tools, scenario generation, or hardware-in-the-loop validation
  • Ability to design testing architectures that scale across cloud, embedded, and robotics environments
  • Experience analyzing metrics, failure cases, regression patterns, and long-tail performance challenges
  • Ability to collaborate with research, robotics, infrastructure, and product teams to define and execute complex testing plans
  • Strong programming skills in languages used for verification and automation such as Python, C++, or similar
  • Experience with CI/CD systems, version control, and structured testing workflows
  • Strong problem-solving and analytical capabilities with a focus on reliability and safety
Job Responsibility:
  • Define and own the AI-driven testing strategy for autonomy across simulation, hardware, software, and real-world validation
  • Develop automated verification pipelines that use AI, data-driven analysis, and intelligent test generation to evaluate system performance at scale
  • Design tests that expose edge cases, failure modes, rare events, and long-tail conditions critical for safe autonomous operation
  • Integrate testing workflows with model training pipelines, deployment systems, data infrastructure, and robotics platforms
  • Build metrics, dashboards, and evaluation frameworks that measure reliability, robustness, safety, and regression impacts across model updates
  • Collaborate with AI researchers, robotics engineers, software developers, and safety teams to ensure testing requirements align with system capabilities and operational constraints
  • Use simulation tools, digital twins, and scenario generation to replicate diverse operating conditions and evaluate autonomous behaviors
  • Validate AI performance in hardware-in-the-loop, software-in-the-loop, and real-world testing environments
  • Develop tools that automate labeling, anomaly detection, and performance triage to accelerate debugging and model improvement
  • Identify gaps in test coverage, implement continuous improvements in test methodologies, and maintain high verification standards
What we offer:
  • Full Benefits - 90% Medical, ESOP, 401K, Generous PTO
Employment Type:
Fulltime

SDET II - AI Applications

As a Software Development Engineer in Test (SDET) II, you will play a critical r...
Location:
United States, Lehi
Salary:
Not provided
Autonomous Solutions
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in Computer Science, Software Engineering, or a related field
  • 3-5 years of experience in software testing, validation engineering, machine learning engineering, or autonomous systems development
  • Strong understanding of AI behavior, model evaluation, data pipelines, and real-time system interactions
  • Hands-on experience with automated testing frameworks, simulation tools, scenario generation, or hardware-in-the-loop validation
  • Ability to design testing architectures that scale across cloud, embedded, and robotics environments
  • Experience analyzing metrics, failure cases, regression patterns, and long-tail performance challenges
  • Ability to collaborate with research, robotics, infrastructure, and product teams to define and execute complex testing plans
  • Strong programming skills in languages used for verification and automation such as Python, C++, or similar
  • Experience with CI/CD systems, version control, and structured testing workflows
  • Strong problem-solving and analytical capabilities with a focus on reliability and safety
Job Responsibility:
  • Define and own the AI-driven testing strategy for autonomy across simulation, hardware, software, and real-world validation
  • Develop automated verification pipelines that use AI, data-driven analysis, and intelligent test generation to evaluate system performance at scale
  • Design tests that expose edge cases, failure modes, rare events, and long-tail conditions critical for safe autonomous operation
  • Integrate testing workflows with model training pipelines, deployment systems, data infrastructure, and robotics platforms
  • Build metrics, dashboards, and evaluation frameworks that measure reliability, robustness, safety, and regression impacts across model updates
  • Collaborate with AI researchers, robotics engineers, software developers, and safety teams to ensure testing requirements align with system capabilities and operational constraints
  • Use simulation tools, digital twins, and scenario generation to replicate diverse operating conditions and evaluate autonomous behaviors
  • Validate AI performance in hardware-in-the-loop, software-in-the-loop, and real-world testing environments
  • Develop tools that automate labeling, anomaly detection, and performance triage to accelerate debugging and model improvement
  • Identify gaps in test coverage, implement continuous improvements in test methodologies, and maintain high verification standards
What we offer:
  • Full Benefits - 90% Medical, ESOP, 401K, Generous PTO
Employment Type:
Fulltime

Senior Software Engineer II, ML Platform

Dandy is building the operating system for dental offices around the world. As a...
Location:
Brazil
Salary:
Not provided
Dandy
Expiration Date:
Until further notice
Requirements:
  • 5+ years of experience as a Machine Learning Engineer or Software Engineer, ideally within a high-growth startup environment
  • Deep proficiency in building and operating ML platform components, including feature stores, model registries, distributed training infrastructure, and experiment tracking
  • Experience designing and running ML systems on cloud infrastructure, including containerization and orchestration technologies such as Docker and Kubernetes, and public cloud platforms (AWS, GCP, or Azure)
  • Expertise in large-scale data processing, with proven experience building reliable ML data pipelines to support complex model training and evaluation
  • Experience creating and maintaining automated build, test, and deployment workflows across multiple environments (e.g., Buildkite, CI/CD pipelines)
  • Strong background in observability, including implementing metrics, logging, and tracing for complex, distributed production systems
  • Ability to communicate clearly and concisely about complex architectural problems and propose iterative, pragmatic solutions
Job Responsibility:
  • Collaborate with Machine Learning Engineers to build the ML training pipelines that process massive 3D datasets, orchestrate model training, and enable continuous model improvements
  • Streamline the ML lifecycle, from data labeling and experimentation to deployment, by optimizing internal ML components and reducing technical debt
  • Develop and maintain cloud-native systems and tooling (GCP/Kubernetes) that support Dandy’s 3D dental products in a secure, well-tested, and high-performing manner
  • Write clean, maintainable code and tests that set the standard for our internal best practices
  • Partner with stakeholders across the Engineering organization to influence long-term architectural goals and maintain a high-quality bar
What we offer:
  • Healthcare
  • dental
  • mental health support
  • parental planning resources
  • retirement savings options
  • generous paid time off
Employment Type:
Fulltime

Software Engineer II

Microsoft Copilot is revolutionizing how people work and has created an unpreced...
Location:
United States, Redmond
Salary:
100600.00 - 199000.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR equivalent experience
  • 1+ years of developing for mobile platforms
  • 1+ years of experience leveraging AI or ML to build solutions to large scale problems
  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility:
  • Works with appropriate stakeholders to determine user requirements for a set of features
  • Contributes to the identification of dependencies, and the development of design documents for a product area with little oversight
  • Creates and implements code for a product, service, or feature, reusing code as applicable
  • Contributes to efforts to break down larger work items into smaller work items and provides estimation
  • Acts as a Designated Responsible Individual (DRI) working on-call to monitor service for degradation, downtime, or interruptions and takes action to restore service for simple problems
  • Remains current in skills by investing time and effort into staying abreast of current developments that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale
  • Embodies the Microsoft culture and values
Employment Type:
Fulltime

Engineering Manager II, Data & ML Systems

As an Engineering Manager on the FinTech Data & ML Systems team, you will lead a...
Location:
India, Hyderabad
Salary:
Not provided
Uber
Expiration Date:
Until further notice
Requirements:
  • 10+ years of experience, including proven experience as a Software or Data Engineering Manager leading teams that deliver large-scale data infrastructure or platform solutions
  • Deep technical expertise in distributed data systems, including data ingestion, transformation, storage, and streaming
  • Working knowledge of machine learning workflows and supporting infrastructure (e.g., feature engineering, model training, deployment, and monitoring)
  • Strong leadership, communication, and cross-functional collaboration skills — especially when partnering with analytics, data science, and product teams
  • Demonstrated ability to set vision, define roadmaps, and deliver data-driven solutions that support analytics and ML applications
  • Passion for mentoring engineers and fostering an environment of learning, innovation, and accountability
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field with 10+ years of experience
Job Responsibility:
  • Lead a high-performing team of data engineers and platform specialists in designing, implementing, and scaling data and ML solutions that power analytics, decision-making, and automation across FinTech
  • Drive the architecture and delivery of robust data pipelines, feature stores, and data platforms that enable machine learning and advanced analytics use cases
  • Collaborate closely with product managers, data scientists, and ML engineers to define and deliver reliable data and model workflows that support critical FinTech applications
  • Provide technical leadership in data architecture, ETL design, model training pipelines, and productionization of ML workflows
  • Identify opportunities to use data and ML to solve key business challenges, improve efficiency, and unlock new capabilities across payments, compliance, and financial systems
  • Promote a culture of technical excellence, encouraging best practices in system design, testing, observability, and maintainability across both data and ML domains
  • Mentor and develop engineers, fostering a collaborative, inclusive, and high-performance culture where teams can experiment, learn, and grow
  • Ensure reliability and scalability of FinTech data and ML systems through strong engineering discipline and well-defined operational practices

Software Engineer II

The Microsoft AI – Bing Places Data team is seeking a passionate engineer to acc...
Location:
United States, Redmond
Salary:
100600.00 - 199000.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, or Python, OR equivalent experience
  • 2+ years of hands-on software design and coding experience
  • 2+ years of strong background in software development and system design
  • 2+ years of strong problem-solving design, implementation, and communication skills
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check
Job Responsibility:
  • Understand User Requirements – Collaborate with product managers and technical leads to clarify requirements and incorporate continuous feedback loops
  • Design and Architecture – Contribute to solution architecture, create design documents, and ensure alignment with security, privacy, and compliance requirements
  • Coding and Code Quality – Implement maintainable, extensible code and participate in reviews that uphold Microsoft engineering standards
  • Testing and Automation – Develop and refine test plans, integrate automation, and ensure robust test coverage for backend services
  • Engineering Excellence – Integrate telemetry, improve tools and processes, and maintain compliance with reliability, accessibility, and regulatory expectations
  • Design secure, reliable, and sustainable software that is resilient to change
  • Develop, validate, release, and derive insights from software that enhances our customers' experiences
  • Champion customer-first design that leads to increased loyalty to Copilot, Search, and our partners across various organizations
  • Demonstrate partnership, accountability, clarity in communication and technical depth while delivering results
  • Apply critical analyses to data to validate hypotheses and inform decisions
Employment Type:
Fulltime

Software Engineer II

Imagine helping millions of users discover the best local businesses and service...
Location:
United States, Bellevue
Salary:
100600.00 - 199000.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR equivalent experience
  • 2+ years of experience in problem solving, design, coding, and debugging
  • 1+ years of experience with data engineering, leveraging tools such as Apache Hadoop or Spark, or equivalent experience
Job Responsibility:
  • Works with appropriate stakeholders to determine user requirements for a set of features
  • Contributes to the identification of dependencies, and the development of design documents for a product area with little oversight
  • Creates and implements code for a product, service, or feature, reusing code as applicable
  • Contributes to efforts to break down larger work items into smaller work items and provides estimation
  • Acts as a Designated Responsible Individual (DRI) working on-call to monitor system/product feature/service for degradation, downtime, or interruptions and gains approval to restore system/product/service for simple problems
  • Remains current in skills by investing time and effort into staying abreast of current developments that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale
Employment Type:
Fulltime