This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are seeking a Data Scientist to train and develop NLP/NER for LLM solutions within an agentic AI framework (LangGraph). You must be able to perform supervised and unsupervised model training and validation for automated knowledge extraction from unstructured natural language data in multiple languages without a predefined ontology. Familiarity with customer data sources and data retrieval techniques is necessary for producing preprocessed training data, which will require an understanding of techniques to ensure data quality and readiness for integration into the system. Understanding of enterprise data compliance and policy concerns are necessary to ensure solutions are built for end user access.
Job Responsibility:
Train and develop NLP/NER for LLM solutions within an agentic AI framework (LangGraph)
Perform supervised and unsupervised model training and validation for automated knowledge extraction from unstructured natural language data in multiple languages without a predefined ontology
Familiarity with customer data sources and data retrieval techniques is necessary for producing preprocessed training data, which will require an understanding of techniques to ensure data quality and readiness for integration into the system
Understanding of enterprise data compliance and policy concerns are necessary to ensure solutions are built for end user access
Requirements:
Bachelor’s Degree with 10 years of relevant experience
Associate’s Degree with 12 years of relevant experience
Bachelor’s Degree must be in Mathematics, Applied Mathematics Statistics, Applied Statistics, Machine learning, Data Science, Operations Research, or Computer Science or a degree in a related field (Computer Information Systems, Engineering), a degree in the physical/hard sciences (e.g. physics, chemistry, biology, astronomy), or other science disciplines with a substantial computational component (i.e. behavioral, social, or life) may be considered if it included a concentration of coursework (5 or more courses) in advanced Mathematics (typically 300 level or higher, such as linear algebra, probability and statistics, machine learning) and/or computer science (e.g. algorithms, programming, data structures, data mining, artificial intelligence)
Broader range of degrees will be considered if accompanied by a Certificate in Data Science from an accredited college/university
Relevant experience must be in designing/implementing machine learning, data science, advanced analytical algorithms, programming (skill in at least one high level language (e.g. Python)), statistical analysis (e.g. variability, sampling error, inference, hypothesis testing, EDA, application of linear models), data management (e.g. data cleaning and transformation), data mining, data modeling and assessment, artificial intelligence, and/or software engineering
Position requires active Security Clearance with appropriate Polygraph
Data Processing: (Data management and curation, data description and visualization, workflow and reproducibility)
Modeling, Inference, and Prediction: (Data modeling and assessment, domain-specific considerations)
Ability to make and communicate principal conclusions from data using elements of mathematics, statistics, computer science, and applications-specific knowledge
Ability to use analytic modeling, statistical analysis, programming, and/or another appropriate scientific method, develop and implement qualitative and quantitative methods for characterizing, exploring, and assessing large datasets in various states of organization, cleanliness, and structure that account for the unique feature and limitations inherent in Customer data holdings
Translate practical mission needs and analytic questions related to large datasets into technical requirements and, conversely, assist others with drawing appropriate conclusions from the analysis of such data
Effectively communicate complex technical information to non-technical audiences
What we offer:
Healthcare Coverage + Insurance: Medical: Three (3) rich healthcare options through CareFirst with 100% or majority company-paid premiums
Tax-advantaged health savings account available with generous employer contribution
Dental + Vision: 100% employer-paid for employees and family, with a buy-up option available
Paid Time Off + More: 4 weeks starting PTO – 11 federal holidays + 2 floating holidays – Paid hours for company-required training
Career Growth + Development: Access to FREE 24/7 learning via Udemy – Opportunities to participate in tech councils, industry initiatives, etc. – $7,500 annual Educational & Professional Development Assistance