Merck-logo
Merck
·
June 27, 2025
Apply Now
This job has closed.

Data Scientist and AI/ML Engineer – Generative AI and Natural Language Processing (Hybrid - NJ or MA)

USA - New Jersey - Rahway
Full-time
Hybrid
$104K/yr - $164K/yr
Entry, Mid Level
Merck is a biopharmaceutical company that offers medicines and vaccines for various diseases. The Data Scientist and AI/ML Engineer role focuses on developing and deploying NLP products to enhance drug discovery and development processes using AI/ML techniques.
Apply Now

Responsibilities

  • Helping to develop and deploy production-grade NLP products for unstructured and semi-structured data from across our company’s research and development pipeline.
  • Solving real-world problems and contributing to Artificial Intelligence and Machine Learning (AI/ML) in therapeutic research and development.
  • Focusing on the scalable deployment of ML and Generative AI approaches (such as Large Language Models, or LLMs) for surfacing insights from proprietary unstructured research data and biomedical literature.
  • Integrating structured information from the likes of knowledge graphs.
  • Building novel NLP/AI-enriched software that enables the discovery, development, and delivery of new therapeutics to patients in need.
  • Understanding real-world challenges and developing automated data solutions for them.
  • Opportunities to directly interact with users and stakeholders of your data science, ML, and AI products.
  • Evaluating, developing, testing, and deploying new techniques for natural language understanding and new DevOps and ML/LLMOps frameworks.
  • Freedom to propose projects that interest you and to collaborate cross-functionally on delivery.
  • Staying updated on the newest methods in NLP, ML, generative AI, and ML/LLMOps.
  • Sharing the approaches you implement and their impact with internal company audiences and externally.

Qualification

Required

  • High School Diploma required.
  • B.S. with focus on Computer Science, Computer Engineering, Semantic Engineering, NLP, data science, AI/ML/LLM engineering, or a related discipline preferred.
  • Minimum of 2 years of industry, internship/co-op experiences.
  • Minimum of 1 year of industry experience with Python programming, version control and collaborative software development with git, DevOps and orchestration tools including Github Actions and Apache Airflow, and at least one AI/ML framework such as Pytorch.

Preferred

  • Fluency in Python programming, version control and collaboration with git, environment management (e.g., poetry, conda, docker), standard Python packages for data exploration (e.g., pandas, numpy, matplotlib)
  • Fluency with data science and NLP approaches such as exploratory data analysis, performance metrics and benchmarks, supervised and unsupervised learning, transformers, and LLMs.
  • Fluency with standard cloud and DevOps tools, such as Infrastructure as Code (IaC) and Github Actions.
  • Experience with at least one ML framework (e.g., pytorch, tensorflow, fairseq) and with ML model deployment and operations (MLOps/LLMOps)
  • Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks such as Airflow, semantic search and retrieval frameworks (e.g., development and benchmarking of embedding models and retrieval approaches in the context of Retrieval Augmented Generation, RAG), and/or semantic knowledge frameworks (e.g. RDF triplestores, property graphs, ontology management).
  • Experience with standard operations on non-relational (e.g., Elasticsearch/Opensearch, MongoDB, Neptune), relational databases (e.g., PostgreSQL), and vector databases (e.g., pgvector, Elasticsearch dense vectors) and deployment of APIs and web applications (e.g., flask, fastAPI, django, or dash)
  • Working knowledge of NLP and/or Generative AI libraries (e.g., regular expressions, spacy, langchain) and text/document annotation tools (e.g., Prodigy, BRAT)
  • A demonstrated ability to engage cross-functional teams and stakeholders, including an eagerness to acquire a level of domain knowledge
  • Excellent communication, teamwork, didactic, and leadership skills, including skills for scientific communication (authoring scientific articles and presenting) and guidance and mentorship of junior employees and less experienced collaborators

Benefit

  • Medical, dental, vision healthcare and other insurance benefits (for employee and family)
  • Retirement benefits, including 401(k)
  • Paid holidays
  • Vacation
  • Compassionate and sick days
Merck is a biopharmaceutical company that offers medicines and vaccines for various diseases.
Glassdoor
This is some text inside of a div block.
Founded in 1891
Rahway, New Jersey, USA
10001+ employees
http://www.merck.com