What you'd actually do

Be a key contributor to the design and implementation of a scalable knowledge graph infrastructure focused on data standardization and interoperability, focusing on Oncology R&D data.

Apply graph-based data modeling for efficient Oncology R&D organization, integration and retrieval to ensure system flexibility and long-term maintainability.

Work with a larger community of Data Scientists, Clinical Scientists, and Discovery Scientists to standardize, curate and create AI-Ready datasets.

Curate and extend ontologies for clear mapping into established biomedical ontologies and controlled terminologies using resource description framework (RDF) standards.

Work with SPARQL/GraphQL/REST services; develop ingestion and curation pipelines to ingest, normalize and map concepts across data sources.

Skills

Required

semantic technologies
ontology
graph data modeling
SPARQL
RDF
life sciences domain
data standardization
data interoperability

Nice to have

Ph.D. or Master's degree in bioengineering, computer science, IT, bioinformatics, physics, mathematics, or related fields, emphasis on semantic technologies for biomedical application
5+ years professional experience in health informatics
large-scale knowledge graphs construction
pharmaceutical or healthcare domains integration
parser combinators
natural language processing
linked data (RDF Triple Stores and property graphs)
OWL
graph databases (Neo4j, Amazon Neptune)
complex biomedical datasets (e.g. clinical, genomics, proteomics)
SQL
key-value
column
document
graph stores
taxonomies
CI/CD implementations
git usage
CI/CD stacks (Jenkins, GitLab, Azure DevOps)
DevOps tools
metrics/monitoring
containerization technologies (Docker, Singularity)
stakeholder management
requirements gathering
business analysis
planning
manage a numerous projects simultaneously
prioritize work
organizational skills
flexibility

At Johnson & Johnson, we believe health is everything. Our strength in healthcare innovation empowers us to build a world where complex diseases are prevented, treated, and cured, where treatments are smarter and less invasive, and solutions are personal. Through our expertise in Innovative Medicine and MedTech, we are uniquely positioned to innovate across the full spectrum of healthcare solutions today to deliver the breakthroughs of tomorrow, and profoundly impact health for humanity. Learn more at jnj.com.

As guided by Our Credo, Johnson & Johnson is responsible to our employees who work with us throughout the world. We provide an inclusive work environment where each person is considered as an individual. At Johnson & Johnson, we respect the diversity and dignity of our employees and recognize their merit.

**Job Function: **

Data Analytics & Computational Sciences

**Job Sub Function: **

Data Science

Job Category:

Scientific/Technology

All Job Posting Locations:

Cambridge, Massachusetts, United States of America, Raritan, New Jersey, United States of America, San Diego, California, United States of America, Spring House, Pennsylvania, United States of America, Titusville, New Jersey, United States of America

Job Description:

Our expertise in Innovative Medicine is informed and inspired by patients, whose insights fuel our science-based advancements. Visionaries like you work on teams that save lives by developing the medicines of tomorrow.

Join us in developing treatments, finding cures, and pioneering the path from lab to life while championing patients every step of the way.

Learn more at https://www.jnj.com/innovative-medicine

Johnson & Johnson Innovative Medicine is recruiting for a **Principal Data Scientist - Oncology **to join our Data Science and Digital Health team (DSDH). This position will be located at one of our offices in either Spring House PA (preferred), Cambridge MA, or San Diego CA (La Jolla area). Consideration may be given for our Titusville and Raritan, NJ locations.

The **Principal Data Scientist - Oncology, **will play a pivotal role to standardize and connect biomedical and clinical data. You will be a hands-on technical contributor with depth in semantic technologies, ontology, and graph data modeling, and strong familiarity with the life sciences domain.

You will connect enterprise master data with R&D data across the entire product lifecycle so trusted, interoperable knowledge powers analytics, search, and AI across Johnson and Johnson Innovative Medicine.

Primary Responsibilities

Be a key contributor to the design and implementation of a scalable knowledge graph infrastructure focused on data standardization and interoperability, focusing on Oncology R&D data.
Apply graph-based data modeling for efficient Oncology R&D organization, integration and retrieval to ensure system flexibility and long-term maintainability.
Work with a larger community of Data Scientists, Clinical Scientists, and Discovery Scientists to standardize, curate and create AI-Ready datasets.
Curate and extend ontologies for clear mapping into established biomedical ontologies and controlled terminologies using resource description framework (RDF) standards.
Work with SPARQL/GraphQL/REST services; develop ingestion and curation pipelines to ingest, normalize and map concepts across data sources.
Extend and curate Oncology R&D-relevant ontologies (e.g., diseases, drugs, targets, pathways, etc.) and maintain synonyms, cross-references, and provenance.
Partner with cross-functional teams to enable NLP/RAG over graphs, features for predictive modeling and terminology services for search and study design tools.
Work with Data Science & Digital Health colleagues, IT and DevOps teams to deploy and manage the graph database infrastructure, focusing on high availability, scalability, and recovery operations specifically geared toward Oncology R&D needs and applications.
Draft and manage documentation, such as data dictionaries, data lineage, and data flow diagrams, to facilitate understanding of the knowledge graph.

Preferred Qualifications:

Desired Ph.D. or Master's degree in bioengineering, computer science, IT, bioinformatics, physics, mathematics, or related fields, emphasis on semantic technologies for biomedical application.
5+ years professional experience in health informatics.
Demonstrated experience in large-scale knowledge graphs construction, ontology development, pharmaceutical or healthcare domains integration.
Programming background in parser combinators, natural language processing, and linked data (RDF Triple Stores and property graphs).
Proficiency in semantic web technologies (e.g. SPARQL, RDF, OWL), familiarity with graph databases (Neo4j, Amazon Neptune).
Proven work with complex biomedical datasets (e.g. clinical, genomics, proteomics)
Proficiency in various data storage solutions (SQL, key-value, column, document, graph stores) and data modeling techniques (semantic data, ontologies, taxonomies).
Experience in CI/CD implementations, git usage, CI/CD stacks (Jenkins, GitLab, Azure DevOps), DevOps tools, metrics/monitoring, and containerization technologies (Docker, Singularity).
Demonstrated stakeholder management capabilities- including requirements gathering, business analysis and planning. Must have the capacity to translate discussions into user requirements and project plans.
Ability to manage a numerous projects simultaneously, prioritize work, exhibit organizational skills and flexibility to deliver maximum business value.
Willingness to conduct periodic travel (<15% of time) to conferences and internal meetings.

This position will be located at one of our offices in either Spring House PA (preferred), Cambridge MA, or San Diego CA (La Jolla area). Consideration may be given for our Titusville and Raritan, NJ locations. (No remote option.)

Johnson & Johnson is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, age, national origin, disability, protected veteran status or other characteristics protected by federal, state or local law. We actively seek qualified candidates who are protected veterans and individuals with disabilities as defined under VEVRAA and Section 503 of the Rehabilitation Act.

Johnson & Johnson is committed to providing an interview process that is inclusive of our applicants’ needs. If you are an individual with a disability and would like to request an accommodation, external applicants please contact us via https://www.jnj.com/contact-us/careers , internal employees contact AskGS to be directed to your accommodation resource.

The anticipated base pay range for this position is $117,000 to $201,250. The Company maintains highly competitive, performance-based compensation programs. Under current guidelines, this position is eligible for an annual performance bonus in accordance with the terms of the applicable plan. The annual performance bonus is a cash bonus intended to provide an incentive to achieve annual targeted results by rewarding for individual and the corporation’s performance over a calendar/performance year. Bonuses are awarded at the Company’s discretion on an individual basis. Employees and/or eligible dependents may be eligible to participate in the following Company sponsored employee benefit programs: medical, dental, vision, life insurance, short- and long-term disability, business accident insurance, and group legal insurance. Employees may be eligible to participate in the Company’s consolidated retirement plan (pension) and savings plan (401(k)).

Employees are eligible for the following time off benefits: Vacation – up to 120 hours per calendar year Sick time - up to 40 hours per calendar year Holiday pay, including Floating Holidays – up to 13 days per calendar year of Work, Personal and Family Time - up to 40 hours per calendar year Additional information can be found through the link below. https://www.careers.jnj.com/employee-benefits

The compensation and benefits information set forth in this posting applies to candidates hired in the United States. Candidates hired outside the United States will be eligible for compensation and benefits in accordance with their local market.

#LI-SL #JNJDataScience #JNJIMRND-DS #JRDDS #LI-Hyrbid

**Required Skills: **

Preferred Skills:

Advanced Analytics, Coaching, Critical Thinking, Data Analysis, Data Privacy Standards, Data Quality, Data Reporting, Data Savvy, Data Science, Data Visualization, Digital Fluency, Econometric Models, Organizing, Process Improvements, Strategic Thinking, Technical Credibility, Workflow Analysis