Principal / Sr. Principal Data Scientist – Machine Learning and Predictive Analytics

Northrop Grumman · Aerospace · Melbourne, FL +1 · Data Science

This role focuses on architecting and building custom, high-performance machine learning tools and data science solutions from the ground up. The work involves deep data modeling, abstracting reusable patterns, and writing modular, maintainable code. It emphasizes software engineering best practices within the data domain and tuning complex algorithms on on-premises infrastructure. The role requires a strong understanding of ML/DL, Python, SQL, and CI/CD.

What you'd actually do

  1. Collaborate closely with software engineers, analysts, and other data scientists to understand data requirements and deliver robust, integrated solutions.
  2. Architect and develop custom, maintainable data science solutions from the ground up.
  3. Write and maintain tests and documentation for data architecture, workflows, and processes.
  4. Champion software engineering best practices within the data domain, including comprehensive unit/integration testing, CI/CD automation (Git, Docker), and robust documentation.
  5. Tune and optimize complex algorithms within our on-premises infrastructure.

Skills

Required

  • Bachelor of Science degree in Data Science, Computer Science, Software Engineering, Applied Mathematics, or related field, with a minimum of 5 years of relevant experience, OR Master of Science degree in relevant fields with 3 years of relevant experience.
  • Deep understanding of machine learning and deep learning.
  • Recent professional experience in software development and/or data engineering.
  • Strong programming skills in Python and SQL.
  • Familiarity with CI/CD pipelines, version control, and containerization (e.g., Git, Docker).
  • Active, in-scope Top Secret clearance required to start.

Nice to have

  • Master’s degree or higher in Data Science or a related discipline.
  • 7 years of experience in software development.
  • Experience with C/C++.
  • Experience with machine learning pipelines and data science workflows.
  • Familiarity with SQLAlchemy, NumPy, pandas, scikit-learn, and PyTorch.
  • Ability to troubleshoot and optimize complex data queries and workflows.
  • Familiarity with big data concepts, machine learning pipelines, and MLOps.
  • Excellent problem-solving skills with the ability to troubleshoot and optimize across the entire data stack.
  • Ability to work in a fast-paced environment.

What the JD emphasized

  • Top Secret
  • Active In Scope Top Secret Clearance required to start
  • This is not a role for operating pre-built tools
  • architect and build the core data logic of our systems
  • creating custom, high-performance statistics and machine learning tools from the ground up

Other signals

  • deep data modeling
  • abstracting reusable patterns
  • writing modular, maintainable code
  • Tune and optimize complex algorithms