Principal Software Engineer - Data

ZoomInfo · Enterprise · Waltham, MA · 936 Engineering - Data Engineering

Seeking a Principal Software Engineer (Data) to drive the direction of technographic data services, building world-class data pipelines and systems to process billions of signals and data points. This role involves designing data architectures, optimizing pipelines, and collaborating with data scientists to integrate ML models.

What you'd actually do

  1. Build and optimize big data pipelines to extract and process signals from the web, job postings, and other sources
  2. Design and implement data architectures and storage solutions to efficiently handle massive data volumes
  3. Collaborate closely with data scientists to support and integrate ML models into data workflows
  4. Continuously improve data quality, performance, and scalability of our technographic data platform
  5. Drive technical strategy and roadmap for the data processing infrastructure

Skills

Required

  • Extensive experience building and scaling big data pipelines and architectures from scratch
  • Deep expertise in big data frameworks (Hadoop, Spark)
  • JVM stack (Java, Scala)
  • Strong software engineering fundamentals
  • Ability to write efficient, high-quality code
  • Proven track record delivering results and driving projects in a fast-paced environment
  • Excellent collaboration and communication skills
  • Experience working with very large datasets and distributed systems
  • Familiarity with building data pipelines at large tech companies or data-driven organizations

Nice to have

  • Experience with entity recognition and NLP techniques

What the JD emphasized

  • Build and scale big data pipelines and architectures from scratch
  • Deep expertise in big data frameworks
  • Strong software engineering fundamentals
  • Proven track record delivering results and driving projects in a fast-paced environment