Principal Software Engineer

ZoomInfo ZoomInfo · Enterprise · CA · Remote · 31827 Engineering - ZI Marketing

Principal Software Engineer role focused on designing and implementing core systems for a real-time marketing platform, leveraging LLMs and agent frameworks for data analysis, classification, and intelligent decision-making. The role involves building distributed systems, high-performance APIs, and data pipelines, with a focus on AI-native development and driving quality, performance, and observability.

What you'd actually do

  1. Design and build distributed systems that process, enrich, and respond to billions of behavioral events per day in real time
  2. Develop high-performance APIs and services that support advertising, identity, and intent features across the Marketing Platform
  3. Leverage machine learning and large language models (LLMs) to analyze behavioral data, classify content, extract signals, and enable intelligent decision-making
  4. Build intelligent agents using frameworks like LangGraph or MCP to reason over data and power user-facing insights
  5. Design and operate data pipelines using tools like Kafka, Kinesis, and ClickHouse to support both streaming and batch workloads

Skills

Required

  • Java
  • Go
  • Python
  • SQL
  • distributed systems
  • API design
  • data pipelines
  • Kafka
  • Kinesis
  • ClickHouse
  • LLM-powered systems
  • RAG pipelines
  • agent frameworks
  • LangGraph

Nice to have

  • ad tech
  • real-time bidding (RTB)
  • programmatic systems
  • identity resolution
  • attribution
  • behavioral analytics
  • open source contributions
  • product instincts

What the JD emphasized

  • 8+ years of backend, data, or infrastructure engineering experience, or equivalent impact and leadership
  • Proficiency in one or more core languages: Java, Go, Python
  • Solid grasp of SQL and large-scale data modeling
  • Skilled at designing and implementing LLM-powered systems such as RAG pipelines, agent frameworks (e.g., LangGraph), or intelligent workflows that reason across large datasets in real time

Other signals

  • LLM-powered systems
  • agent frameworks
  • real-time data analysis
  • distributed systems