Staff Software Engineer, Streaming

Attentive · Enterprise · United States · Engineering

Attentive is hiring a Staff Software Engineer to lead the evolution of its streaming platform, which powers billions of events and processes millions of messages per second. The role involves setting technical direction, driving modernization, and building primitives and tooling for product and data teams. The engineer will focus on scaling real-time systems, improving reliability and observability, and enabling new capabilities for personalization and growth. This is a high-leverage role for solving distributed systems problems and influencing platform strategy.

What you'd actually do

  1. Architect and evolve Attentive’s next-generation event streaming platform; design high-throughput, low-latency, and cost-efficient solutions that power mission-critical products and use cases across Attentive’s ecosystem.
  2. Enhance streaming developer experience: Build and refine self-serve tools for event observability, debugging, load testing, and system configuration, empowering teams to experiment independently and ship quickly.
  3. Simplify and modernize streaming architecture: Remove unnecessary abstraction layers, enable direct access for power users, and ensure the platform is flexible for both “paved path” and advanced use cases.
  4. Solve complex distributed systems challenges by building primitives for reliable stream processing, such as rate limiting, deduplication, and delayed message delivery.
  5. Champion best practices and technology selection: Stay ahead of industry advancements in event streaming, advocating for tools and approaches that balance innovation with long-term reliability.

Skills

Required

  • Java
  • distributed systems
  • event streaming
  • messaging platforms
  • Kafka
  • Flink
  • Pulsar
  • Spark
  • observability
  • developer tooling
  • Terraform
  • Helm
  • Kubernetes
  • EKS
  • AWS

Nice to have

  • Spring Boot
  • Gradle
  • DynamoDB
  • Postgres
  • Redis
  • Istio
  • Cloudflare
  • React
  • TypeScript
  • GraphQL
  • Storybook
  • Vite

What the JD emphasized

  • 10+ years of experience architecting and supporting high-throughput, distributed systems at scale—especially those involving event streaming or messaging platforms.
  • Strong expertise in the internals, tradeoffs, and operating models of distributed streaming technologies such as Kafka, Flink, Pulsar, and/or Spark.
  • Proven track record of leading major platform or architectural initiatives that span multiple teams, including modernization, migration, simplification, or adoption of new infrastructure patterns.
  • Deep experience designing systems for scale, reliability, debuggability, and efficiency, including handling high-throughput workloads and complex failure scenarios in production.
  • Strong proficiency in Java and backend systems design, with the ability to work across application, platform, and infrastructure layers.
  • Ability to debug and optimize end-to-end streaming systems, from schemas and serialization to consumer behavior, JVM performance, networking, and infrastructure bottlenecks.
  • Familiarity with resource scheduling, data locality, and how infrastructure choices impact cost and system behavior.
  • Experience with observability and developer tooling for streaming (e.g., tracing, metrics, replay).
  • Infrastructure-as-code expertise (Terraform, Helm) and comfort with Kubernetes (EKS) and cloud-native environments.
  • Demonstrated ability to influence technical strategy, communicate tradeoffs clearly, and lead through collaboration rather than authority.
  • Excitement for tackling ambiguous, high-impact platform problems in a fast-moving environment, with sound judgment about where to innovate versus standardize.