Senior Software Engineer - Internal Observability

Snowflake Snowflake · Data AI · CA-Menlo Park, United States · Engineering

Snowflake is seeking a Senior Software Engineer to build AI-powered observability systems for their global data platform. This role involves designing and implementing large-scale telemetry pipelines and leveraging machine learning for anomaly detection, root cause analysis, and predictive insights to create an intelligent, autonomous system. The engineer will also focus on optimizing these systems for scale, latency, and cost efficiency, and mentor other engineers.

What you'd actually do

  1. Design and build large scale telemetry pipelines that ingest, process, and analyze metrics, logs, and traces across Snowflake’s multi cloud platform
  2. Architect AI driven observability systems that leverage machine learning for anomaly detection, root cause analysis, and predictive insights
  3. Partner with Snowflake teams to embed observability deeply into all layers of the platform
  4. Define and drive standards for instrumentation, tracing, and telemetry across services
  5. Build tools and platforms that empower engineers with deep visibility into system behavior and performance

Skills

Required

  • 7+ years of experience in software engineering with a strong focus on distributed systems
  • Deep experience building and operating large scale cloud services
  • Strong programming skills in languages such as Java, Scala, C++, or Python
  • Solid understanding of system performance, debugging, and reliability engineering principles
  • Experience with cloud platforms such as AWS, Azure, or GCP
  • Proven ability to lead complex technical projects and influence architecture decisions
  • Strong problem solving skills and ability to work in a fast paced environment

Nice to have

  • Experience designing telemetry systems including metrics, logging, and distributed tracing

What the JD emphasized

  • AI powered observability
  • AI driven observability systems
  • machine learning for anomaly detection
  • predictive insights
  • intelligent, autonomous system

Other signals

  • AI-powered observability
  • machine learning for anomaly detection
  • predictive insights
  • intelligent, autonomous system