Staff Software Engineer - Production Engineering

Snowflake Snowflake · Data AI · Warsaw, Poland · Engineering

Staff Software Engineer on the Production Engineering Team at Snowflake, focusing on reliability, SLOs, incident response, and scaling distributed systems to support the company's growth and the era of the agentic enterprise.

What you'd actually do

  1. Lead the improvement of the whole lifecycle of services—from inception and design, deployment, operation, and refinement.
  2. Drive scaling systems sustainably by automation; Drive changes that improve reliability and velocity.
  3. Establish and practice low noise incident response rotations and blameless postmortems to prevent problem recurrence.
  4. Write and review code. Develop documentation and capacity plans, and debug the hardest problems on large distributed systems.
  5. Collaborate with software engineers to establish, maintain, and optimize functional and performance SLOs.

Skills

Required

  • Bachelor's degree in Computer Science, a related technical field involving software engineering, or equivalent practical experience.
  • Proficient in at least one modern programming language, preferably Golang.
  • Systematic problem-solving methods, effective communication skills.

Nice to have

  • 10+ years industry experience designing, building and supporting large scale systems in production.
  • Experience in modern observability tools and production monitoring practices.
  • Experience with capacity and load testing of the distributed applications
  • Experience with containers and container orchestration systems such as Kubernetes
  • Experience in deploying, managing, and operating scalable and fault tolerant Linux infrastructure.
  • Experience with the SLO-driven reliability management processes.
  • Hands on experience with one of more public cloud providers (AWS, Azure, or GCP)
  • Ability to spot systematic issues, define roadmaps and guide other engineers to resolve them.