Manager, Engineering

Smartsheet Smartsheet · Seattle · United States · Engineering - Developers

Manager, Engineering role at Smartsheet focused on leading the development and operation of mission-critical, tier 0 service infrastructure and the foundational data platform. The role emphasizes high availability, low latency, and high throughput for millions of customers, requiring strong distributed systems expertise and team leadership. While the role champions AI adoption, its core focus is on the underlying platform infrastructure.

What you'd actually do

  1. Manage one or more related teams of 6–10+ software engineers, driving development of tier 0 grid services and platform infrastructure that millions of customers depend on daily.
  2. Own and uphold 99.999% service availability targets across critical platform services, embedding reliability engineering, incident management, and on-call rigor into team culture.
  3. Help architect and guide technical vision to evolve low-latency, high-throughput service platforms capable of sustaining millions requests per day with predictable, consistent performance under load.
  4. Guide and mentor engineers on distributed systems architecture, scalability patterns, and platform best practices while leading design reviews for high-stakes system components.
  5. Lead cross-functional technical efforts spanning multiple upstream dependents, partner teams, and internal consumers, building alignment and resolving systemic dependencies across the organization.

Skills

Required

  • software development
  • CS fundamentals (algorithms, data structures, distributed systems, systems design)
  • engineering management
  • leading high-availability infrastructure/platform services
  • owning and operating tier 0/mission-critical systems
  • building/operating low-latency, high-throughput service platforms
  • scalability, fault tolerance, and operational patterns for five-nines availability
  • managing complex dependencies across distributed architectures
  • distributed systems, concurrency, caching, data consistency, performance optimization
  • building and scaling large-scale, cloud-native backend services (AWS or equivalent)
  • observability: metrics, distributed tracing, alerting, and SLO/SLA management at scale
  • productionizing services for multiple internal/external platform consumers
  • building and scaling engineering teams (recruiting, onboarding, retaining senior/staff engineers)
  • Agile/Scrum and iterative delivery on long-horizon infrastructure projects
  • balancing engineering excellence with operational pragmatism
  • communicates clearly and consistently with technical and non-technical audiences, including executive leadership
  • proactively surfaces risks, blockers, and status updates without prompting
  • provides direct, constructive feedback and supports engineer growth at all levels
  • CS/Engineering degree or equivalent practical experience
  • legally eligible to work in the US on an ongoing basis

Nice to have

  • building/managing full-stack teams
  • Exposure to AI/ML platform integration or AI-powered backend features

What the JD emphasized

  • 99.999% availability
  • low-latency, high-throughput
  • five-nines (99.999%) availability
  • low-latency, high-throughput service platforms