Engineering Manager, Performance & Resilience (Auth0)

Okta · Enterprise · Toronto, ON · Engineering Quality-630

The Engineering Manager for Performance & Resilience at Okta (Auth0) is responsible for driving performance and resiliency improvements for the Auth0 product. The role involves building testing infrastructure, frameworks, tooling, and realistic datasets for performance testing, load testing, and chaos engineering. The manager leads a global team focused on software quality and architecture, ensuring the product remains performant at scale.

What you'd actually do

  1. Collaborate with architects, tech leads, product owners, and security and operations engineers to implement best practices for performance and resiliency
  2. Communicate and organize cross-team projects with high business impact
  3. Contribute to defining the strategic direction and roadmap for the team
  4. Own the evolution of the team's load testing framework — driving improvements in modularity, reusability, scalability, and developer experience so the framework can support the breadth of Auth0's services
  5. Lead the strategy and execution for test dataset generation, ensuring load and performance tests use realistic, representative data that accurately reflects production traffic patterns and user behaviors

Skills

Required

  • stakeholder management
  • experience managing a globally diverse team
  • designing or significantly improving load testing frameworks
  • test dataset generation strategies
  • synthetic data generation
  • data masking/anonymization of production data
  • APM tools
  • observability tools
  • distributed systems performance bottlenecks
  • software architecture across the entire stack
  • JavaScript ecosystem (NodeJS and/or TypeScript)
  • Chaos Engineering tooling
  • Cloud-based Infrastructure as a Service (especially AWS)

Nice to have

  • k6
  • Gatling
  • Locust
  • Chaos Toolkit
  • Gremlin
  • AWS FIS

What the JD emphasized

  • performance testing
  • load testing frameworks
  • dataset generation
  • performance analysis
  • monitoring tooling
  • chaos engineering