Director of Engineering, Workflows & Runners

GitLab GitLab · Enterprise · Canada +1 · Remote · Architecture Engineering

Director of Engineering to lead teams building core primitives for Workflows and Function Runtime at GitLab. Owns strategy, architecture, and delivery of scalable, multi-tenant, cost-efficient distributed systems. Hands-on leadership role working with senior engineers on design and technical decisions, while building and scaling a high-performing engineering organization. Collaborates with AI/ML teams.

What you'd actually do

  1. Own the technical vision and roadmap for GitLab’s core Workflow and Function Runtime primitives, in partnership with Product, Architecture, and other platform teams.
  2. Lead and grow multiple engineering teams responsible for highly available, horizontally scalable, and secure distributed systems used across GitLab.
  3. Be hands-on with architecture and code: review designs, dive into incidents, and contribute to critical paths where needed.
  4. Drive operational excellence for these shared services, including SLOs, on-call, incident response, capacity planning, and resilience across regions and tenants.
  5. Optimize for cost and efficiency at scale, balancing performance, reliability, and unit economics for long-running and bursty workloads.

Skills

Required

  • Proven experience leading teams building core platform or infrastructure services (e.g., workflow engines, function runtimes, control planes, high-scale microservices, or similar distributed systems).
  • Track record as a hands-on engineering leader: you’ve meaningfully contributed to architecture and design, and you are comfortable reading/writing code to unblock or clarify.
  • Strong background in scalable, multi-tenant distributed systems, including topics such as service decomposition, data partitioning, fault tolerance, and backpressure.
  • Demonstrated success operating mission-critical services in production (SLOs, on-call, incident management, postmortems, capacity management, chaos/DR testing).
  • Experience driving cost efficiency in cloud-native environments (profiling, performance tuning, right-sizing, storage and network optimization, and thoughtful use of managed services).
  • People leadership experience managing managers and senior/staff engineers, with a record of hiring, coaching, and building inclusive, high-trust teams.
  • Ability to work effectively in a fully remote, globally distributed organization, with excellent written and asynchronous communication skills.
  • 10+ years of professional software engineering experience, including 4–6+ years in engineering leadership (multi-team or org-level scope).

Nice to have

  • Familiarity with Kubernetes, modern cloud infrastructure (AWS/GCP/Azure), and event-driven/async architectures; experience with function-as-a-service or serverless runtimes is a plus.

What the JD emphasized

  • core primitives
  • scalable, multi-tenant, and cost-efficient distributed systems
  • highly available, horizontally scalable, and secure distributed systems
  • operational excellence
  • cost and efficiency at scale
  • platform APIs and abstractions
  • senior and staff-level engineers and managers