Senior Director, Cloud Platform & Reliability Engineering

Visa Visa · Fintech · Auckland, New Zealand, New Zealand

Senior Director to lead cloud platform and reliability engineering strategy and execution, focusing on building and operating a secure, scalable, and highly reliable cloud environment. Responsibilities include defining strategy, driving cost optimization, managing 24x7 operations with SLOs/SLAs, leading engineering teams, promoting automation, ensuring service reliability, partnering with other leaders, establishing secure practices, implementing observability, and defining resilience strategies.

What you'd actually do

  1. Define and lead the cloud platform strategy, including infrastructure, reliability, scalability, and cost optimization.
  2. Drive run cost optimization strategy across infrastructure, platforms, and shared services, ensuring cost efficiency without compromising reliability, security, or release velocity.
  3. Build, lead, and mentor high-performing engineering teams across platform engineering, cloud operations, and site reliability.
  4. Own service reliability and operational excellence, including incident management, root cause analysis, and long-term remediation.
  5. Implement strong observability standards (monitoring, logging, metrics, alerting) to improve service health and decision-making.

Skills

Required

  • Cloud platform leadership (AWS, Azure, or GCP)
  • Site Reliability Engineering (SRE)
  • Infrastructure as Code (IaC)
  • CI/CD and release engineering
  • Monitoring, logging, and alerting systems
  • Linux environments
  • Scripting or programming (e.g., Python)
  • Capacity planning
  • Performance management
  • Incident response
  • Change management
  • Disaster recovery strategies

Nice to have

  • Cost optimization
  • Security best practices
  • Vendor management

What the JD emphasized

  • 15+ years of experience in engineering, infrastructure, cloud platform, or site reliability roles, with significant leadership responsibility.
  • Proven experience leading cloud platforms or cloud-based services at scale (AWS, Azure, or GCP).
  • Demonstrated experience running production systems with high availability and reliability expectations.
  • Strong ability to lead in a matrixed, cross-functional environment.
  • Excellent communication skills, including the ability to explain complex technical topics to senior leadership.