What you'd actually do

Build and lead a team of 6-8 engineers, fostering a positive culture, handling career growth and performance conversations, and proactively removing blockers

Define and drive a clear technical vision and comprehensive roadmap for our multi-tenant distributed storage systems, balancing long-term strategic infrastructure goals with immediate engineering needs

Contribute through hands-on technical work, such as leading architectural design reviews, reviewing PRs, and stepping in to guide the team through complex operational challenges

Act as the primary liaison for the Storage Layer Services SRE team, collaborating closely with other engineering leaders to ensure platform alignment and manage stakeholder expectations

Skills

Required

software engineering
distributed systems
site reliability engineering
team leadership
Kubernetes
containerization
Infrastructure as Code (IaC)
Terraform
Crossplane
Operators
stateful storage systems
database systems
durability
consistency
recovery trade-offs
technical roadmaps
communication skills

Nice to have

multi-cloud environments (AWS, GCP, or Azure)
secure, multi-tenant runtime environments

MongoDB’s Storage Layer Services (SLS) team is re-architecting the MongoDB cloud storage layer and sits at the heart of our next-generation cloud storage architecture. This relatively new team is building performant, multi-tenant distributed storage services that both enhance today’s Atlas storage stack and enable more customer workloads to run more efficiently.

As the Site Reliability Engineering Manager for SLS, you will partner with the teams building these storage services to define SLOs, shape capacity plans, and ensure the reliability, durability, and operational safety of the storage layer that underpins Atlas. You’ll help grow and lead a small, senior team of SREs as founding members of this organization, playing a crucial role in executing on a multi-year roadmap for MongoDB’s cloud storage architecture.

We are looking to speak to candidates who are based in Cork for our hybrid working model.

You may be a good fit if you

Have 10+ years of experience working on software and operating distributed systems, with 2+ years managing engineering teams
Possess a customer-focused mindset, treating internal developers as your primary users
Value efficiency in processes and operations, and have a track record of optimizing team workflows
Prefer automation over manual processes, fostering a culture of building software solutions to eliminate toil
Have deep technical familiarity with Kubernetes ecosystems, containerization technologies, and modern IaC tooling (e.g., Terraform, Crossplane, or Operators) so you can effectively guide the team's technical decisions
Have operated or supported stateful storage or database systems at scale and are comfortable with durability, consistency, and recovery trade-offs
Excel at translating complex business and engineering requirements into actionable, phased technical roadmaps
Have a high level of empathy, responsibility, ownership, and accountability
Have excellent verbal and written technical communication skills

Strong candidates may also have experience with

Leading major architectural shifts, such as moving from legacy storage stacks to new multi-tenant storage architectures, including planning and executing large-scale data and workload migrations with tight availability and durability requirements
Managing and scaling infrastructure across multi-cloud environments (AWS, GCP, or Azure)
Designing secure, multi-tenant runtime environments at scale

About MongoDB

MongoDB is built for change, empowering our customers and our people to innovate at the speed of the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt industries with software. MongoDB’s unified database platform, the most widely available, globally distributed database on the market, helps organizations modernize legacy workloads, embrace innovation, and unleash AI. Our cloud-native platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available across AWS, Google Cloud, and Microsoft Azure.

With offices worldwide and over 60,000 customers, including 75% of the Fortune 100 and AI-native startups, relying on MongoDB for their most important applications, we’re powering the next era of software.

Our compass at MongoDB is our Leadership Commitment, guiding how and why we make decisions, show up for each other, and win. It’s what makes us MongoDB.

To drive the personal growth and business impact of our employees, we’re committed to developing a supportive and enriching culture for everyone. From employee affinity groups to fertility assistance and a generous parental leave policy, we value our employees’ wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it’s like to work at MongoDB, and help us make an impact on the world!

MongoDB is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request an accommodation due to a disability, please inform your recruiter.

MongoDB is an equal opportunities employer.

Req ID: 1273396229

Manager, Site Reliability Engineering - Storage Layer Service
