Engineering Manager, Observability

MongoDB · Enterprise · Dublin, Ireland · PTO Office of the CTO

Engineering Manager for MongoDB's Observability team, responsible for leading a team that designs, builds, and operates complex distributed systems for monitoring customer MongoDB deployments. The role involves managing engineers, contributing to code and architecture, collaborating with product teams, and ensuring the reliability and performance of observability services handling massive data volumes.

What you'd actually do

  1. Lead and coach a team of motivated individual contributors who are eager to learn and grow
  2. Contribute to the code, design, and architecture of the systems your team develops
  3. Work with product managers, program managers, and other engineering teams to specify, prioritize and deliver new features that delight our users, internally and externally
  4. Estimate task complexity, report progress, and voice risks for projects executed by the team
  5. Work with customers and support engineers to fix issues, and join our on-call rotation

Skills

Required

  • at least 6 years of professional software development experience
  • at least 3 years of people management experience
  • led engineering teams that have built and maintained large-scale systems
  • skilled at writing large-scale, distributed backend systems in a compiled language (Go, Java, Rust, C, etc.)
  • good understanding of algorithms, data structures and their time and space complexity
  • experience with at least one major cloud provider technology (AWS, Azure, GCP)
  • eager to solve tough problems and debug tricky production outages
  • excellent communication skills
  • curious, collaborative, and motivated

Nice to have

  • experience in leading engineers in designing, building and operating complex distributed systems at scale

What the JD emphasized

  • strict SLOs on security, durability, availability and performance
  • high-cardinality observability data
  • billions of metrics time series
  • petabytes of logs
  • traces
  • events