Staff Engineer, Observability

MongoDB MongoDB · Enterprise · Dublin, Ireland · PTO Office of the CTO

Staff Engineer, Observability for MongoDB platform, focusing on designing, building, and operating internal and external observability systems for metrics, visualization, logs, traces, and alerts. The role involves scaling systems to handle massive data volumes and ensuring strict SLOs on security, durability, availability, and performance.

What you'd actually do

  1. Help build and architect systems for the mission-critical observability platform, leading the architecture and implementation of components that drive performance, scalability, cost-efficiency, and resiliency
  2. Design and implement observability improvements that enable MongoDB engineers and customers to quickly and accurately diagnose the root cause of production issues.
  3. Handle production customer escalations from Technical Support team and coach teammates to do the same
  4. Write production-ready database code, improve the existing code, and mentor their team to write higher quality code
  5. Own areas of code that the Observability Team maintains, ensuring it achieves a high standard for quality (including security, durability, availability, and performance) and maintainability

Skills

Required

  • Designing, programming, debugging, and tuning distributed and/or highly concurrent C/C++/Java/Rust mission critical software systems
  • Running latency sensitive, high throughput systems
  • Strong systems fundamentals, including multi-threaded programming, performance profiling, and expert-level programming
  • Familiarity with database internals or building core components for data processing systems
  • Familiarity with observability ecosystem and best practices
  • Excellent verbal and written technical communication skills
  • Strong desire to collaborate with colleagues and mentor engineers
  • Excellent time and project management skills
  • Good understanding of information security management

Nice to have

  • Experience in setting direction and technical leadership for large engineering teams

What the JD emphasized

  • Minimum 10 years of experience in designing, programming, debugging, and tuning distributed and/or highly concurrent C/C++/Java/Rust mission critical software systems.
  • Experience running latency sensitive, high throughput systems
  • Strong systems fundamentals, including multi-threaded programming, performance profiling, and expert-level programming