Senior Fullstack Engineer - Observability Real User Monitoring (rum) | Canada | Remote

Grafana Labs Grafana Labs · Data AI · Canada · Remote · R&D: Observability

This role is for a Senior Fullstack Engineer focused on Grafana's Real User Monitoring (RUM) product. The primary responsibilities involve building and evolving fullstack features for systems that ingest, store, and query high-volume telemetry data (sessions, events, traces, logs) in real-time. This includes backend services, APIs, storage systems, query layers, and frontend visualization applications. The role emphasizes working with high-cardinality data, distributed systems, and observability concepts, with a preference for Go and TypeScript/React. While the company uses AI tools and provides access to frontier models, the core function of the role is not AI/ML development but rather building robust observability systems.

What you'd actually do

  1. Build and evolve fullstack features for RUM, spanning backend services, APIs, storage systems, and frontend user experiences.
  2. Design and implement systems that ingest, store, and query high-cardinality, high-volume telemetry data using columnar/analytical databases.
  3. Develop performant query layers and APIs that power real-time exploration of user sessions, traces, and events.
  4. Contribute to frontend applications that visualize RUM data, enabling users to debug performance issues and understand user behavior.
  5. Work on data modeling, indexing strategies, and query optimization to ensure low-latency, cost-efficient analytics at scale.

Skills

Required

  • 5+ years of fullstack engineering experience with strong backend fundamentals
  • Backend experience (Go is preferred) and frontend experience, we use TypeScript and React
  • Experience building or operating distributed systems in production (e.g., Kafka, WarpStream, ClickHouse, Cassandra, Postgres)
  • Familiarity with cloud-native systems (Docker, Kubernetes, AWS, GCP, Azure)
  • Experience working with high-throughput, high-cardinality data (logs, metrics, traces, events)
  • Strong understanding of data modeling, query optimization, and performance tradeoffs
  • Experience designing and building APIs and distributed services
  • Experience building data-heavy UIs (dashboards, query tools, debugging interfaces)
  • Familiarity with observability concepts (traces, logs, metrics) and/or OpenTelemetry
  • Strong communication skills and ability to work in a remote, distributed team
  • Pragmatic, self-driven, and comfortable navigating ambiguity
  • Customer-focused mindset with a passion for developer experience

Nice to have

  • Experience with browser or mobile instrumentation (RUM SDKs, telemetry collection).
  • Mobile development experience (iOS or Android) or familiarity with mobile performance and telemetry.
  • Contributions to OpenTelemetry or other observability OSS.
  • Experience building developer-facing platforms or observability products.
  • Familiarity with session replay, sampling strategies, or user behavior analytics systems.

What the JD emphasized

  • high-volume telemetry data
  • high-cardinality data
  • low-latency
  • cost-efficient analytics