Senior Software Engineer, Storage Platform

Robinhood Robinhood · Fintech · Bellevue, WA +2 · ENG Infrastructure

Robinhood is seeking a Senior Software Engineer for their Storage Platform team. This role involves building and operating the platform that powers database access across Robinhood, focusing on relational, key-value, and caching systems. The engineer will improve reliability, performance, and operational efficiency through automation and by developing key components of data and control plane systems. The role requires strong experience in backend/infrastructure systems, PostgreSQL, and cloud technologies like AWS and Kubernetes.

What you'd actually do

  1. Build and ship services that improve database reliability and performance, including connection pooling, query routing, and database access patterns
  2. Implement automation that reduces manual work for operating databases and caching clusters at scale (provisioning, configuration changes, backups, and routine operations)
  3. Improve observability for storage systems by adding metrics, logs, and alerts that help detect issues early and speed up incident response
  4. Diagnose and resolve production issues across storage infrastructure, including latency regressions, capacity constraints, and availability events
  5. Contribute to engineering standards for secure database connectivity (for example, encryption-in-transit) and safe usage guardrails

Skills

Required

  • Strong experience building and operating backend or infrastructure systems in production environments
  • Solid knowledge of PostgreSQL and relational database concepts (schema design, indexing, query performance, replication fundamentals)
  • Proficiency in at least one backend language such as Go or Rust, with the ability to learn existing codebases quickly
  • Experience with cloud infrastructure and containers (AWS and Kubernetes preferred), plus hands-on operational skills (on-call, incident response, and post-incident follow-ups)
  • Familiarity with system observability practices and performance measurement (metrics, dashboards, logs, and tracing)

What the JD emphasized

  • Availability is our highest priority
  • meet strict uptime targets
  • no downtime during market hours
  • operational excellence
  • on-call
  • incident response
  • post-incident follow-ups