Software Development Engineer, ElastiCache

Amazon · Big Tech · CA, BC +1 · Software Development

Software Development Engineer role focused on the core in-memory engine of Amazon ElastiCache and MemoryDB. Responsibilities include designing and building the next-generation database engine, developing large-scale distributed in-memory storage systems, optimizing the durability layer, implementing advanced data structures and query capabilities such as vector search, and driving performance engineering. The role involves working in C/C++ at the systems level, contributing to open-source Valkey, and mentoring other engineers.

What you'd actually do

  1. Design and build the next-generation in-memory database engine, delivering sub-millisecond latencies and millions of operations per second to the world's most demanding applications.
  2. Develop large-scale distributed in-memory storage systems in C/C++, extending open-source Valkey with durability, replication, and advanced data structure capabilities.
  3. Build and optimize the durability layer (transaction logging, snapshotting, and replication protocols) that enables MemoryDB to deliver in-memory speed with Multi-AZ data protection.
  4. Design and implement advanced data structures and query capabilities, including vector search and full-text search, to support emerging workloads like generative AI.
  5. Drive performance engineering at the systems level (memory allocator tuning, I/O path optimization, and lock-free concurrency) to push throughput and latency boundaries.

Skills

Required

  • C/C++
  • systems programming
  • database internals
  • distributed systems
  • memory management
  • I/O optimization
  • replication protocols
  • performance engineering
  • designing and building engines
  • large-scale distributed systems
  • durability layer optimization
  • transaction logging
  • snapshotting
  • active-active replication
  • conflict resolution
  • vector search implementation
  • full-text search implementation
  • lock-free concurrency
  • mentoring engineers
  • technical leadership

Nice to have

  • Valkey
  • generative AI workloads

What the JD emphasized

  • core in-memory engine
  • data path
  • durability layer
  • replication protocol
  • snapshot system
  • advanced data structures
  • systems programming
  • database internals
  • performance engineering
  • every microsecond matters
  • latency-sensitive applications
  • write-ahead logs
  • copy-on-write fork semantics
  • lock-free data structures
  • memory allocator design
  • millions of operations per second
  • cloud scale
  • sub-millisecond latencies
  • massive scale
  • performance-critical workloads
  • memory management
  • I/O optimization
  • replication protocols
  • distributed data systems
  • next-generation in-memory database engine
  • large-scale distributed in-memory storage systems
  • transaction logging
  • snapshotting
  • Multi-AZ data protection
  • query capabilities
  • vector search
  • full-text search
  • generative AI
  • performance engineering at the systems level
  • memory allocator tuning
  • I/O path optimization
  • lock-free concurrency
  • throughput and latency boundaries
  • active-active replication
  • conflict resolution mechanisms
  • globally distributed
  • low-latency data access
  • technical leader
  • engineering best practices
  • individual project priorities
  • deadlines
  • deliverables
  • high degree of autonomy
  • accountability
  • deep technical work
  • collaborative engineering
  • production-quality C/C++ code
  • optimizing data structures
  • replication paths
  • maximum throughput
  • minimal latency
  • design reviews
  • architecture discussions
  • durability guarantees
  • memory efficiency
  • replication consistency
  • debug complex systems issues
  • engine level
  • crash dumps
  • memory corruption
  • profiling hot paths
  • production-scale load
  • upstream open-source Valkey contributors
  • internal partner teams
  • new capabilities
  • compatible and performant
  • emerging customer needs
  • generative AI workloads
  • globally distributed applications
  • engine capabilities
  • code reviews
  • design feedback
  • pairing sessions
  • systems programming skills
  • engineering judgment
  • operational excellence
  • on-call rotations
  • engine reliability
  • diagnostic tooling
  • hard problems
  • intersection of database internals and distributed systems
  • collaborative
  • intellectually curious
  • technical depth
  • ownership
  • bias for action
  • core engine
  • in-memory data path
  • advanced query capabilities
  • latency-sensitive workloads
  • fast-growing startups
  • largest enterprises
  • sub-millisecond performance
  • scale
  • open-source software
  • Valkey community
  • growing together
  • senior engineers
  • mentoring
  • developing engineers
  • technical excellence
  • continuous learning