Senior, Software Engineer - Sre

Walmart Walmart · Retail · Bentonville, AR

This role is for a Senior Software Engineer focused on Site Reliability Engineering (SRE) within Walmart's Sam's Club division. The primary responsibilities include supporting SRE functions with development experience, particularly in Java and cloud environments (AWS, GCP, Azure). The role involves designing and implementing fault-tolerant architectures, automating operational tasks, setting reliability standards, and partnering on platform operations to ensure service security, compliance, and resilience. Key activities include driving automation, building observability dashboards, reducing incidents, and being available for critical production issues during off-hours and weekends. Experience with Object-Oriented Programming, Java libraries, MVC patterns, JDBC, and RESTful web services is expected.

What you'd actually do

  1. SRE support with Development exp and Cloud engg knowledge.
  2. Design and implement fault‑tolerant architectures and graceful‑degradation patterns; influence code and deployment practices to raise reliability across teams.
  3. Automate toil via scripts, pipelines, and self‑healing runbooks to reduce repetitive manual work and improve MTTR
  4. Set and enforce reliability standards (golden paths, readiness checks, deployment criteria) for In‑Club workloads; guide teams through launch and capacity reviews.
  5. Drive automation, Build observability dashboard, Drive Incident reduction and work on any critical issues during any (P1/ P2)

Skills

Required

  • Java
  • Cloud (AWS, GCP, Azure)
  • SRE
  • Object-Oriented Programming (OOP) Patterns and Concepts
  • RESTful web services
  • Automation
  • Observability dashboard
  • Incident reduction

Nice to have

  • Service-oriented architecture
  • MVC (Model-View-Controller) Pattern
  • JDBC (Java Database Connectivity)
  • web application frameworks

What the JD emphasized

  • SRE support with Development exp and Cloud engg knowledge
  • Availability during weekend and off hours
  • Availability in Off hours and weekend (on call) for support for critical Production issues