(USA) Senior Manager, Software Engineering

Walmart · Retail · Bentonville, AR

The Senior Manager, Software Engineering leads platform-level execution of software and AI/ML systems, ensuring alignment with strategic priorities and roadmap milestones. This role oversees shared infrastructure, tooling, and frameworks to support scalable, secure development while driving full lifecycle management across multiple teams. The position champions DevOps practices, cloud cost optimization, and technical excellence to deliver high-availability systems. Collaborating with cross-functional partners, the role guides architectural decisions, mentors engineering talent, and manages complex initiatives to achieve business objectives with a focus on continuous improvement and operational efficiency.

What you'd actually do

  1. Manage a high-performing team of 10-12 engineers who work across multiple technology stacks.
  2. Drive design, development, implementation and documentation.
  3. Establish engineering and operational excellence best practices based on product, engineering and Scrum metrics.
  4. Interact with Walmart engineering teams across geographies to leverage expertise and contribute to the tech community.
  5. Engage with Product and Business stakeholders to drive the agenda, set the priorities and deliver scalable and resilient products.

Skills

Required

  • Bachelor's/Master's degree in Computer Science, Engineering, or a related field
  • 12+ years of experience in software development
  • 5+ years of experience managing engineering teams
  • managing high-performing agile technology teams
  • Hands-on experience building Java/Scala/Spark-based backend systems
  • experience working with cloud-based solutions
  • CS Fundamentals
  • Microservices
  • Data Structures
  • Algorithms & Problem Solving
  • CI/CD development environments/tools
  • Git
  • Maven
  • Jenkins
  • writing modular and testable code and test cases (unit, functional and integration)
  • JUnit
  • Mockito
  • MockMvc
  • microservices architecture
  • distributed concepts
  • common design principles
  • design patterns
  • cloud native development concepts
  • Spring Boot
  • concurrency
  • garbage collection
  • RESTful services
  • data caching services
  • ORM tools
  • Relational Database
  • complex OLAP, OLTP and SQL queries
  • NoSQL Databases
  • Cosmos DB
  • Caching technology
  • Redis
  • Memcached
  • Pub/sub systems
  • Kafka
  • monitoring and alert tools
  • Prometheus
  • Splunk
  • debugging and troubleshooting issues
  • Containerization tools
  • Docker
  • Helm
  • Kubernetes
  • public cloud platforms
  • Azure
  • GCP
  • disaster recovery principles
  • building resilient, fault-tolerant systems

Nice to have

  • RAG frameworks
  • vector search technologies
  • knowledge grounding techniques
  • Python
  • ML frameworks (PyTorch, TensorFlow)
  • Hugging Face Transformers
  • distributed computing
  • cloud platforms (GCP, Azure)
  • scalable architectures
  • multi-modal models (text, image, video)
  • advanced optimization techniques (e.g., quantization, distillation)
  • Generative AI models (e.g., GPT, Gemini)
  • fine-tuning
  • Finance/Procurement area

What the JD emphasized

  • Hands-on experience building Java/Scala/Spark-based backend systems is a must