Position Summary...

What you'll do...

About Team:

Search PTE - DevOps Team processes billions of queries for millions of products on Walmart sites and apps worldwide. Whenever a user types in a query or browses through product categories on the web or mobile, our service goes to work. We mine structured and semi-structured data from product catalogs, social web, transactions, query logs, and AI-generated signals at an unprecedented scale. We work on big data problems, cutting-edge relevance algorithms from information retrieval, machine learning, and AI-powered ranking to deliver a high-availability, low-latency service that directly impacts business metrics.

Position Summary

Being part of the Search PTE-DevOps team at Walmart provides deep insight into the full lifecycle of a product — from content acquisition to being sold on Walmart.com. As a Senior Software Engineer in DevOps & AI Platform, you must support all systems and services to ensure high availability and reliability, while embracing AI-augmented workflows to accelerate engineering velocity. You will work closely with developers, AI/ML engineers, and platform teams to support new application features, AI model deployments, and service launches. You will design, build, and operate the tools that help in developing, scaling, and monitoring cutting-edge technology — including GenAI and LLMOps pipelines. You must be able to triage complex technical issues in collaboration with engineering, NOC, NetEng, and Platform teams. If you are passionate about five 9’s reliability and excited about the intersection of AI and platform engineering, this position is for you.

We are looking for an expert in continuous integration and delivery pipelines, containerized infrastructure, and AI-assisted development practices. You will play a critical role in all search application and AI model release cycles, working closely with Engineering, QE, and DevOps.

What You'll Do:

Build, manage, and evolve QE & Release Automation frameworks, incorporating AI-assisted test generation and self-healing test capabilities
Build and support Kubernetes-based containerization in production, including GPU-backed workloads for AI/ML inference
Lead independently the investigation and resolution of high-impact search system and AI service incidents
Build, manage, and support comprehensive monitoring and observability for applications and AI model performance (drift, latency, accuracy)
Maintain and improve automation pipelines supporting application build, release, and AI model deployment cycles (CI/CD + MLOps/LLMOps)
Integrate AI coding assistants and GenAI tooling (e.g., Wibey, GitHub Copilot) into engineering workflows to accelerate development
Design and implement AI-powered observability solutions using intelligent alerting, anomaly detection, and predictive incident management
Collaborate with AI/ML teams to operationalize LLM-based features within search, including prompt pipeline management and vector search infrastructure
Drive execution and lead medium- to large-scale projects from Dev to Ops, including AI/ML platform initiatives
Analyze, design, and build frameworks using cutting-edge technology and AI tools to fulfill Operational Excellence
Lead and independently handle high-impact, critical search system and AI service incidents
Improve, optimize, and identify opportunities within the software development and AI deployment lifecycle (SDLC + MLOps)
Provide engineering and QE teams with architectural guidance on solutions, automation frameworks, and AI integration patterns
Work with product and engineering teams to review new functional and AI-driven requirements; develop comprehensive test plans and automate test cases — including AI model validation
Perform quality assurance for large-scale eCommerce backend search services and AI-powered features
Write programs and scripts to automate testing and validation of search backend services and LLM/AI inference pipelines
Expertise in WCNP, Concord, Looper, Python, Golang, and Java — with hands-on experience in AI/ML tooling, LLMOps, and GenAI platforms

What You'll Bring:

Bachelor’s or Master’s Degree in Computer Science, Engineering, or related field
5+ years of experience building scalable eCommerce applications or distributed backend services
3+ years of industry experience in application releases, CI/CD pipelines, and distributed system testing
Strong expertise in containerization and orchestration using Kubernetes (including multi-cluster and GPU-node management)
2+ years of programming experience in Python, Go, Java, and Shell scripting, with exposure to REST and gRPC API frameworks
Experience with modern CI/CD platforms (e.g., Concord, GitHub Actions, Looper) and GitOps workflows (e.g., ArgoCD, Flux)
Working knowledge of AI/ML workflows: model serving, inference optimization, or LLM deployment pipelines
Familiarity with observability stacks: OpenTelemetry, distributed tracing, log aggregation (e.g., Splunk, OpenObserve), and AI-assisted anomaly detection

Additional Preferred Qualifications

Experience with LLMOps and GenAI platforms: prompt engineering, RAG pipelines, vector databases (e.g., Pinecone, Weaviate, Elasticsearch KNN), and LLM evaluation frameworks
Hands-on experience with AI coding assistants (e.g., Wibey, GitHub Copilot) and AI-augmented DevOps tooling
Proficiency with WCNP (Walmart Cloud Native Platform) and cloud-native infrastructure on GCP or Azure
Knowledge of eBPF-based observability tools (e.g., Cilium, Pixie) and advanced networking concepts (VIP, TCP, Envoy/Istio service mesh)
Experience with GPU infrastructure management for AI workloads (CUDA, NVIDIA device plugins for Kubernetes)
Familiarity with MLflow, Kubeflow, Ray, or similar MLOps platforms for experiment tracking and model lifecycle management
Experience with performance and load testing tools (e.g., Gatling, k6, Locust) to measure server and client-side metrics
Knowledge of AI safety and responsible AI practices in production environments (guardrails, content filtering, bias monitoring)
Contributions to open-source DevOps, AI/ML, or platform engineering projects are a strong plus

Why Join Us

Work at the intersection of AI and large-scale distributed systems — one of the most impactful domains in modern engineering
Shape the future of search for hundreds of millions of Walmart customers globally
Leverage cutting-edge GenAI tooling internally developed at Walmart (Wibey, ElementAI)
Collaborate with world-class engineers on problems of enormous scale and complexity

About Walmart Global Tech

Imagine working in an environment where one line of code can make life easier for hundreds of millions of people. That’s what we do at Walmart Global Tech. We’re a team of software engineers, data scientists, cybersecurity expert's and service professionals within the world’s leading retailer who make an epic impact and are at the forefront of the next retail disruption. People are why we innovate, and people power our innovations. We are people-led and tech-empowered. We train our team in the skillsets of the future and bring in experts like you to help us grow. We have roles for those chasing their first opportunity as well as those looking for the opportunity that will define their career. Here, you can kickstart a great career in tech, gain new skills and experience for virtually every industry, or leverage your expertise to innovate at scale, impact millions and reimagine the future of retail.

Benefits:

Beyond our great compensation package, you can receive incentive awards for your performance. Other great perks include 401(k) match, stock purchase plan, paid maternity and parental leave, PTO, multiple health plans, and much more.

Equal Opportunity Employer:

Walmart, Inc. is an Equal Opportunity Employer – By Choice. We believe we are best equipped to help our associates, customers, and the communities we serve live better when we really know them. That means understanding, respecting, and valuing unique styles, experiences, identities, ideas, and opinions – while being inclusive of all people.

The above information has been designed to indicate the general nature and level of work performed in the role. It is not designed to contain or be interpreted as a comprehensive inventory of all responsibilities and qualifications required of employees assigned to this job. The full Job Description can be made available as part of the hiring process.

At Walmart, we offer competitive pay as well as performance-based bonus awards and other great benefits for a happier mind, body, and wallet. Health benefits include medical, vision and dental coverage. Financial benefits include 401(k), stock purchase and company-paid life insurance. Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty, and voting. Other benefits include short-term and long-term disability, company discounts, Military Leave Pay, adoption and surrogacy expense reimbursement, and more. You will also receive PTO and/or PPTO that can be used for vacation, sick leave, holidays, or other purposes. The amount you receive depends on your job classification and length of employment. It will meet or exceed the requirements of paid sick leave laws, where applicable. For information about PTO, see https://one.walmart.com/notices. Live Better U is a Walmart-paid education benefit program for full-time and part-time associates in Walmart and Sam's Club facilities. Programs range from high school completion to bachelor's degrees, including English Language Learning and short-form certificates. Tuition, books, and fees are completely paid for by Walmart. Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to a specific plan or program terms. For information about benefits and eligibility, see One.Walmart. The annual salary range for this position is $117,000.00 - $234,000.00 Additional compensation includes annual or quarterly performance bonuses. Additional compensation for certain positions may also include :

Stock

ㅤ

‎

Minimum Qualifications...

__Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications. __

Option 1: Bachelor's degree in computer science, computer engineering, computer information systems, software engineering, or related area and 3 years’ experience in software engineering or related area. Option 2: 5 years’ experience in software engineering or related area.

Preferred Qualifications...

Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.

Master’s degree in Computer Science, Computer Engineering, Computer Information Systems, Software Engineering, or related area and 1 year's experience in software engineering or related area., We value candidates with a background in creating inclusive digital experiences, demonstrating knowledge in implementing Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, assistive technologies, and integrating digital accessibility seamlessly. The ideal candidate would have knowledge of accessibility best practices and join us as we continue to create accessible products and services following Walmart’s accessibility standards and guidelines for supporting an inclusive culture.

Primary Location...

840 W California Ave, Sunnyvale, CA 94086-4828, United States of America

Walmart and its subsidiaries are committed to maintaining a drug-free workplace and has a no tolerance policy regarding the use of illegal drugs and alcohol on the job. This policy applies to all employees and aims to create a safe and productive work environment.

Position Summary...

What you'll do...

About Team:

Position Summary

What You'll Do:

Build, manage, and evolve QE & Release Automation frameworks, incorporating AI-assisted test generation and self-healing test capabilities
Build and support Kubernetes-based containerization in production, including GPU-backed workloads for AI/ML inference
Lead independently the investigation and resolution of high-impact search system and AI service incidents
Build, manage, and support comprehensive monitoring and observability for applications and AI model performance (drift, latency, accuracy)
Maintain and improve automation pipelines supporting application build, release, and AI model deployment cycles (CI/CD + MLOps/LLMOps)
Integrate AI coding assistants and GenAI tooling (e.g., Wibey, GitHub Copilot) into engineering workflows to accelerate development
Design and implement AI-powered observability solutions using intelligent alerting, anomaly detection, and predictive incident management
Collaborate with AI/ML teams to operationalize LLM-based features within search, including prompt pipeline management and vector search infrastructure
Drive execution and lead medium- to large-scale projects from Dev to Ops, including AI/ML platform initiatives
Analyze, design, and build frameworks using cutting-edge technology and AI tools to fulfill Operational Excellence
Lead and independently handle high-impact, critical search system and AI service incidents
Improve, optimize, and identify opportunities within the software development and AI deployment lifecycle (SDLC + MLOps)
Provide engineering and QE teams with architectural guidance on solutions, automation frameworks, and AI integration patterns
Work with product and engineering teams to review new functional and AI-driven requirements; develop comprehensive test plans and automate test cases — including AI model validation
Perform quality assurance for large-scale eCommerce backend search services and AI-powered features
Write programs and scripts to automate testing and validation of search backend services and LLM/AI inference pipelines
Expertise in WCNP, Concord, Looper, Python, Golang, and Java — with hands-on experience in AI/ML tooling, LLMOps, and GenAI platforms

What You'll Bring:

Bachelor’s or Master’s Degree in Computer Science, Engineering, or related field
5+ years of experience building scalable eCommerce applications or distributed backend services
3+ years of industry experience in application releases, CI/CD pipelines, and distributed system testing
Strong expertise in containerization and orchestration using Kubernetes (including multi-cluster and GPU-node management)
2+ years of programming experience in Python, Go, Java, and Shell scripting, with exposure to REST and gRPC API frameworks
Experience with modern CI/CD platforms (e.g., Concord, GitHub Actions, Looper) and GitOps workflows (e.g., ArgoCD, Flux)
Working knowledge of AI/ML workflows: model serving, inference optimization, or LLM deployment pipelines
Familiarity with observability stacks: OpenTelemetry, distributed tracing, log aggregation (e.g., Splunk, OpenObserve), and AI-assisted anomaly detection

Additional Preferred Qualifications

Experience with LLMOps and GenAI platforms: prompt engineering, RAG pipelines, vector databases (e.g., Pinecone, Weaviate, Elasticsearch KNN), and LLM evaluation frameworks
Hands-on experience with AI coding assistants (e.g., Wibey, GitHub Copilot) and AI-augmented DevOps tooling
Proficiency with WCNP (Walmart Cloud Native Platform) and cloud-native infrastructure on GCP or Azure
Knowledge of eBPF-based observability tools (e.g., Cilium, Pixie) and advanced networking concepts (VIP, TCP, Envoy/Istio service mesh)
Experience with GPU infrastructure management for AI workloads (CUDA, NVIDIA device plugins for Kubernetes)
Familiarity with MLflow, Kubeflow, Ray, or similar MLOps platforms for experiment tracking and model lifecycle management
Experience with performance and load testing tools (e.g., Gatling, k6, Locust) to measure server and client-side metrics
Knowledge of AI safety and responsible AI practices in production environments (guardrails, content filtering, bias monitoring)
Contributions to open-source DevOps, AI/ML, or platform engineering projects are a strong plus

Why Join Us

Work at the intersection of AI and large-scale distributed systems — one of the most impactful domains in modern engineering
Shape the future of search for hundreds of millions of Walmart customers globally
Leverage cutting-edge GenAI tooling internally developed at Walmart (Wibey, ElementAI)
Collaborate with world-class engineers on problems of enormous scale and complexity

About Walmart Global Tech

Benefits:

Equal Opportunity Employer:

Stock

ㅤ

‎

Minimum Qualifications...

__Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications. __

Preferred Qualifications...

Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.

Primary Location...

840 W California Ave, Sunnyvale, CA 94086-4828, United States of America

Senior, Software Engineer

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

Position Summary...

What you'll do...

Position Summary

Additional Preferred Qualifications

Why Join Us

Minimum Qualifications...

Preferred Qualifications...

Primary Location...

Position Summary...

What you'll do...

Position Summary

Additional Preferred Qualifications

Why Join Us

Minimum Qualifications...

Preferred Qualifications...

Primary Location...