Software Engineer Iii– AI Systems at Walmart

Position Summary...

We’re seeking a Software Engineer to design and build AI-first systems with a focus on agentic AI, high performance data/compute frameworks, and scalable, production-grade services. You’ll work across model-driven features and platform layers—integrating LLMs/agents, orchestrating pipelines with Ray, accelerating data science workloads with RAPIDS, and delivering robust APIs and services that power high-impact AI applications at scale. The ideal candidate blends strong software engineering fundamentals with practical ML systems exposure and a passion for performance, reliability, and developer experience.

What you'll do...

Key Responsibilities AI Systems & Agentic Workflows

Build agentic AI services (planning, tool use, retrieval, feedback loops) and integrate them with internal systems and APIs.
Implement orchestration, memory, tooling, evaluation, and guardrails for agentic workflows.
Collaborate with DS/MLE partners to productionize models (LLMs, GNNs, embedding services) behind stable APIs and SDKs.

Accelerated Compute & Data Pipelines

Develop GPU‑accelerated pipelines using RAPIDS (cuDF/cuML/cuGraph) and optimize end‑to‑end performance.
Use Ray (or similar) for distributed compute, batch/stream processing, and scalable workflow orchestration.
Profile and optimize bottlenecks across CPU/GPU, memory, and I/O layers; implement caching, vectorization, and async patterns.

Service & Platform Engineering

Design and maintain reliable microservices for training/inference, vector indexing, and real-time decisioning.
Implement observability (tracing/metrics/logging), fault tolerance, auto-scaling, and cost-aware execution.
Create internal SDKs/CLIs to streamline developer workflows, testing, and reproducibility.

Quality, Security & MLOps Integration

Establish CI/CD for AI services (unit/integration/e2e tests, canaries, blue/green, rollback).
Integrate with feature stores, vector databases, artifact registries, and model catalogs.
Enforce security, privacy, and compliance (data minimization, PII handling, governance, auditability).

Collaboration & Influence

Partner with product, platform, and DS/MLE teams to align requirements, SLAs, and success metrics.
Document systems thoroughly; contribute to design reviews and engineering best practices.
Mentor peers on AI systems patterns, distributed compute, and performance engineering.

Minimum Qualifications

Bachelor’s/Master’s in CS, Engineering, or equivalent industry experience.
4+ years building production backend or platform services (preferably in AI/ML contexts).
Proficiency in:
Languages: Python (primary), plus one of Go/Java/C++ for performance services.
Distributed frameworks: Ray, Spark, or Dask.
Accelerated compute: RAPIDS (cuDF/cuML/cuGraph) and GPU-aware programming concepts (streams, memory).
Service frameworks: FastAPI/Flask (Python), K8s (Kubernetes) and containerization (Docker).
Strong foundations in data structures/algorithms, concurrency, networking, and systems design.

Preferred Qualifications

Production experience with agent frameworks (e.g., LangGraph-style planners, tool-use patterns, retrieval and memory components).
Experience with vector databases (e.g., FAISS, Milvus, pgvector, Pinecone) and feature stores.
Familiarity with LLM and embedding services, prompt/tooling patterns, and evaluation harnesses.
Hands-on with Kubernetes, autoscaling (HPA/KEDA), and GPU scheduling/operators.
Performance profiling: PyTorch profiler, Nsight, line-profiler, Ray dashboard.
Experience with vLLM, Triton Inference Server, ONNX Runtime, or TensorRT for high‑throughput inference.

Soft Skills & Leadership

Pragmatic problem solver with a bias for measurable outcomes (latency, throughput, reliability).
Excellent communicator able to translate between research goals and production constraints.
Drives clarity in ambiguous problem spaces; mentors others and uplifts engineering standards.

About Walmart Global Tech Imagine working in an environment where one line of code can make life easier for hundreds of millions of people. That’s what we do at Walmart Global Tech. We’re a team of software engineers, data scientists, cybersecurity expert's and service professionals within the world’s leading retailer who make an epic impact and are at the forefront of the next retail disruption. People are why we innovate, and people power our innovations. We are people-led and tech-empowered. We train our team in the skillsets of the future and bring in experts like you to help us grow. We have roles for those chasing their first opportunity as well as those looking for the opportunity that will define their career. Here, you can kickstart a great career in tech, gain new skills and experience for virtually every industry, or leverage your expertise to innovate at scale, impact millions and reimagine the future of retail. Walmart’s culture is a competitive advantage, and it’s fostered by being together. Working together in person allows us to collaborate, align quickly and innovate with greater speed. We use our campuses to create purposeful connection rooted in deepening understanding and investing in the development of our associates. Our hubs: Walmart is a global company with offices across the United States and around the world. Our global headquarters is in Bentonville, Arkansas, with primary hubs in the San Francisco Bay area and New York/New Jersey.

Benefits: Benefits: Beyond our great compensation package, you can receive incentive awards for your performance. Other great perks include 401(k) match, stock purchase plan, paid maternity and parental leave, PTO, multiple health plans, and much more.

Equal Opportunity Employer: Walmart, Inc. is an Equal Opportunity Employer – By Choice. We believe we are best equipped to help our associates, customers, and the communities we serve live better when we really know them. That means understanding, respecting, and valuing unique styles, experiences, identities, ideas, and opinions – while being inclusive of all people.

The above information has been designed to indicate the general nature and level of work performed in the role. It is not designed to contain or be interpreted as a comprehensive inventory of all responsibilities and qualifications required of employees assigned to this job. The full Job Description can be made available as part of the hiring process. At Walmart, we offer competitive pay as well as performance-based bonus awards and other great benefits for a happier mind, body, and wallet. Health benefits include medical, vision and dental coverage. Financial benefits include 401(k), stock purchase and company-paid life insurance. Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty, and voting. Other benefits include short-term and long-term disability, company discounts, Military Leave Pay, adoption and surrogacy expense reimbursement, and more. You will also receive PTO and/or PPTO that can be used for vacation, sick leave, holidays, or other purposes. The amount you receive depends on your job classification and length of employment. It will meet or exceed the requirements of paid sick leave laws, where applicable. For information about PTO, see https://one.walmart.com/notices. Live Better U is a Walmart-paid education benefit program for full-time and part-time associates in Walmart and Sam's Club facilities. Programs range from high school completion to bachelor's degrees, including English Language Learning and short-form certificates. Tuition, books, and fees are completely paid for by Walmart. Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to a specific plan or program terms. For information about benefits and eligibility, see One.Walmart. Bentonville, Arkansas US-10735: The annual salary range for this position is $90,000.00 - $180,000.00 Sunnyvale, California US-11657: The annual salary range for this position is $117,000.00 - $234,000.00 Bellevue, Washington US-11075: The annual salary range for this position is $108,000.00 - $216,000.00 Additional compensation includes annual or quarterly performance bonuses. Additional compensation for certain positions may also include :

Stock

ㅤ

‎

Minimum Qualifications...

__Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications. __

Option 1: Bachelor's degree in computer science, computer engineering, computer information systems, software engineering, or related area and 2 years’ experience in software engineering or related area. Option 2: 4 years’ experience in software engineering or related area.

Preferred Qualifications...

Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.

Master’s degree in Computer Science, Computer Engineering, Computer Information Systems, Software Engineering, or related area, We value candidates with a background in creating inclusive digital experiences, demonstrating knowledge in implementing Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, assistive technologies, and integrating digital accessibility seamlessly. The ideal candidate would have knowledge of accessibility best practices and join us as we continue to create accessible products and services following Walmart’s accessibility standards and guidelines for supporting an inclusive culture.

Masters: Computer Science

Primary Location...

2501 Se J St, Ste A, Bentonville, AR 72716-3724, United States of America

Walmart and its subsidiaries are committed to maintaining a drug-free workplace and has a no tolerance policy regarding the use of illegal drugs and alcohol on the job. This policy applies to all employees and aims to create a safe and productive work environment.

Position Summary...

What you'll do...

Key Responsibilities AI Systems & Agentic Workflows

Build agentic AI services (planning, tool use, retrieval, feedback loops) and integrate them with internal systems and APIs.
Implement orchestration, memory, tooling, evaluation, and guardrails for agentic workflows.
Collaborate with DS/MLE partners to productionize models (LLMs, GNNs, embedding services) behind stable APIs and SDKs.

Accelerated Compute & Data Pipelines

Develop GPU‑accelerated pipelines using RAPIDS (cuDF/cuML/cuGraph) and optimize end‑to‑end performance.
Use Ray (or similar) for distributed compute, batch/stream processing, and scalable workflow orchestration.
Profile and optimize bottlenecks across CPU/GPU, memory, and I/O layers; implement caching, vectorization, and async patterns.

Service & Platform Engineering

Design and maintain reliable microservices for training/inference, vector indexing, and real-time decisioning.
Implement observability (tracing/metrics/logging), fault tolerance, auto-scaling, and cost-aware execution.
Create internal SDKs/CLIs to streamline developer workflows, testing, and reproducibility.

Quality, Security & MLOps Integration

Establish CI/CD for AI services (unit/integration/e2e tests, canaries, blue/green, rollback).
Integrate with feature stores, vector databases, artifact registries, and model catalogs.
Enforce security, privacy, and compliance (data minimization, PII handling, governance, auditability).

Collaboration & Influence

Partner with product, platform, and DS/MLE teams to align requirements, SLAs, and success metrics.
Document systems thoroughly; contribute to design reviews and engineering best practices.
Mentor peers on AI systems patterns, distributed compute, and performance engineering.

Minimum Qualifications

Bachelor’s/Master’s in CS, Engineering, or equivalent industry experience.
4+ years building production backend or platform services (preferably in AI/ML contexts).
Proficiency in:
Languages: Python (primary), plus one of Go/Java/C++ for performance services.
Distributed frameworks: Ray, Spark, or Dask.
Accelerated compute: RAPIDS (cuDF/cuML/cuGraph) and GPU-aware programming concepts (streams, memory).
Service frameworks: FastAPI/Flask (Python), K8s (Kubernetes) and containerization (Docker).
Strong foundations in data structures/algorithms, concurrency, networking, and systems design.

Preferred Qualifications

Production experience with agent frameworks (e.g., LangGraph-style planners, tool-use patterns, retrieval and memory components).
Experience with vector databases (e.g., FAISS, Milvus, pgvector, Pinecone) and feature stores.
Familiarity with LLM and embedding services, prompt/tooling patterns, and evaluation harnesses.
Hands-on with Kubernetes, autoscaling (HPA/KEDA), and GPU scheduling/operators.
Performance profiling: PyTorch profiler, Nsight, line-profiler, Ray dashboard.
Experience with vLLM, Triton Inference Server, ONNX Runtime, or TensorRT for high‑throughput inference.

Soft Skills & Leadership

Pragmatic problem solver with a bias for measurable outcomes (latency, throughput, reliability).
Excellent communicator able to translate between research goals and production constraints.
Drives clarity in ambiguous problem spaces; mentors others and uplifts engineering standards.

Stock

ㅤ

‎

Minimum Qualifications...

__Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications. __

Preferred Qualifications...

Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.

Masters: Computer Science

Primary Location...

2501 Se J St, Ste A, Bentonville, AR 72716-3724, United States of America

Software Engineer Iii– AI Systems

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

Position Summary...

What you'll do...

Minimum Qualifications...

Preferred Qualifications...

Primary Location...

Position Summary...

What you'll do...

Minimum Qualifications...

Preferred Qualifications...

Primary Location...