(usa) Staff, Software Engineer | Mle

Walmart · Retail · Sunnyvale, CA

Staff Software Engineer/MLE on the Emerging Tech Extended Reality team at Walmart, focusing on Computer Vision, Machine Learning, and Deep Learning for AR/VR experiences. Responsibilities include developing multi-modal GenAI models for content generation and intelligence, scaling data pipelines, establishing model evaluation frameworks, driving production integration of AI applications, and translating research into production-grade solutions. Requires experience with multi-modal LLMs, diffusion models, 3D representation learning, and Python frameworks like PyTorch/TensorFlow.

What you'd actually do

  1. Develop Multi-Modal GenAI Models: Design and build models for content generation (imagery, video, 3D assets) and content intelligence (segmentation, object detection, spatial awareness, and keypoint detection).
  2. Scale Data Pipelines: Process large-scale datasets using distributed cloud computing technologies to create and maintain high-fidelity 2D and 3D assets.
  3. Ensure Model Excellence: Establish robust experimentation frameworks to evaluate model performance across realism, accuracy, latency, and downstream usability.
  4. Drive Production Integration: Collaborate end-to-end with product, engineering, and annotation partners to deploy scalable AI applications and manage their lifecycle.
  5. Innovate & Optimize: Translate state-of-the-art research papers into production-grade solutions, improving every aspect of the pipeline—from data generation techniques to efficient model inference.

Skills

Required

  • Master's degree in Computer Science with a specialization in Computer Vision, Machine Learning, or equivalent practical experience.
  • 3+ years of experience with machine learning algorithms and tools, with at least 1 year dedicated to deep learning.
  • Proven experience launching ML models in production environments and building scalable pipelines/services.
  • Hands-on experience with multi-modal LLMs, diffusion models, 3D representation learning, or spatial/keypoint modeling.
  • Proficiency in Python, along with frameworks such as PyTorch, PyTorch3D, TensorFlow, TensorFlow 3D, JAX, or NumPy.
  • Experience with Unix, Docker, and optimizing workflows for CPU/GPU architectures.

Nice to have

  • PhD in Machine Learning, Computer Science, or a related technical field.
  • Experience in digital commerce, AR/VR, or rich media personalization.
  • Deep expertise in 3D Deep Learning (e.g., PyTorch3D, NVIDIA Kaolin) and 3D Modeling tools (e.g., Maya, Blender, photogrammetry).
  • Experience building Augmented Reality applications using Unity or other AR SDKs.
  • Deep experience with scalable ML infrastructure and content delivery systems.
  • Understanding of human-in-the-loop evaluation, perceptual quality metrics, and responsible AI frameworks.
  • Recognized contributions to the field through publications, patents, open-source projects, or public talks.

What the JD emphasized

  • Proven experience launching ML models in production environments and building scalable pipelines/services.
  • Hands-on experience with multi-modal LLMs, diffusion models, 3D representation learning, or spatial/keypoint modeling.

Other signals

  • Develop Multi-Modal GenAI Models
  • Scale Data Pipelines
  • Ensure Model Excellence
  • Drive Production Integration
  • Innovate & Optimize
  • Cross-Functional Collaboration