Principal, Data Scientist :generative AI (3d & Multi-modal)

Walmart · Retail · Sunnyvale, CA

Principal Data Scientist role focused on Generative AI for 3D and Multi-Modal applications within Walmart's Emerging Tech Extended Reality team. Responsibilities include processing large datasets, developing deep learning models for image generation and synthesis, building and maintaining ML/DL pipelines, and collaborating with product and engineering teams. The role requires experience in computer vision, deep learning, and launching ML models in production.

What you'd actually do

  1. Process large scale Walmart datasets using distributed cloud computing technologies to create and maintain 3D assets for the Walmart catalog.
  2. Use computer vision techniques to create advanced deep learning models for 2D/3D image generation and synthesis, image classification, scene segmentation, object detection, keypoint detection.
  3. Build and maintain ML/DL model pipelines and services with a focus on latency, throughput, and scalability.
  4. End to end ownership and collaboration with product, engineering, vendors, and annotation partners to drive production ML application integrations and maintenance.
  5. Collaborate with other internal Walmart Data Science teams to enable high quality user experiences powered by AI/ML around computer vision.

Skills

Required

  • TensorFlow
  • Tensorflow 3D
  • Pytorch
  • Pytorch3D
  • JAX
  • numpy
  • C++
  • Python
  • Unix
  • Docker
  • CPU and GPU architectures
  • computer vision
  • machine learning
  • deep learning

Nice to have

  • 3D Real-time rendering of scenes, objects using NeRF
  • CNNs
  • vision transformers
  • pre-training from scratch
  • fine-tuning
  • deep learning for synthetic data generation using GANs
  • diffusion models
  • multimodal architectures like CLIP
  • Few-shot learning
  • transfer learning
  • unsupervised and semi-supervised methods
  • contrastive learning
  • active learning
  • weakly supervised auto labeling
  • semantic search index
  • GCS
  • BigQuery
  • Nvidia Toolkits
  • Java
  • Bash
  • Spark (SQL, Streaming)
  • Hadoop Map-Reduce
  • Build Manager (Maven)
  • Distributed Version Control (GIT)
  • Continuous Integration (Jenkins)
  • Pytorch 3D
  • Nvidia Kaolin
  • Augmented Reality applications using Unity or other AR SDK’s
  • Mixed Reality/Virtual Reality(Holo Lens, Gear, Occulus, Vive, Rift.)
  • 3D Modelling tools such as Maya, Blender and other photogrammetry tooling
  • Game Design
  • 3D modelling
  • 3D Computer Graphics

What the JD emphasized

  • Experience in launching Machine Learning models in products and building production pipelines/services
  • Experience with publishing at conferences/journals in Machine learning/Computer Vision

Other signals

  • building production ML pipelines and services
  • end to end ownership of ML applications
  • implementing research papers for improvements