Machine Learning Engineer III / Senior Machine Learning Engineer - AI Platform

Workday Workday · Enterprise · Boulder, CO

Workday is seeking Machine Learning Engineers to build the next generation of AI-first products, focusing on Agentic AI, Information Retrieval, and Evaluation. The role involves architecting reasoning and planning agents, optimizing LLM configurations, advancing semantic search and Text-to-SQL capabilities, and engineering cloud-based pipelines for evaluation and monitoring. This position bridges research and production, deploying cutting-edge agents into the Workday ecosystem to deliver transformative value to millions of users.

What you'd actually do

  1. Architect Agentic AI: Design and deploy sophisticated reasoning, planning, and swarm agents that interact seamlessly with enterprise data and support continuous, life-long learning.
  2. Drive Meta-ML & Optimization: Develop algorithms for automated node-level optimization within agent graphs, identifying the best LLM and prompt configurations for every workflow step. Build recommender systems for engineering teams to drive optimal evaluation for their agents.
  3. Advance Information Retrieval: Build hybrid, agentic search systems and semantic parsing products (Text-to-SQL/Python) utilizing vector search, reasoning, and fine-tuning for structured output.
  4. Scale Evaluation & Observability: Engineer cloud-based pipelines (Kubeflow) and A/B testing frameworks for rigorous offline/online evaluation, failure attribution, and safety monitoring.
  5. Lead the ML Lifecycle: Own the end-to-end MLOps process—from exploration and prompt engineering to scalable production deployment—ensuring high-quality, reliable performance.

Skills

Required

  • PyTorch or TensorFlow
  • Python
  • Kubeflow
  • LangChain/LangGraph

Nice to have

  • deep learning
  • NLP
  • Information Retrieval
  • recommender systems
  • Generative AI
  • LLM
  • RAG architectures
  • agentic frameworks
  • long-context LLM applications
  • Text-to-SQL
  • modular library design
  • asynchronous patterns
  • scalable system architecture
  • state management
  • error handling

What the JD emphasized

  • Agent Optimization & Evaluation
  • Information Retrieval
  • Agentic AI
  • rigorous data-driven frameworks
  • intelligence layer
  • semantic search
  • natural language-to-code
  • Agentic AI
  • enterprise data
  • life-long learning
  • automated node-level optimization
  • LLM and prompt configurations
  • recommender systems
  • agent evaluation
  • hybrid, agentic search systems
  • semantic parsing products
  • Text-to-SQL/Python
  • vector search
  • fine-tuning
  • structured output
  • Scale Evaluation & Observability
  • cloud-based pipelines
  • Kubeflow
  • A/B testing frameworks
  • rigorous offline/online evaluation
  • failure attribution
  • safety monitoring
  • ML Lifecycle
  • end-to-end MLOps process
  • exploration
  • prompt engineering
  • scalable production deployment
  • high-quality, reliable performance
  • Deep Technical ML Capability
  • production-grade ML systems
  • deep learning
  • NLP
  • Information Retrieval
  • recommender systems
  • PyTorch or TensorFlow
  • Generative AI & Agentic Systems
  • NLP and LLM-powered products
  • RAG architectures
  • agentic frameworks
  • LangChain/LangGraph
  • long-context LLM applications
  • Text-to-SQL
  • Engineering Excellence
  • expert-level Python
  • modular library design
  • asynchronous patterns
  • scalable system architecture
  • state management/error handling
  • non-deterministic AI outputs
  • Deep Technical ML Leadership
  • production-grade ML systems
  • deep learning
  • NLP
  • Information Retrieval
  • recommender systems
  • PyTorch or TensorFlow
  • Generative AI & Agentic Systems
  • NLP and LLM-powered products
  • RAG architectures
  • agentic frameworks
  • LangChain/LangGraph
  • long-context LLM applications
  • Text-to-SQL
  • Engineering Excellence
  • expert-level Python
  • modular library design
  • asynchronous patterns
  • scalable system architecture
  • state management/error handling

Other signals

  • building the next generation of "AI-first" products
  • embedding cutting-edge agents directly into the Workday ecosystem
  • delivering transformative value to millions of users
  • reaching 31 million users globally