Solution Architect (AI/LLM Inference)

Baseten · Data AI · San Francisco, CA · EPD

Solution Architect role focused on AI/LLM inference, partnering with Sales and customers to design and deploy technical solutions. Responsibilities include customer discovery, technical scoping, leading demos, managing deployments, and driving POC execution. Requires an AI/ML background and customer-facing communication skills, with the ability to script and prototype.

What you'd actually do

  1. Partner with Sales on customer discovery calls (most often second calls, occasionally first calls for large accounts).
  2. Lead demos and technical scoping to align on success criteria, architecture, and deployment approach.
  3. Own benchmarking and repeatable deployments, including driving consistent “playbook”-style deployments for common models and use cases.
  4. Drive POC and project execution.

Skills

Required

  • AI/ML background
  • Customer-facing communication skills
  • Ability to script and prototype

Nice to have

  • Experience running or supporting benchmarks for ML inference deployments
  • Familiarity with infrastructure tradeoffs relevant to inference performance and cost
  • Experience serving as a cross-functional technical lead for customer POCs

What the JD emphasized

  • customer discovery calls
  • demos
  • technical scoping
  • benchmarking
  • repeatable deployments
  • POC and project execution

Other signals

  • customer-facing technical role
  • AI/LLM inference
  • production deployments
  • benchmarking
  • customer POCs