Senior Software Engineer, Infrastructure and Tooling Lead - Automation

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +1 · Remote

Senior Software Engineer to lead infrastructure and tooling development for an Automation team, focusing on building scalable internal platforms, automation frameworks, developer productivity tools, and LLM-powered workflows to enhance engineering efficiency. The role involves designing and developing services, APIs, dashboards, workflow engines, and integrating tooling with CI/CD systems. Key responsibilities include defining architecture, coding standards, evaluation methods, and mentoring engineers, with a strong emphasis on LLM-based workflows for various tasks like triage, summarization, and code analysis.

What you'd actually do

  1. Lead the design and development of infrastructure, automation frameworks, and internal engineering tools.
  2. Build scalable services, APIs, dashboards, workflow engines, and integrations that improve developer efficiency and operational visibility.
  3. Develop LLM-based workflows for triage, summarization, code and log analysis, test workflow assistance, report generation, and knowledge retrieval.
  4. Integrate tooling with CI/CD systems, source control, issue tracking, test infrastructure, dashboards, and internal engineering services.
  5. Define architecture, coding standards, evaluation methods, and reliability practices for automation and LLM-enabled systems.

Skills

Required

  • Python
  • Bash
  • C
  • C++
  • Linux
  • LLM-based workflows
  • RAG systems
  • technical leadership
  • architecture ownership
  • mentoring

Nice to have

  • embeddings
  • vector search
  • agent orchestration
  • Kubernetes
  • cloud infrastructure
  • on-prem infrastructure
  • distributed systems

What the JD emphasized

  • LLM-based workflows
  • evaluation methods
  • reliability practices

Other signals

  • LLM-powered workflows
  • developer productivity tools
  • automation frameworks
  • engineering efficiency