Senior Software Engineer - Environments Accelerator

Datadog Datadog · Enterprise · Denver, CO +1 · Dev Eng

Senior Software Engineer role focused on designing and building systems for infrastructure provisioning and datacenter operations, with a specific mention of supporting AI inference workloads. The role involves working with Kubernetes, cloud infrastructure, and IaC, aiming to reduce operational overhead and complexity.

What you'd actually do

  1. Design and build systems that reduce the operational overhead, cost, and complexity of provisioning new infrastructure environments and datacenters
  2. Develop tooling and orchestration patterns for declarative environment bootstrap, dependency coordination, and infrastructure validation
  3. Improve the reliability and repeatability of infrastructure provisioning workflows by reducing manual coordination and implicit system dependencies
  4. Work across infrastructure domains including Kubernetes, cloud infrastructure, identity systems, service bootstrap, and datacenter configuration management
  5. Collaborate broadly across baseline infrastructure, platform, and product engineering teams to build reusable environment and provisioning capabilities that support the full Datadog platform stack

Skills

Required

  • Golang or similar systems programming languages
  • Kubernetes and container orchestration systems
  • Scalable automation solutions across multi-cloud environments (AWS, Azure, GCP)
  • Infrastructure as Code (IaC)
  • CI/CD pipelines
  • Linux
  • Distributed systems
  • Infrastructure platforms
  • Internal developer tooling

Nice to have

  • debugging across multiple layers of the stack
  • pragmatic tradeoffs between speed, reliability, maintainability, and operational complexity
  • clear communication and effective collaboration

What the JD emphasized

  • experience in Golang or similar systems programming languages
  • experience with Kubernetes and container orchestration systems
  • demonstrated expertise in designing and implementing scalable automation solutions across multi-cloud environments (e.g., AWS, Azure, GCP), leveraging Infrastructure as Code (IaC), CI/CD pipelines, and cloud-native tooling to improve operational efficiency, consistency, and reliability
  • experience with Linux, CI/CD systems, from deployment automation to debugging
  • experience building or operating distributed systems, infrastructure platforms, or internal developer tooling