Senior Software Engineer - SoC DevOps, MLA-MI - Annapurna Labs

Amazon · Big Tech · Austin, TX · Software Development

Senior Software Engineer focused on DevOps for AWS's custom ML chips (Trainium/Inferentia), enabling rapid software development for AI training and inference. The role owns CI/CD pipelines, release processes, and hardware-in-the-loop testing infrastructure to ensure timely, high-quality software delivery for silicon development.

What you'd actually do

  1. Own the end-to-end CI/CD pipelines and release processes for all SoC software components, including firmware, hardware abstraction layers, and modeling tools.
  2. Develop hardware-in-the-loop test infrastructure that validates SoC software on actual silicon in laboratory and automated testing settings.
  3. Build observability tools, such as dashboards that track build health, test coverage, and pipeline performance, along with alerting systems that notify teams of regressions.
  4. Identify and remove friction in development workflows, such as slow build times or complex release steps, using data-driven insights to prioritize improvements that accelerate team productivity.
  5. Solve novel problems, such as bridging disconnected environments and orchestrating synchronized releases across multiple domains.

Skills

Required

  • 7+ years of experience across the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
  • Experience as a mentor or tech lead, or leading an engineering team
  • Experience programming in Python and at least one of: Bash, Go, C++, or Java
  • Experience with infrastructure-as-code (CDK, CloudFormation, Terraform, etc.)
  • Experience with AWS services (Lambda, S3, EC2, CloudWatch, IAM, Secrets Manager, etc.)
  • Experience with Linux-based build and development environments

Nice to have

  • Bachelor's degree in computer science or equivalent
  • Experience with Amazon's internal build and release systems (Brazil, Pipelines, CRUX, Apollo, etc.)
  • Experience building cross-environment or cross-account automation (e.g., bridging corporate and isolated/air-gapped environments)
  • Experience with Jenkins pipeline development and administration
  • Experience with hardware-in-the-loop testing or supporting hardware/silicon development teams
  • Experience building observability infrastructure: dashboards, metrics pipelines, alerting (CloudWatch, QuickSight, or similar)

What the JD emphasized

  • primary owner of infrastructure
  • primary owner
  • critical to supporting
  • critical
  • foundational position
  • foundational
  • direct impact
  • strict quality standards
  • significant focus
  • novel technical contexts
  • novel problems