Senior Autonomy Engineering Specialist - Platform

Caterpillar Caterpillar · Industrial · Chennai, Tamil Nadu

Seeking a Senior Principal Software Engineer for Platform Engineering to lead the architecture, evolution, and technical direction of enterprise platform capabilities supporting mission-critical applications at scale. This role involves deep platform expertise, architectural judgment, and influencing technical strategy across multiple teams while remaining hands-on. Responsibilities include defining platform standards, driving technical decisions for availability and resiliency, shaping Kubernetes environments, and mentoring engineers. Experience with AI-assisted engineering tools for software delivery is a plus.

What you'd actually do

  1. Lead architecture and design decisions across platform domains including Kubernetes, compute, networking, storage, observability, security, and production operations.
  2. Define and evolve platform standards, reference architectures, reusable patterns, and technical guardrails that enable scalable, reliable, and secure systems.
  3. Drive complex technical decisions across availability, resiliency, scalability, fault isolation, disaster recovery, and lifecycle management.
  4. Shape the direction of Kubernetes-based platform environments, including cluster architecture, upgrades, hardening, recovery, and operational readiness.
  5. Partner closely with SRE, DevSecOps, Security, Infrastructure, and application engineering teams to ensure platform solutions are observable, supportable, secure, and production-ready.

Skills

Required

  • Software engineering
  • Platform engineering
  • Infrastructure engineering
  • Distributed systems
  • Kubernetes
  • Linux
  • Networking
  • Storage
  • System design
  • Platform reliability engineering
  • Architectural thinking
  • Engineering judgement

Nice to have

  • on-premises, air-gapped, edge, or self-managed Kubernetes environments
  • infrastructure automation
  • platform operations practices
  • Terraform
  • Ansible
  • GitOps
  • platform security
  • observability
  • service mesh
  • enterprise infrastructure patterns
  • Platform Engineering + SRE shared-ownership model
  • industrial, autonomy, or other mission-critical environments
  • Generative AI or agentic engineering tools applied to software delivery, platform operations, or developer productivity

What the JD emphasized

  • 12+ years of experience in software engineering, platform engineering, infrastructure engineering, or distributed systems.
  • Proven experience designing, building, and operating large-scale, highly available production platforms.
  • Deep expertise in Kubernetes and container-based platform architecture in production environments.
  • Strong grounding in distributed systems, Linux, networking, storage, system design, and platform reliability engineering.
  • Ability to influence technical direction and drive alignment across teams without depending on formal organisational authority.
  • Strong architectural thinking, sound engineering judgement, and the ability to balance long-term platform strategy with practical delivery outcomes.