Lead Software Engineer - Data Ops

Caterpillar Caterpillar · Industrial · Bangalore, Karnataka +1

Lead Software Engineer for Data Ops role at Caterpillar, focusing on designing, architecting, and scaling secure AWS cloud platforms for digital and AI/ML workloads. The role involves owning CI/CD platform strategy, Infrastructure as Code, cloud reliability, DevSecOps governance, and enterprise standardization, enabling consistent and scalable delivery for application, data, and ML engineering teams. Key responsibilities include building CI/CD platforms with GitHub Actions, engineering AWS-based platforms (SageMaker, ECS/EKS), managing the MLOps lifecycle, implementing IaC with CloudFormation, enforcing DevSecOps controls, managing artifact lifecycles with JFrog Artifactory, and driving SRE practices for platform reliability.

What you'd actually do

  1. Owns architecture and implementation of enterprise DevOps and cloud platforms across programs.
  2. Drives standardization, scalability, and reuse of platform capabilities.
  3. Influences DevOps and MLOps practices across multiple teams and domains.
  4. Mentors engineers and elevates engineering quality and platform maturity.

Skills

Required

  • AWS CloudFormation
  • Python
  • Go
  • GitHub Actions
  • JFrog Artifactory
  • SonarQube
  • SageMaker
  • ECS/EKS
  • CodePipeline
  • IAM
  • S3

Nice to have

  • AWS Firewall and networking expertise
  • Security and governance for AI platforms
  • NVIDIA-based environments
  • Azure cloud services

What the JD emphasized

  • Deep AWS expertise with DevOps and MLOps services (SageMaker, ECS/EKS, CodePipeline, IAM, S3)
  • Strong proficiency in GitHub and GitHub Actions (enterprise-scale CI/CD)
  • Advanced experience with AWS CloudFormation
  • Proven experience designing and scaling DevOps/MLOps platforms

Other signals

  • MLOps lifecycle including model deployment, automation, monitoring, and governance
  • Lead engineering of AWS-based platforms supporting application and ML workloads
  • Owns architecture and implementation of enterprise DevOps and cloud platforms across programs