Development Engineer 3

Comcast · Media · Chennai, India

Seeking an experienced Level 3 OpenStack Administrator to manage, optimize, and troubleshoot a large-scale private cloud platform. Responsibilities include migration, upgrade, design, implementation, and ongoing management of OpenStack infrastructure, ensuring high availability, scalability, and performance. Requires strong experience in Linux system administration and in Infrastructure as Code (IaC) using Ansible, Git, Terraform, Kubernetes Operators, and Python. Familiarity with monitoring tools (Prometheus, Grafana, ELK) and operational best practices is essential. Experience with Red Hat OpenStack and Ceph storage is preferred. The role involves assisting with migration and maintenance tasks outside regular business hours and participating in an on-call rotation.

What you'd actually do

  1. Plan, design, test, and execute migration from legacy hardware platforms to OpenStack-based infrastructure, ensuring minimal downtime and seamless service continuity.
  2. Monitor, optimize, and troubleshoot OpenStack components and underlying hardware to maintain high availability, performance, capacity, and platform security.
  3. Collaborate closely with network, storage, and platform engineering teams to ensure efficient integration, streamlined operations, and optimized performance across all OpenStack services.
  4. Plan, coordinate, and execute OpenStack upgrades to the latest stable releases, performing rolling upgrades, validating component compatibility, executing regression testing, and carrying out post-upgrade tuning to ensure optimal platform stability, performance, and reliability.
  5. Stay updated on industry trends, emerging cloud technologies, and OpenStack ecosystem advancements, applying best practices to continuously improve the platform.

Skills

Required

  • 4-6 years designing, building, and operating OpenStack environments
  • 3+ years of proficiency with automation and DevOps tools such as Ansible, AWX, Terraform, and GitHub
  • 3+ years of proficiency with IP networks, including network design and operations
  • 3+ years of hands-on experience with server hardware deployment, maintenance, and troubleshooting
  • 3+ years of strong Linux administration skills

Nice to have

  • Experience engineering and/or operating SAN or Ceph storage systems
  • Experience in cybersecurity
  • Experience with administering and operating HPE/Dell hardware and firmware
  • Proficiency in coding for automation, such as Python scripts and Ansible playbooks

What the JD emphasized

  • high availability (99.99%)
  • low-latency enterprise applications