Senior Devops Engineer

NVIDIA NVIDIA · Semiconductors · Pune, India

NVIDIA is seeking a Senior DevOps Engineer to join its Software Infrastructure team, focusing on developing tools to optimize development workflows and enhance efficiency within a large-scale private cloud system. The role involves designing and implementing CI/CD systems, managing Kubernetes, and ensuring the health and scalability of infrastructure supporting thousands of developers.

What you'd actually do

  1. Drive automation to monitor and gain more insight into applications and system health.
  2. Design solutions with service discovery, networking, monitoring, logging, scheduling in Kubernetes.
  3. Implement, manage & maintain end to end Jenkins instances - tools, plugins, nodes, user management, back up, restore, monitoring, etc.
  4. Implement & support end-to-end CI/CD system using open-source software.
  5. Develop, Improve and Maintain our infrastructure codebase.

Skills

Required

  • python
  • scripting languages
  • cloud infrastructure
  • highly-available production environment
  • debugging
  • problem solving
  • analytical skills
  • architectural requirements
  • development processes
  • reliable, robust, scalable data products and pipelines
  • Databases
  • SQL
  • MySQL
  • NoSQL
  • Elastic Search
  • MongoDB
  • Cassandra
  • configuration management tools
  • Ansible
  • Puppet
  • Chef
  • Jenkins
  • CI/CD systems
  • Kubernetes
  • dockers
  • virtualization
  • monitoring systems
  • Zabbix
  • Prometheus
  • 5+ years of proven experience
  • Bachelor's or Master’s degree in Computer Science, Software Engineering, or equivalent experience

Nice to have

  • Windows server infrastructure
  • using and improving data centers
  • computer algorithms
  • best possible algorithms
  • scaling challenge
  • Analyze sophisticated problems into simple sub problems
  • reuse available solutions
  • design simple systems
  • work efficiently without needing much support
  • Curiosity about LLMs, NLP, or AI-driven developer tools

What the JD emphasized

  • critical path
  • highly-available production environment
  • reliable, robust, scalable data products and pipelines