Senior System Software Engineer - Metropolis

NVIDIA · Semiconductors · Pune, India

This Senior System Software Engineer role focuses on full-stack development of microservices for NVIDIA's Metropolis team. The role involves designing and implementing front-end interfaces, back-end services, and cloud-native applications using containers and orchestration. Key responsibilities include building high-performance, low-latency systems, real-time features, streaming pipelines, and event-driven architectures. Collaboration with AI/ML teams is expected, along with troubleshooting, performance optimization, and CI/CD implementation. The role requires strong full-stack development experience, back-end proficiency in Node.js or Python, and an understanding of cloud fundamentals, Docker, and Kubernetes.

What you'd actually do

  1. Lead full-stack development end-to-end: design and implement responsive front-end interfaces, robust back-end services, microservices architecture, and cloud-native applications using containers (Docker) and orchestration (Kubernetes).
  2. Build high-performance, low-latency systems including real-time features, streaming pipelines, event-driven architectures with message brokers (e.g., Kafka, MQTT, Redis Streams), and scalable RESTful APIs.
  3. Collaborate closely with architects, designers, DevOps, and AI/ML teams to translate requirements into production-ready features, participate in code reviews, and contribute to architectural decisions.
  4. Troubleshoot complex issues across the stack, optimize application performance, implement CI/CD pipelines, and deploy updates to cloud environments (AWS, Azure, or GCP).

Skills

Required

  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field (or equivalent practical experience)
  • 3+ years of hands-on full-stack development experience
  • Solid back-end experience with Node.js (Express/NestJS) or Python (FastAPI, Django/Flask), including building REST APIs, GraphQL, authentication (JWT, OAuth), and integrating with databases (PostgreSQL, Elasticsearch)
  • Good understanding of cloud fundamentals, Docker, Kubernetes basics, Git, and modern development workflows (CI/CD, testing: unit/integration/E2E)
  • Strong grasp of algorithms, data structures, time/space complexity, and problem-solving skills (comfortable with LeetCode-style questions)

Nice to have

  • Familiarity with real-time technologies (WebSockets, Socket.io), message queues, and event-driven systems
  • Exposure to AI/ML integration: working with LLMs, prompt engineering, Retrieval-Augmented Generation (RAG), vector databases (Pinecone, Milvus), or Agentic AI workflows
  • Knowledge of DevOps practices (Terraform, Helm, monitoring with Prometheus/Grafana), serverless architectures, and micro-frontends
  • Contributions to open-source projects, personal side projects with live demos, or experience optimizing applications for high traffic/low latency