Senior Data Engineer, Nsv at NVIDIA

What you'd actually do

Define and execute the group's data technical roadmap, aligning with Infra, DevOps, and Performance teams

Design and maintain flexible ETL/ELT frameworks for ingesting, transforming, and classifying cluster verification and telemetry data

Build and optimize streaming pipelines using Apache Spark, Kafka, and Databricks, ensuring high throughput, reliability, and adaptability to evolving data schemas

Ensure data quality and pipeline health through observability standards, schema validation, lineage tracking, monitoring, and alerting

Contribute to the development of AI Agents that enhance the visibility and accessibility of insights and data for our users

Skills

Required

B.Sc. or M.Sc. in Computer Science, Data Science, or a related field
5+ years of hands-on experience in data engineering
Strong practical experience with Apache Spark( PySpark or Scala) and Databricks
Proficiency in Python and SQL for data transformation, automation, and pipeline logic
Experience with Apache Kafka, including stream ingestion and event processing
Experience with schema evolution, data versioning, and validation frameworks (Delta Lake, Iceberg, or Great Expectations)
Strong problem-solving skills and ability to debug and troubleshoot complex data-related issues
Strong communication skills and ability to work effectively across teams

Nice to have

Experience with real-time analytics frameworks (Spark Structured Streaming, Flink, Kafka Streams)
Exposure to hardware, firmware, or embedded telemetry environments
Experience with data cataloging or governance tools (DataHub, Collibra, or Alation)
Hands-on experience building or deploying AI Agents or LLM-based applications

NVIDIA is looking for a Senior Data Engineer to join the NSV (Network Solutions Validation) group. NSV builds high-performing software automation for NVIDIA's Data Center environments and helps drive the data growth of the world's biggest companies. In this role, you will design, build, and maintain scalable, high-performance data pipelines that handle massive volumes of data from hardware, communication modules, firmware, and large-scale AI and HPC clusters. You'll also contribute to our growing Agentic AI initiatives, helping develop AI Agents that bring our data capabilities to the next level.

What you'll be doing:

Define and execute the group's data technical roadmap, aligning with Infra, DevOps, and Performance teams
Design and maintain flexible ETL/ELT frameworks for ingesting, transforming, and classifying cluster verification and telemetry data
Build and optimize streaming pipelines using Apache Spark, Kafka, and Databricks, ensuring high throughput, reliability, and adaptability to evolving data schemas
Ensure data quality and pipeline health through observability standards, schema validation, lineage tracking, monitoring, and alerting
Deliver reliable insights for cluster performance analysis, telemetry visibility, and end-to-end test coverage
Support self-service analytics for engineers and researchers via Databricks notebooks, APIs, and datasets
Drive best practices in data modeling, code quality, and operational excellence; collaborate with cross-functional teams to support data-driven decision-making
Contribute to the development of AI Agents that enhance the visibility and accessibility of insights and data for our users

What we need to see:

B.Sc. or M.Sc. in Computer Science, Data Science, or a related field
5+ years of hands-on experience in data engineering
Strong practical experience with Apache Spark( PySpark or Scala) and Databricks
Proficiency in Python and SQL for data transformation, automation, and pipeline logic
Experience with Apache Kafka, including stream ingestion and event processing
Experience with schema evolution, data versioning, and validation frameworks (Delta Lake, Iceberg, or Great Expectations)
Strong problem-solving skills and ability to debug and troubleshoot complex data-related issues
Strong communication skills and ability to work effectively across teams

Ways to stand out from the crowd:

Experience with real-time analytics frameworks (Spark Structured Streaming, Flink, Kafka Streams)
Exposure to hardware, firmware, or embedded telemetry environments
Experience with data cataloging or governance tools (DataHub, Collibra, or Alation)
Hands-on experience building or deploying AI Agents or LLM-based applications

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you! NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.