Principal Security Data Engineer, Infrastructure Security Engineering - Dgx Cloud

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +3 · Remote

NVIDIA DGX Cloud is seeking a Principal Security Data Engineer to build the data backbone for its security control plane. This role involves designing, building, and operating data ingestion and transformation pipelines, architecting the data lake/lakehouse, and developing the analytics layer for security posture and detection. The engineer will also be responsible for securing the data layer itself, ensuring data quality and trust, and collaborating with cross-functional teams. The ideal candidate has extensive experience in data engineering at scale, production-grade coding, data modeling, distributed data systems, and security-minded data handling.

What you'd actually do

  1. Design, build, and operate the ingestion and transformation pipelines that collect security telemetry and asset inventory from dozens of heterogeneous sources, and normalize them into one canonical model.
  2. Architect and run the storage layer. A data lake/lakehouse built on open formats, with the schema flexibility to absorb structured inventory, semi-structured telemetry, and unstructured logs without constant, breaking migrations.
  3. Build the query and analytics layer that powers posture scoring, coverage and drift metrics, freshness monitoring, and multi-source correlation.
  4. Treat the data platform as a high-value target, because it is. The data you store is a map of every host, every gap, and every credential path. You will engineer encryption at rest and in transit, fine-grained RBAC/ABAC, non-repudiable audit logging, data classification, network isolation, and verifiable retention and purge.
  5. Build for stable identity, source attribution, append-only history, and honest coverage. Make a source going quiet a finding, not silence, so that every downstream number comes with a known confidence.

Skills

Required

  • Data Engineering at Scale
  • Production-Grade Coding
  • Data Modeling & Schema Design
  • Distributed Data Systems
  • Security-Minded Data Handling
  • Analytics Enablement
  • Python
  • Go
  • Scala
  • SQL

Nice to have

  • Security Telemetry & Detection Engineering
  • Real-Time & Streaming Data
  • HPC/AI Fleet Telemetry
  • AI-Ready Data

What the JD emphasized

  • 15+ years of experience designing, building, and operating production data pipelines, lakes, or lakehouses at high volume and throughput
  • strong software engineering background
  • Proven ability to design canonical schemas and data models
  • Hands-on experience with the modern data stacks, both streaming and batch processing, object storage, open table formats, and interactive query engines
  • You design data systems that are themselves defensible. Access control, encryption, audit, and isolation are first-class concerns in your work, and you understand that security data is among the most sensitive data an organization holds.
  • A track record of making large, messy datasets genuinely useful—serving interactive analysts, dashboards, and downstream services with data they can trust and query at low latency.