Software Engineer, Performance Tooling and Infrastructure

Nuro · Robotics · CA · Fleet Infrastructure

Nuro is seeking a Software Engineer to own the performance simulation platform infrastructure for their self-driving technology. This role involves developing and maintaining systems for benchmarking autonomy code changes, ensuring real-time performance on actual robot compute hardware. Responsibilities include building benchmarking infrastructure, ensuring platform reliability and observability, designing data pipelines for performance metrics, conducting statistical analysis, guiding bare-metal OS configuration, and driving platform strategy. The role requires strong software engineering skills in Python and C++, data engineering experience, deep Linux systems knowledge, and technical leadership capabilities. While the role supports AI development, it focuses on the infrastructure and tooling rather than core AI/ML model development.

What you'd actually do

Develop and maintain the job orchestration layer that schedules, executes, and validates autonomy performance benchmarks across a fleet of physical bench-top systems — integrated into CI/CD pipelines as merge-blocking and release-blocking quality gates.
Build monitoring, alerting, and self-healing automation for the bench fleet. Proactively identify systemic risks — capacity bottlenecks, hardware degradation patterns, infrastructure single points of failure — before they become outages. Track utilization, failure rates, and capacity trends to ensure the platform scales ahead of organizational demand.
Design and build end-to-end data pipelines that capture fine-grained performance metrics (CPU/GPU utilization, memory bandwidth, E2E latency, scheduling jitter) from bench-top runs, process them at scale, and surface actionable insights through dashboards and automated regression detection.
Work with Data Science to develop rigorous experimentation methodology for performance results from non-deterministic autonomy workloads — including variance analysis, significance testing, and regression detection.
Guide the SRE team through the OS and system-level configuration of bench hardware — including Linux kernel tuning, boot infrastructure, networking, and hardware bring-up — ensuring the platform faithfully reproduces production robot compute behavior.

Skills

Required

Python
C++
Linux systems
data pipelines
SQL
networking
storage
compute
technical leadership
roadmap setting
stakeholder alignment

Nice to have

Kubernetes
GCP
BigQuery
Grafana
NVIDIA Thor platform
systemd units
kernel tuning
boot infrastructure
hardware bring-up
statistical analysis
experimentation methodology
variance analysis
significance testing
agentic tooling

What the JD emphasized

must be validated for real-time performance on actual robot compute hardware
merge-blocking and release-blocking quality gates
technical DRI for the platform
setting the roadmap
making architectural calls
representing the platform's needs to the leadership team
ensuring the system scales through multiple hardware generations
proactively identify systemic risks
before they become outages
scales ahead of organizational demand
surface actionable insights
automated regression detection
rigorous experimentation methodology
faithfully reproduces production robot compute behavior
Own the planning lifecycle for the benchmarking fleet across hardware generations
Partner with engineering and program leadership to negotiate hardware allocation
model utilization scenarios under real-world constraints
present data-backed trade-off recommendations
balancing testing coverage, user throughput, cost, and SLA commitments against finite physical resources
translate their performance analysis needs into robust, self-service infrastructure
3+ years of industry software engineering experience
Strong proficiency in Python
working proficiency in C++
You write clean, testable, well-documented code and care about long-term maintainability
Experience building data pipelines, ingestion, transformation, storage, and visualization
Familiarity with SQL and analytical workflows
Deep comfort with Linux systems
you've configured kernels, debugged boot issues, written systemd units, or managed bare-metal infrastructure
You understand networking, storage, and compute at a level beyond "it just works."
Experience setting technical vision and roadmap for a project or platform
driving alignment across multiple stakeholders
You've independently identified the cross-functional partners needed to unblock and deliver
you've briefed senior engineering leadership on trade-offs and recommendations
AI-Native
treat AI as a core part of your engineering workflow
you use agentic tooling (e.g., Claude Code) across the development lifecycle
you understand the boundaries

Read full job description

Who We Are

Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world’s most scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro licenses its core technology, the Nuro Driver™, to support a wide range of applications, from robotaxis and commercial fleets to personally owned vehicles. With technology proven over years of self-driving deployments, Nuro gives the automakers and mobility platforms a clear path to AVs at commercial scale, empowering a safer, richer, and more connected future.

**About the Role

Nuro leverages many different bench-top systems to evaluate and regression test different aspects of the software and hardware integration layer. This performance simulation platform includes systems

At Nuro, every autonomy code change, from ML model updates to radius of map around the robot to number of evaluated trajectories, must be validated for real-time performance on actual robot compute hardware before it reaches the road. You will own the infrastructure that makes this possible.

Our Performance Simulation Platform is a hybrid benchmarking system: physical bench-top rigs running production robot compute (NVIDIA Thor platform), orchestrated by cloud-native infrastructure (Kubernetes, GCP), automated data pipelines feeding performance metrics into BigQuery and Grafana, pre/post simulation magic, custom tracing and profiling tools, and much much more.

Engineers across the company rely on this platform daily to answer questions like:

How will my new ML model affect contention on the GPU?
How does a new data format impact onboard logging rate or network contention as more data might be flowing from through the system?
How much memory should be allocated for this new module, and how does it fit into the overall system budget?

You'll be responsible for development, integration, and the evolution of this platform — from the bare-metal OS and networking layer through the job orchestration and CI/CD integration up to the data analysis and visualization layer. This is a high-ownership, high-autonomy role on a small team where your work directly gates the release velocity of the entire autonomy stack. You'll be the technical DRI for the platform — setting the roadmap, making architectural calls, representing the platform's needs to the leadership team, and ensuring the system scales through multiple hardware generations.

**About the Work

Benchmarking Infrastructure: Develop and maintain the job orchestration layer that schedules, executes, and validates autonomy performance benchmarks across a fleet of physical bench-top systems — integrated into CI/CD pipelines as merge-blocking and release-blocking quality gates.
Platform Reliability & Observability: Build monitoring, alerting, and self-healing automation for the bench fleet. Proactively identify systemic risks — capacity bottlenecks, hardware degradation patterns, infrastructure single points of failure — before they become outages. Track utilization, failure rates, and capacity trends to ensure the platform scales ahead of organizational demand.
Performance Data Pipelines: Design and build end-to-end data pipelines that capture fine-grained performance metrics (CPU/GPU utilization, memory bandwidth, E2E latency, scheduling jitter) from bench-top runs, process them at scale, and surface actionable insights through dashboards and automated regression detection.
Statistical Analysis & Experimentation: Work with Data Science to develop rigorous experimentation methodology for performance results from non-deterministic autonomy workloads — including variance analysis, significance testing, and regression detection. Bare-Metal & OS Platform: Guide the SRE team through the OS and system-level configuration of bench hardware — including Linux kernel tuning, boot infrastructure, networking, and hardware bring-up — ensuring the platform faithfully reproduces production robot compute behavior.
**Drive Platform & Allocation Strategy: **Own the planning lifecycle for the benchmarking fleet across hardware generations. Partner with engineering and program leadership to negotiate hardware allocation, model utilization scenarios under real-world constraints, and present data-backed trade-off recommendations — balancing testing coverage, user throughput, cost, and SLA commitments against finite physical resources.
Cross-Functional Collaboration: Partner with Hardware Engineering, NPI (New Product Introduction), SRE (Site Reliability Engineering), Perception, Behavior, and Data Science teams to translate their performance analysis needs into robust, self-service infrastructure.

About You

Experience: 3+ years of industry software engineering experience.
Software Engineering: Strong proficiency in Python and working proficiency in C++. You write clean, testable, well-documented code and care about long-term maintainability.
Data Engineering: Experience building data pipelines, ingestion, transformation, storage, and visualization. Familiarity with SQL and analytical workflows.
Systems & Infrastructure: Deep comfort with Linux systems — you've configured kernels, debugged boot issues, written systemd units, or managed bare-metal infrastructure. You understand networking, storage, and compute at a level beyond "it just works."
**Technical Leadership: **Experience setting technical vision and roadmap for a project or platform, driving alignment across multiple stakeholders. You've independently identified the cross-functional partners needed to unblock and deliver, and you've briefed senior engineering leadership on trade-offs and recommendations.
AI-Native: You treat AI as a core part of your engineering workflow, not an occasional shortcut — you use agentic tooling (e.g., Claude Code) across the development lifecycle and you understand the boundaries of when AI output demands extra scrutiny versus when it accelerates you.
Bias for Action: Comfortable operating in ambiguous, fast-moving environments where you need to balance long-term architecture with short-term delivery.

Bonus Points:_

Experience with performance engineering, especially around tooling integration (perf, Perfetto, pprof, eBPF, NVIDIA Nsight Systems, NVIDIA CUPTI).
Experience in robotics or AV, particularly with NVIDIA DriveOS stack.

At Nuro, your base pay is one part of your total compensation package. For this position, the reasonably expected base pay range is between $152,000 and $228,000 for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for an annual performance bonus, equity, and a competitive benefits package.

At Nuro, we celebrate differences and are committed to a diverse workplace that fosters inclusion and psychological safety for all employees. Nuro is proud to be an equal opportunity employer and expressly prohibits any form of workplace discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other legally protected characteristics. #LI-DNP

Who We Are

**About the Role

Nuro leverages many different bench-top systems to evaluate and regression test different aspects of the software and hardware integration layer. This performance simulation platform includes systems

Engineers across the company rely on this platform daily to answer questions like:

How will my new ML model affect contention on the GPU?
How does a new data format impact onboard logging rate or network contention as more data might be flowing from through the system?
How much memory should be allocated for this new module, and how does it fit into the overall system budget?

**About the Work

Benchmarking Infrastructure: Develop and maintain the job orchestration layer that schedules, executes, and validates autonomy performance benchmarks across a fleet of physical bench-top systems — integrated into CI/CD pipelines as merge-blocking and release-blocking quality gates.
Platform Reliability & Observability: Build monitoring, alerting, and self-healing automation for the bench fleet. Proactively identify systemic risks — capacity bottlenecks, hardware degradation patterns, infrastructure single points of failure — before they become outages. Track utilization, failure rates, and capacity trends to ensure the platform scales ahead of organizational demand.
Performance Data Pipelines: Design and build end-to-end data pipelines that capture fine-grained performance metrics (CPU/GPU utilization, memory bandwidth, E2E latency, scheduling jitter) from bench-top runs, process them at scale, and surface actionable insights through dashboards and automated regression detection.
Statistical Analysis & Experimentation: Work with Data Science to develop rigorous experimentation methodology for performance results from non-deterministic autonomy workloads — including variance analysis, significance testing, and regression detection. Bare-Metal & OS Platform: Guide the SRE team through the OS and system-level configuration of bench hardware — including Linux kernel tuning, boot infrastructure, networking, and hardware bring-up — ensuring the platform faithfully reproduces production robot compute behavior.
**Drive Platform & Allocation Strategy: **Own the planning lifecycle for the benchmarking fleet across hardware generations. Partner with engineering and program leadership to negotiate hardware allocation, model utilization scenarios under real-world constraints, and present data-backed trade-off recommendations — balancing testing coverage, user throughput, cost, and SLA commitments against finite physical resources.
Cross-Functional Collaboration: Partner with Hardware Engineering, NPI (New Product Introduction), SRE (Site Reliability Engineering), Perception, Behavior, and Data Science teams to translate their performance analysis needs into robust, self-service infrastructure.

About You

Experience: 3+ years of industry software engineering experience.
Software Engineering: Strong proficiency in Python and working proficiency in C++. You write clean, testable, well-documented code and care about long-term maintainability.
Data Engineering: Experience building data pipelines, ingestion, transformation, storage, and visualization. Familiarity with SQL and analytical workflows.
Systems & Infrastructure: Deep comfort with Linux systems — you've configured kernels, debugged boot issues, written systemd units, or managed bare-metal infrastructure. You understand networking, storage, and compute at a level beyond "it just works."
**Technical Leadership: **Experience setting technical vision and roadmap for a project or platform, driving alignment across multiple stakeholders. You've independently identified the cross-functional partners needed to unblock and deliver, and you've briefed senior engineering leadership on trade-offs and recommendations.
AI-Native: You treat AI as a core part of your engineering workflow, not an occasional shortcut — you use agentic tooling (e.g., Claude Code) across the development lifecycle and you understand the boundaries of when AI output demands extra scrutiny versus when it accelerates you.
Bias for Action: Comfortable operating in ambiguous, fast-moving environments where you need to balance long-term architecture with short-term delivery.

Bonus Points:_

Experience with performance engineering, especially around tooling integration (perf, Perfetto, pprof, eBPF, NVIDIA Nsight Systems, NVIDIA CUPTI).
Experience in robotics or AV, particularly with NVIDIA DriveOS stack.

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

**Who We Are **

**About the Role

**About the Work

About You

Bonus Points:_

**Who We Are **

**About the Role

**About the Work

About You

Bonus Points:_

Who We Are

Who We Are