Software Engineer, Distributed Compute System

Nuro · Robotics · CA · Offboard Infrastructure

Nuro is seeking senior engineers to build and scale its large-scale computing infrastructure in the cloud and data center. This system supports critical business applications including ML data pipelines, data discovery, simulation, and multimedia log processing. The role involves designing and developing scalable, reliable frameworks to orchestrate and execute large-scale workloads, optimizing efficiency, and providing generic tools and APIs to application teams.

What you'd actually do

  1. Design and develop scalable, reliable frameworks, such as [BATES](https://medium.com/@nuroteam/scaling-autonomy-in-the-cloud-10c57954289b), to orchestrate and execute large-scale workloads in cloud and on-premises clusters.
  2. Collaborate with application teams throughout Nuro to provide robust, generic tools and APIs that accelerate their development of business critical applications.
  3. Optimize workload efficiency and turnaround time by deeply understanding and analyzing application context to identify inefficiencies and bottlenecks, prioritize tasks, and implement innovative solutions.

Skills

Required

  • building and developing large-scale distributed applications
  • Kubernetes
  • self-motivated to identify opportunities for system improvement
  • drive positive change
  • turn high-level requirements into a detailed, cohesive system design
  • deeply understand application logic
  • highly collaborative
  • creating the best possible user experience
  • thrive in the ambiguity of a dynamic, start-up environment
  • bachelor's degree in Computer Science, Electrical Engineering, or a closely related field, or equivalent experience

Nice to have

  • database technologies (e.g., Redis, SQL, NoSQL)
  • pub/sub systems
  • hyperscalers, particularly Google Cloud Platform or other major providers
  • workflow management tools or map-reduce frameworks such as Apache Airflow, Buildkite, Ray, or Dataflow
  • Linux systems (e.g., CPU scheduler, memory management, file systems)
  • data analysis, including defining metrics, running A/B tests, and building metric dashboards
  • real-time monitoring frameworks such as Prometheus
  • continuous integration/delivery (CI/CD)