Staff Backend Engineer, ML Inference Sy… at Unity

What you'd actually do

Design, develop, and deploy production-grade backend services and distributed systems powering large-scale online model inference at billions of daily requests

Drive technical direction of our inference platform, with a focus on low-latency, high-throughput serving infrastructure

Partner with ML engineers to ensure online serving infrastructure scales with growing model complexity and inference volumes, without compromising latency or throughput

Ensure the reliability, scalability, and efficiency of our systems in production using monitoring and observability tools like Prometheus and Grafana.

Manage and optimize cloud infrastructure on GCP, orchestrating workloads with Kubernetes across a high-scale production environment

Skills

Required

5+ years designing, deploying, and maintaining distributed systems at scale
Expertise in Golang for building high-performance, low-latency backend infrastructure
Hands-on experience with cloud infrastructure on GCP and workload orchestration with Kubernetes
Strong grounding in monitoring and observability tooling, including Prometheus and Grafana
Experience in ad tech, recommender systems, real-time personalization, or other performance-critical domains
Familiarity with microservice architectures, containerization (Docker), and CI/CD best practices
Familiarity with machine learning platforms, workflows, and serving infrastructure

Nice to have

Experience with ML inference servers like NVIDIA Triton Inference Server.
Familiarity with auction mechanics or bidding systems in an ad tech context.
Experience embracing AI as a strategic advantage in engineering, following established best practices for code quality and security.

The opportunity Every day, we connect billions of players with the games and experiences they love.

Our Vector Gamer AI team sits at the heart of that mission, governing ad ranking and bidding decisions across billions of daily impressions, where large-scale machine learning and real-world impact converge at scale.

We're hiring a Staff Backend Engineer to build and operate the infrastructure those models depend on. You'll design and operate the distributed systems that power billions of daily decisions, with a focus on the performance, reliability, and scalability of inference systems.

Join us and help influence how billions of gaming experiences are discovered, monetized, and how creators are rewarded.

What you'll be doing

Design, develop, and deploy production-grade backend services and distributed systems powering large-scale online model inference at billions of daily requests
Drive technical direction of our inference platform, with a focus on low-latency, high-throughput serving infrastructure
Partner with ML engineers to ensure online serving infrastructure scales with growing model complexity and inference volumes, without compromising latency or throughput
Ensure the reliability, scalability, and efficiency of our systems in production using monitoring and observability tools like Prometheus and Grafana.
Manage and optimize cloud infrastructure on GCP, orchestrating workloads with Kubernetes across a high-scale production environment
Promote and implement best practices for backend service development, testing, deployment, and monitoring (DevOps, SRE).

What we're looking for

5+ years designing, deploying, and maintaining distributed systems at scale
Expertise in Golang for building high-performance, low-latency backend infrastructure
Hands-on experience with cloud infrastructure on GCP and workload orchestration with Kubernetes
Strong grounding in monitoring and observability tooling, including Prometheus and Grafana
Experience in ad tech, recommender systems, real-time personalization, or other performance-critical domains
Familiarity with microservice architectures, containerization (Docker), and CI/CD best practices
Familiarity with machine learning platforms, workflows, and serving infrastructure

You might also have

Experience with ML inference servers like NVIDIA Triton Inference Server.
Familiarity with auction mechanics or bidding systems in an ad tech context.
Experience embracing AI as a strategic advantage in engineering, following established best practices for code quality and security.

Additional information

Relocation support is not available for this position

Benefits At Unity, we want our team members to thrive. We offer a wide range of benefits designed to support well-being and work-life balance.

Please note: Benefits eligibility, specific offerings, and coverage vary based on the country and employment status.

While specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program

Life at Unity Unity [NYSE: U] is the world’s leading game engine, powering play for more than 3 billion consumers each month. The top mobile games in the world, the most played PC indie titles, the most innovative console games, and virtually all of the top XR and Web Games are developed, deployed, and grown in Unity. Unity also enables teams across industries like automotive, manufacturing, and healthcare to design, simulate, and collaborate in 3D — closing the gap between ideas and reality. For more information, please visit www.unity.com.

Unity is a proud equal opportunity employer. We are committed to fostering an inclusive, innovative environment and celebrate our employees across age, race, color, ancestry, national origin, religion, disability, sex, gender identity or expression, sexual orientation, or any other protected status in accordance with applicable law. Our differences are strengths that enable us to support the growing and evolving needs of our customers, partners, and collaborators. If you have a disability that means there are preparations or accommodations we can make to help ensure you have a comfortable and positive interview experience, please fill out this form to let us know.

This position requires the incumbent to have a sufficient knowledge of English to have professional verbal and written exchanges in this language since the performance of the duties related to this position requires frequent and regular communication with colleagues and partners located worldwide and whose common language is English.

Headhunters and recruitment agencies may not submit resumes/CVs through this website or directly to managers. Unity does not accept unsolicited headhunter and agency resumes. Unity will not pay fees to any third-party agency or company that does not have a signed agreement with Unity.

Your privacy is important to us. Please take a moment to review our Prospect Privacy Policy and Applicant Privacy Policy. Should you have any concerns about your privacy, please contact us at DPO@unity.com.

#SEN #LI-AR1

*Note: Certain locations require a good faith disclosure of the base salary range for the role. The actual salary for the successful candidate may differ based on location, experience, and other job-related factors.

Gross pay salary

$192,600—$271,900 USD