Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit www.redditinc.com.

The Embedding Machine Learning Platform team is at the forefront of building highly expressive machine learning models that power Reddit’s recommendation systems. We go beyond standard retrieval and ranking architectures, leveraging modern deep learning approaches and scalable model designs to enhance personalization across Reddit’s ecosystem. Our work impacts content discovery, user engagement, and platform growth at a massive scale.

How You'll Have Impact

As a Staff Machine Learning Engineer, you will own the technical direction for large-scale machine learning models, guiding the development of advanced deep learning architectures and high-impact ML systems. You will partner with leadership to define ML roadmaps, drive innovation in scalable model design and training approaches, and ensure efficient, reliable deployment of ML models in production. This role offers an opportunity to influence key AI-driven systems across Reddit while mentoring and uplifting the team’s technical capabilities.

What You’ll Do

Architect and lead the development of next-generation, large-scale machine learning techniques.
Define and execute the ML strategy, identifying opportunities to enhance personalization and recommendation quality across Reddit.
Lead research initiatives on scalable machine learning systems and real-time model adaptation, bringing cutting-edge advancements into production.
Partner with ML infrastructure teams to build high-performance, distributed training systems that efficiently scale across multiple GPUs and cloud environments.
Establish and optimize real-time serving architectures for large-scale embeddings, ensuring low-latency inference and high throughput.
Collaborate cross-functionally with teams in Feed Ranking, Ads, Content Understanding, and Core ML to integrate ML models into Reddit’s key AI-driven systems.
Mentor and guide senior and mid-level ML engineers, fostering a culture of excellence, innovation, and knowledge sharing.
Stay at the forefront of AI research, evaluating and introducing new modeling paradigms to keep Reddit’s ML ecosystem cutting-edge.
Drive technical discussions, present findings to leadership, and contribute to long-term ML planning and decision-making.

Who You Might Be:

8+ years of experience in machine learning engineering, with a strong focus on large-scale ML systems and recommendation or personalization systems.
Expertise in modern deep learning architectures, including sequence models and foundation models.
Deep understanding of complex multi-entity relationships in machine learning applications and how they are modeled in large-scale systems.
Proven ability to design, implement, and optimize scalable ML architectures, from distributed training to real-time inference.
Strong software engineering skills in Python, C++, or similar languages, with experience in ML infrastructure, high-performance computing, and cloud-based ML pipelines.
Demonstrated leadership in driving ML strategy, mentoring engineers, and influencing cross-functional teams.
Experience with A/B testing, model evaluation frameworks, and real-time feedback loops in large-scale production systems.
Excellent communication skills, with the ability to effectively present complex ML concepts to technical and non-technical stakeholders.

Benefits:

Comprehensive Healthcare Benefits and Income Replacement Programs
401(k) with Employer Match
Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support
Family Planning Support
Gender-Affirming Care
Mental Health & Coaching Benefits
Flexible Vacation & Paid Volunteer Time Off
Generous Paid Parental Leave

#LI-Remote

Pay Transparency:

This job posting may span more than one career level.

In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, a 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit https://www.redditinc.com/careers/.

To provide greater transparency to candidates, we share base salary ranges for all U.S.-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar-stage growth companies. Final offer amounts are determined by multiple factors, including skills, depth of work experience, and relevant licenses/credentials, and may vary from the amounts listed below.

The base salary range for this position is:

$253,300–$354,600 USD

In select roles and locations, the interviews will be recorded, transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording, transcription and summarization prior to any scheduled interviews.

During the interview, we will collect the following categories of personal information: Identifiers, Professional and Employment-Related Information, Sensory Information (audio/video recording), and any other categories of personal information you choose to share with us. We will use this information to evaluate your application for employment or an independent contractor role, as applicable. We will not sell your personal information or disclose it to any third party for their marketing purposes. We will delete any recording of your interview promptly after making a hiring decision. For more information about how we will handle your personal information, including our retention of it, please refer to our Candidate Privacy Policy for Potential Employees and Contractors.

Reddit is proud to be an equal opportunity employer, and is committed to building a workforce representative of the diverse communities we serve. Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If, due to a disability, you need an accommodation during the interview process, please let your recruiter know.