Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit www.redditinc.com.

Team The ML Indexing & Retrieval Platform team at Reddit is responsible for building and scaling the core infrastructure that powers machine learning driven recommendations. We design and maintain systems for ML data ingestion, low-latency retrieval services, and end-to-end lifecycle management of data. With a focus on performance, reliability, and scalability, we enable real-time access to high-quality data that supports a wide range of applications, including Content Understanding, Semantic, Lexical retrieval & GenAI applications.

How You'll Have Impact

You’ll lead the development of next-generation ML Indexing & Retrieval systems, owning the full lifecycle from ideation to production and going beyond incremental improvements to reimagine core platform capabilities. As part of a high-impact, cross-functional team, you’ll solve complex technical challenges to build scalable, reliable platforms that empower developers to efficiently ship critical ML features.

Languages: Go, Java, Python, or any object oriented programming language

**Frameworks: Flink, Airflow, **Spark for large scale batch & stream processing

Databases: Familiarity with Vector, Lexical & Key-Value Databases

Tools: Kubernetes, Docker, AWS, GCP

What You’ll Do

Lead the technical strategy, architecture, and implementation of Reddit’s next-generation ML Indexing & Retrieval engine, integrating capabilities across lexical and vector indexing, low-latency retrieval, and emerging GenAI applications.
Partner closely with product engineers across Content Understanding, Search, Feeds, Ads, Growth, and Safety to deliver high-quality experiences.
Define best practices for observability, reliability, and operational excellence in large-scale distributed systems.
Mentor and guide engineers in designing scalable infrastructure and adopting robust DevOps and SRE principles.
Collaborate with infrastructure, and ML teams to ensure the platform evolves to meet the needs of Reddit’s growing user base and diverse content ecosystem.

Who You Might Be:

10+ years of experience in software engineering, specializing in Indexing and Retrieval systems.
3+ years in technical leadership, architecting and scaling distributed systems in production environments.
Deep expertise in large-scale data platforms, including batch indexing and stream processing.
Proven experience designing and operating large-scale, low-latency retrieval services.
Expertise in lexical and vector search retrieval technologies, such as Milvus, Vespa, or Elasticsearch.
Skilled in designing cloud-native architectures and managing containerized workloads using Kubernetes and AWS/GCP.
Adept at translating complex technical challenges into clear, actionable strategies.
Strong communicator and mentor who leads through collaboration, influence, and technical excellence.

Benefits:

Comprehensive Healthcare Benefits and Income Replacement Programs
401k with Employer Match
Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support
Family Planning Support
Gender-Affirming Care
Mental Health & Coaching Benefits
Flexible Vacation & Paid Volunteer Time Off
Generous Paid Parental Leave

#LI-Remote

Pay Transparency:

This job posting may span more than one career level.

In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit https://www.redditinc.com/careers/.

To provide greater transparency to candidates, we share base salary ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar stage growth companies. Final offer amounts are determined by multiple factors including, skills, depth of work experience and relevant licenses/credentials, and may vary from the amounts listed below.

The base salary range for this position is:

$279,200—$390,900 USD

In select roles and locations, the interviews will be recorded, transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording, transcription and summarization prior to any scheduled interviews.

During the interview, we will collect the following categories of personal information: Identifiers, Professional and Employment-Related Information, Sensory Information (audio/video recording), and any other categories of personal information you choose to share with us. We will use this information to evaluate your application for employment or an independent contractor role, as applicable. We will not sell your personal information or disclose it to any third party for their marketing purposes. We will delete any recording of your interview promptly after making a hiring decision. For more information about how we will handle your personal information, including our retention of it, please refer to our Candidate Privacy Policy for Potential Employees and Contractors.

Reddit is proud to be an equal opportunity employer, and is committed to building a workforce representative of the diverse communities we serve. Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If, due to a disability, you need an accommodation during the interview process, please let your recruiter know.

How You'll Have Impact

Languages: Go, Java, Python, or any object oriented programming language

**Frameworks: Flink, Airflow, **Spark for large scale batch & stream processing

Databases: Familiarity with Vector, Lexical & Key-Value Databases

Tools: Kubernetes, Docker, AWS, GCP

What You’ll Do

Lead the technical strategy, architecture, and implementation of Reddit’s next-generation ML Indexing & Retrieval engine, integrating capabilities across lexical and vector indexing, low-latency retrieval, and emerging GenAI applications.
Partner closely with product engineers across Content Understanding, Search, Feeds, Ads, Growth, and Safety to deliver high-quality experiences.
Define best practices for observability, reliability, and operational excellence in large-scale distributed systems.
Mentor and guide engineers in designing scalable infrastructure and adopting robust DevOps and SRE principles.
Collaborate with infrastructure, and ML teams to ensure the platform evolves to meet the needs of Reddit’s growing user base and diverse content ecosystem.

Who You Might Be:

10+ years of experience in software engineering, specializing in Indexing and Retrieval systems.
3+ years in technical leadership, architecting and scaling distributed systems in production environments.
Deep expertise in large-scale data platforms, including batch indexing and stream processing.
Proven experience designing and operating large-scale, low-latency retrieval services.
Expertise in lexical and vector search retrieval technologies, such as Milvus, Vespa, or Elasticsearch.
Skilled in designing cloud-native architectures and managing containerized workloads using Kubernetes and AWS/GCP.
Adept at translating complex technical challenges into clear, actionable strategies.
Strong communicator and mentor who leads through collaboration, influence, and technical excellence.

Benefits:

Comprehensive Healthcare Benefits and Income Replacement Programs
401k with Employer Match
Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support
Family Planning Support
Gender-Affirming Care
Mental Health & Coaching Benefits
Flexible Vacation & Paid Volunteer Time Off
Generous Paid Parental Leave

#LI-Remote

Pay Transparency:

This job posting may span more than one career level.

The base salary range for this position is:

$279,200—$390,900 USD

Senior Staff Software Engineer, Indexing & Retrieval Platform

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

How You'll Have Impact

What You’ll Do

Who You Might Be:

How You'll Have Impact

What You’ll Do

Who You Might Be: