What you'd actually do

Design, build, and iterate on components for ranking, retrieval, and recommendations that improve measurable relevance and latency.

Implement production features leveraging embeddings, semantic/hybrid search, and LLM-enabled retrieval under mentorship and design guidance.

Contribute to offline/online evaluation, A/B tests, and relevance tuning using metrics such as NDCG, MRR, and precision@k.

Develop reliable, observable microservices and near real-time indexing pipelines across distributed systems.

Own well-scoped projects from design to rollout, writing clear design docs, tests, and operational runbooks.

Skills

Required

3+ years of industry experience building backend or distributed systems
production ownership of services or data pipelines
Proficient in at least one of: Java, Scala, C++, or Python
Experience with data pipelines, message queues, or streaming systems (e.g., Kafka, Pub/Sub) and near real-time processing.
Familiarity with cloud-native microservices, CI/CD, observability, and performance tuning.
BS in Computer Science or related field, or equivalent practical experience.

Nice to have

comfortable writing production-grade Python
Exposure to search, ranking, recommendations, or applied ML in production
understand the basics of training-to-serving workflows
Experience with Elasticsearch, Solr, Lucene, or custom search systems; understanding of inverted indexes and scoring functions.
Knowledge of relevance tuning, learning-to-rank concepts, and offline/online experimentation practices.
Exposure to vector search, dense/sparse embeddings, and hybrid retrieval architectures.
Familiarity with IR fundamentals (BM25, TF-IDF, multi-stage retrieval) and query understanding.
Experience with Kubernetes/Terraform and a major cloud (GCP/AWS/Azure).
Practical exposure to PyTorch or TensorFlow; LLM familiarity helpful but not required.

What the JD emphasized

production ownership of services or data pipelines

production-grade Python

applied ML in production

near real-time processing

performance tuning

custom search systems

relevance tuning

learning-to-rank concepts

offline/online experimentation practices

vector search

dense/sparse embeddings

hybrid retrieval architectures

IR fundamentals

query understanding

WHAT IS BOX?

Box (NYSE:BOX) is the leader in Intelligent Content Management. Our platform enables organizations to fuel collaboration, manage the entire content lifecycle, secure critical content, and transform business workflows with enterprise AI. We help companies thrive in the new AI-first era of business. Founded in 2005, Box simplifies work for leading global organizations, including JLL, Morgan Stanley, and Nationwide. Box is headquartered in Redwood City, CA, with offices across the United States, Europe, and Asia.

By joining Box, you will have the unique opportunity to continue driving our platform forward. Content powers how we work. It’s the billions of files and information flowing across teams, departments, and key business processes every single day: contracts, invoices, employee records, financials, product specs, marketing assets, and more. Our mission is to bring intelligence to the world of content management and empower our customers to completely transform workflows across their organizations. With the combination of AI and enterprise content, the opportunity has never been greater to transform how the world works together and at Box you will be on the front lines of this massive shift.

The Search Relevance team at Box powers discovery across billions of files, enabling customers to find the right content quickly, securely, and intelligently. As we expand into a new era of AI-powered content understanding, we’re investing in the foundation that makes great search possible: reliable systems, strong signals, and models that learn from real-world usage.

This is a rare opportunity to work at the intersection of information retrieval science, applied machine learning, and large-scale distributed systems. You’ll be building the infrastructure that powers intelligent content discovery for Fortune 500 companies—where milliseconds matter, relevance is measurable, and your experiments directly impact how millions of users work.

We’re looking for a Machine Learning Engineer III to improve search quality end-to-end—signals, ranking, retrieval, and evaluation—while building scalable, low-latency services that serve queries in real time. You’ll collaborate with senior engineers, Product, Data, and Infra partners to productionize modern retrieval techniques and experimentation frameworks that directly impact how millions of users work.

WHAT YOU’LL DO

Design, build, and iterate on components for ranking, retrieval, and recommendations that improve measurable relevance and latency.
Implement production features leveraging embeddings, semantic/hybrid search, and LLM-enabled retrieval under mentorship and design guidance.
Contribute to offline/online evaluation, A/B tests, and relevance tuning using metrics such as NDCG, MRR, and precision@k.
Develop reliable, observable microservices and near real-time indexing pipelines across distributed systems.
Own well-scoped projects from design to rollout, writing clear design docs, tests, and operational runbooks.
Improve data and feature pipelines (batch/streaming) to ensure quality, freshness, and end-to-end performance.
Document patterns and contribute to team best practices that raise the bar on code quality and reliability.
Participate in our on-call rotation, available at all times while on-call to help respond to and triage any issues that arise.

WHO YOU ARE

3+ years of industry experience building backend or distributed systems, with production ownership of services or data pipelines.
Proficient in at least one of: Java, Scala, C++, or Python; comfortable writing production-grade Python is a plus.
Exposure to search, ranking, recommendations, or applied ML in production; understand the basics of training-to-serving workflows.
Experience with data pipelines, message queues, or streaming systems (e.g., Kafka, Pub/Sub) and near real-time processing.
Familiarity with cloud-native microservices, CI/CD, observability, and performance tuning.
BS in Computer Science or related field, or equivalent practical experience.
Pragmatic, metrics-driven mindset—eager to experiment, measure impact, and iterate quickly in collaboration with partners.

Preferred

Experience with Elasticsearch, Solr, Lucene, or custom search systems; understanding of inverted indexes and scoring functions.
Knowledge of relevance tuning, learning-to-rank concepts, and offline/online experimentation practices.
Exposure to vector search, dense/sparse embeddings, and hybrid retrieval architectures.
Familiarity with IR fundamentals (BM25, TF-IDF, multi-stage retrieval) and query understanding.
Experience with Kubernetes/Terraform and a major cloud (GCP/AWS/Azure).
Practical exposure to PyTorch or TensorFlow; LLM familiarity helpful but not required.

Box lives its values, with community and in-person collaboration being a core part of our culture. Boxers are expected to work from their assigned office a minimum of 3 days per week.Your Recruiter will share more about how we work and company culture during the hiring process.

At Box, we believe unique and diverse experiences benefit our culture, our products, our customers, our company, and our world. We aim to recruit a passionate, high-performing workforce that reflects the world we live in.** **If you are head-over-heels about this role but unsure if you meet all the requirements, we encourage you to apply!

EQUAL OPPORTUNITY

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability, and any other protected ground of discrimination under applicable human rights legislation. Box strives to respect the dignity and ‎‎independence of people with disabilities and is committed to giving them the same ‎‎opportunity to succeed as all other employees. Inclusiveness is core to our culture at Box, and we strive to ensure you get the most from your interview experience.

Box makes reasonable accommodations for applicants with disabilities. If a reasonable accommodation is needed to participate in the job application or interview process, please complete this form. Reasonable accommodations may include scheduling adjustments, document dictation and beyond.

Notice to applicants in Los Angeles: Box, Inc and its related branches will consider for employment, qualified applicants with criminal histories in a manner consistent with the Los Angeles Fair Chair Ordinance. The Fair Chance Ordinance is provided here.

Notice to applicants in San Francisco: Box, Inc and its related branches will consider for employment, qualified applicants with criminal histories in a manner consistent with the San Francisco Fair Chair Ordinance. The Fair Chance Ordinance is provided here.

For details on how we protect your information when you apply, please see our Personnel Privacy Notice. If you are a California-resident, please read our California Applicant & Candidate Privacy Notice here.

Box is committed to fair and equitable compensation practices. Actual base salary (or OTE if commissionable role) is dependent upon factors such as: knowledge, skill level, experience, and work location. This role is also eligible for equity and benefits. For more information, check out our benefits and perks. __

__In accordance with OFCCP compliance, here is the Pay Transparency Provision. __

United States Pay Range

$175,500—$219,500 USD

WHAT IS BOX?

WHAT YOU’LL DO

Design, build, and iterate on components for ranking, retrieval, and recommendations that improve measurable relevance and latency.
Implement production features leveraging embeddings, semantic/hybrid search, and LLM-enabled retrieval under mentorship and design guidance.
Contribute to offline/online evaluation, A/B tests, and relevance tuning using metrics such as NDCG, MRR, and precision@k.
Develop reliable, observable microservices and near real-time indexing pipelines across distributed systems.
Own well-scoped projects from design to rollout, writing clear design docs, tests, and operational runbooks.
Improve data and feature pipelines (batch/streaming) to ensure quality, freshness, and end-to-end performance.
Document patterns and contribute to team best practices that raise the bar on code quality and reliability.
Participate in our on-call rotation, available at all times while on-call to help respond to and triage any issues that arise.

WHO YOU ARE

3+ years of industry experience building backend or distributed systems, with production ownership of services or data pipelines.
Proficient in at least one of: Java, Scala, C++, or Python; comfortable writing production-grade Python is a plus.
Exposure to search, ranking, recommendations, or applied ML in production; understand the basics of training-to-serving workflows.
Experience with data pipelines, message queues, or streaming systems (e.g., Kafka, Pub/Sub) and near real-time processing.
Familiarity with cloud-native microservices, CI/CD, observability, and performance tuning.
BS in Computer Science or related field, or equivalent practical experience.
Pragmatic, metrics-driven mindset—eager to experiment, measure impact, and iterate quickly in collaboration with partners.

Preferred

Experience with Elasticsearch, Solr, Lucene, or custom search systems; understanding of inverted indexes and scoring functions.
Knowledge of relevance tuning, learning-to-rank concepts, and offline/online experimentation practices.
Exposure to vector search, dense/sparse embeddings, and hybrid retrieval architectures.
Familiarity with IR fundamentals (BM25, TF-IDF, multi-stage retrieval) and query understanding.
Experience with Kubernetes/Terraform and a major cloud (GCP/AWS/Azure).
Practical exposure to PyTorch or TensorFlow; LLM familiarity helpful but not required.

EQUAL OPPORTUNITY

__In accordance with OFCCP compliance, here is the Pay Transparency Provision. __

United States Pay Range

$175,500—$219,500 USD

Machine Learning Engineer Iii, Search Relevance

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

WHAT YOU’LL DO

WHO YOU ARE

WHAT YOU’LL DO

WHO YOU ARE