Machine Learning Engineer, Platform

Scale AI Scale AI · Data AI · London, United Kingdom · EPD

Machine Learning Engineer to build retrieval and knowledge representation systems for an enterprise-grade Generative AI platform. The role involves owning ML components end-to-end, from research to production, focusing on knowledge bases, vector stores, RAG pipelines, and context engines to power agents for enterprise customers.

What you'd actually do

  1. Own large areas of platform end to end, driving components from design through to production deployment.
  2. Work on knowledge representation systems, including ontologies and knowledge graphs, to support structured reasoning over enterprise data.
  3. Design and implement RAG pipelines, including chunking, embedding, indexing, retrieval, and reranking.
  4. Build and maintain integrations between retrieval and ML components and diverse enterprise data sources, vector databases, APIs, and services.
  5. Develop context retrieval systems that balance recall, precision, latency, and cost.

Skills

Required

  • Python
  • production-quality code
  • testable code
  • maintainable code
  • retrieval systems
  • RAG
  • embeddings
  • vector indexing
  • knowledge representation
  • semantic search
  • agentic systems

Nice to have

  • ontologies
  • knowledge graphs
  • structured reasoning
  • chunking
  • embedding
  • indexing
  • retrieval
  • reranking
  • context retrieval systems
  • recall
  • precision
  • latency
  • cost
  • evaluation frameworks
  • datasets
  • metrics
  • backend services
  • data pipelines
  • ML components
  • LLM components
  • experiments
  • new capabilities
  • high quality
  • tight feedback loops
  • customer-facing
  • cross-functional environments
  • Master’s or PhD degree in Computer Science, Machine Learning, AI, or equivalent practical experience
  • scaling or shipping products at high-growth startups
  • ambiguous problem spaces
  • research-driven approaches
  • pragmatic product constraints
  • communication skills

What the JD emphasized

  • 5+ years of experience building and deploying machine learning or AI systems for real-world, production use cases
  • deep, hands-on understanding of retrieval systems, RAG, embeddings, vector indexing, and knowledge representation
  • Experience with knowledge representation, semantic search, or agentic systems
  • Python, including writing production-quality, testable, and maintainable code
  • scaling or shipping products at high-growth startups
  • operate in ambiguous problem spaces, balancing research-driven approaches with pragmatic product constraints

Other signals

  • building and deploying machine learning or AI systems for real-world, production use cases
  • knowledge representation systems, including ontologies and knowledge graphs
  • RAG pipelines
  • context retrieval systems
  • evaluation frameworks, datasets, and metrics