Senior/staff Software Engineer, Search & Retrieval Infrastructure

Pinecone Pinecone · Data AI · Tel Aviv, Israel · R&D

Senior/Staff Software Engineer to design and build core components of a next-generation knowledge retrieval system for AI applications. Focus on search and retrieval infrastructure powering scalable, enterprise-grade agentic systems, connecting knowledge to LLM-powered applications using a vector DB for semantic and hybrid retrieval. Role involves backend system architecture, distributed systems, and applied AI infrastructure.

What you'd actually do

  1. Design and build scalable platform components leveraging advanced retrieval via query planning, semantic and hybrid search, metadata-aware search, and LLM generation
  2. Design and build optimized indexing pipelines for structured and unstructured data
  3. Build backend services for semantic and hybrid retrieval, knowledge graph construction, and retrieval orchestration
  4. Improve retrieval quality through evaluation and observability frameworks
  5. Design APIs for internal and external user and agentic consumers

Skills

Required

  • backend system architecture
  • distributed systems
  • high throughput
  • low latency
  • long-term maintainability
  • high-throughput indexing pipelines
  • unstructured data
  • structured schemas
  • semantic search
  • vector databases
  • hybrid retrieval strategies
  • traditional search engines
  • RAG
  • embedding pipelines
  • query planning
  • metadata filtering
  • Go
  • Rust
  • C++
  • Java
  • Python
  • Kubernetes
  • cloud-native architectures
  • observability frameworks
  • Terraform
  • Pulumi
  • product thinking
  • design clean, intuitive APIs

Nice to have

  • multi-tenant SaaS platforms
  • retrieval evaluation frameworks
  • agentic reasoning loops

What the JD emphasized

  • shipping production-grade backends for large-scale systems
  • high-throughput indexing pipelines
  • semantic search
  • vector databases
  • hybrid retrieval strategies
  • RAG
  • embedding pipelines
  • hybrid search techniques
  • query planning
  • metadata filtering

Other signals

  • vector database
  • semantic search
  • hybrid retrieval
  • agentic systems
  • LLM-powered applications