Senior ML & AI Technical Solutions Engineer

Databricks Databricks · Data AI · Bangalore, India · Support

Senior ML & AI Technical Solutions Engineer role focused on helping customers debug and maintain GenAI and ML workloads with AI agent systems on the Databricks Platform. Responsibilities include troubleshooting production workloads, optimizing performance, diagnosing LLM deployments, guiding customers on generative AI use cases, and collaborating with internal teams. Requires expertise in Python, Scala, Java, distributed systems, ML frameworks, LLM applications, agentic frameworks, MLOps, and LLMOps.

What you'd actually do

  1. Act as senior technical solution expert for complex issues spanning data pipelines, ML pipelines and/or AI applications, applying deep expertise in distributed systems.
  2. Analyse and troubleshoot production workloads at the code level, optimise for performance, reliability, latency, and cost.
  3. Diagnose and support Machine Learning and/or Large Language Model deployments, including real-time and batch inference, autoscaling, monitoring, logging, and alerting. Serve as a Subject Matter Expert guiding customers on experiment tracking, model registry, versioning, evaluation, labelling, tracing, and lifecycle observability.
  4. Provide high-quality support by guiding customers in leveraging Databricks AI to solve generative AI use cases & challenges, leveraging LLMs, MCP, AI Agents, RAG/Agentic RAG, APIs, vector embeddings, semantic search, Vector Search/Lakebase databases, context orchestration, memory management, and prompt engineering.
  5. Collaborate with internal teams to influence roadmap, product improvements and support business growth.

Skills

Required

  • Python
  • Scala
  • Java
  • distributed systems
  • Machine Learning
  • generative AI
  • LLM applications
  • agentic frameworks
  • LangChain
  • LangGraph
  • context orchestration
  • prompt design
  • memory management
  • retrieval systems
  • vector embeddings
  • semantic search
  • tool integrations
  • MLOps
  • LLMOps
  • model evaluation
  • model scoring
  • model ranking
  • model optimisation
  • model training
  • model validation
  • model packaging
  • Apache Spark
  • feature engineering
  • ML frameworks
  • model monitoring
  • drift detection
  • retraining strategies
  • deep learning
  • NLP techniques

Nice to have

  • Databricks
  • AWS
  • Azure
  • GCP
  • developing agent skills
  • plugins
  • debugging with native AI capabilities
  • customer-facing experience
  • customer service skills
  • Data Scientist
  • ML Engineer
  • AI Engineer

What the JD emphasized

  • production troubleshooting
  • production workloads
  • MLOps
  • LLMOps
  • agentic frameworks
  • agent skills

Other signals

  • customer support
  • troubleshooting
  • GenAI
  • MLOps
  • LLMOps