Principal Engineer - Quality Engineering

Eli Lilly Eli Lilly · Pharma · Hyderabad, India

Principal Engineer - Quality Engineering role focused on designing and building evaluation frameworks for AI platforms (LLM chat/voice bots) and robust test automation for API, web, and mobile applications. Requires hands-on Python, Java, JavaScript/TypeScript for test infrastructure, CI/CD integration, and advanced AI testing methodologies. Also involves performance testing and cloud-based execution.

What you'd actually do

  1. Designs and builds end-to-end evaluation frameworks for AI platforms including LLM-based chat bots and voice bots , covering accuracy, relevance, hallucination detection, latency, and response quality.
  2. Implements AI model evaluation pipelines using RAGAS, DeepEval, and LangChain to benchmark and validate LLM outputs against ground truth.
  3. Architects and builds robust, scalable, and maintainable test automation frameworks for API (REST/SOAP), web, and mobile applications using pytest, Selenium, WebdriverIO, and Appium.
  4. Develops test strategies for conversational AI — validating intent recognition, slot filling, dialogue flow, fallback handling, and multi-turn context retention.
  5. Builds voice bot quality validation covering speech recognition accuracy, TTS quality, call flow logic, and DTMF handling.

Skills

Required

  • Python
  • Java
  • JavaScript/TypeScript
  • pytest
  • Selenium
  • WebdriverIO
  • Appium
  • API testing
  • web testing
  • mobile testing
  • CI/CD integration
  • performance testing
  • cloud-based test execution
  • AWS
  • Kubernetes
  • Docker
  • RAGAS
  • DeepEval
  • LangChain

Nice to have

  • Go
  • Rust
  • GitHub Actions
  • Jenkins
  • Bamboo
  • JMeter

What the JD emphasized

  • deep expertise in building evaluation frameworks for AI platforms
  • AI quality validation
  • framework architecture
  • end-to-end evaluation frameworks for AI platforms
  • AI model evaluation pipelines
  • conversational AI
  • voice bot quality validation

Other signals

  • AI platform evaluation frameworks
  • LLM testing
  • conversational AI testing
  • voice bot quality validation
  • test automation frameworks