Senior Engineering Consultant-cloud & AI

Verizon Verizon · Telecom · Hyderabad, India +2

Senior AI Engineer role focused on designing, developing, and operationalizing production-grade Generative AI agents and applications for Verizon's vRAN automation platform. The role involves building LLM-powered pipelines, integrating AI agents with data sources and APIs, optimizing data ingestion, and ensuring deployability, observability, and maintainability within CI/CD frameworks. Key responsibilities include instrumenting and monitoring AI agent performance, maintaining documentation, and staying updated with the LLM ecosystem.

What you'd actually do

  1. Designing, developing, and deploying production-grade Gen AI agents and applications, with a focus on reliability, low latency, and real-world operability.
  2. Building and maintaining LangGraph agents and custom Python orchestration logic to power GenAI pipelines — enabling low-latency inference, context-aware decision-making, and multi-step agentic workflows.
  3. Integrate AI agents with internal data sources, Postgres databases, and REST API endpoints to give agents the context they need to act intelligently.
  4. Designing and optimizing data ingestion and preprocessing pipelines in Python to support LLM inference and grounding workflows (RAG, tool use, structured outputs).
  5. Collaborating with DevOps engineers to ensure AI agents and pipelines are deployable, observable, and maintainable within existing CI/CD and infrastructure frameworks.

Skills

Required

  • Python
  • LangChain
  • LangGraph
  • LlamaIndex
  • prompt engineering
  • RAG
  • tool/function calling
  • context windows
  • structured outputs
  • agent memory patterns
  • Postgres
  • REST APIs
  • Git
  • code review
  • testing
  • documentation

Nice to have

  • Claude Code
  • OpenAI Assistants
  • MLOps
  • model versioning
  • pipeline monitoring
  • experiment tracking
  • production observability for AI systems
  • Ansible
  • Jenkins
  • GitLab CI
  • Linux server experience
  • Shell scripting
  • Docker
  • Kubernetes
  • telecommunications
  • network operations
  • infrastructure automation
  • anomaly detection
  • log analysis
  • failure prediction
  • vector databases
  • pgvector
  • Pinecone
  • Weaviate
  • streaming architectures
  • event-driven architectures
  • Kafka
  • Redis

What the JD emphasized

  • production-grade Gen AI agents and applications
  • low latency
  • production environment
  • LLM-powered agents or applications
  • production-grade, testable code
  • LangGraph agents
  • agent orchestration logic
  • low-latency inference
  • multi-step agentic workflows
  • data ingestion and preprocessing pipelines
  • LLM inference
  • RAG
  • tool use
  • structured outputs
  • AI agents and pipelines are deployable, observable, and maintainable
  • instrument and monitor AI agent performance
  • latency, reliability, failure rates, and accuracy
  • agent architecture designs
  • integration specs
  • prompt strategies
  • operational runbooks
  • LLM and agent tooling ecosystem

Other signals

  • Generative AI agents
  • LLM-powered pipelines
  • production-grade AI solutions
  • low-latency inference
  • agent orchestration