Lead Product Engineer (Agentic AI for Observability) - Remote

Allstate · Insurance · IL · Remote

Lead Product Engineer for an AI-powered Observability platform, focusing on agentic AI for anomaly detection, root cause analysis, and automated remediation in hybrid/multi-cloud environments. The role involves designing and building scalable platforms, intelligent automation, and developer-centric tooling using modern software engineering practices and technologies such as Java, Python, and Node.js.

What you'd actually do

  1. Design and build agentic AI solutions for observability, including autonomous agents capable of anomaly detection, root cause analysis, and automated remediation.
  2. Embed AI/ML capabilities into observability platforms to enable predictive insights, anomaly detection, and intelligent alerting.
  3. Develop AIOps frameworks that reduce noise, automate workflows, and improve incident response times.
  4. Leverage LLMs and AI orchestration frameworks to create self-service diagnostics and operational assistants.
  5. Architect and implement end-to-end observability solutions across logs, metrics, traces, and events.

Skills

Required

  • 5+ years of engineering experience building software, platforms, or automation solutions, with a strong emphasis on observability and distributed systems.
  • 4+ years of hands-on experience with observability platforms (Datadog, Dynatrace, OTEL).
  • 3+ years of software development experience using Java (Spring Boot), Python, Node.js, or React.
  • 2+ years of hands-on experience building agentic AI systems (autonomous agents, AI workflows, or AI-native applications).
  • Proven ability to design and implement AI-driven automation and AIOps solutions.
  • Experience integrating LLMs, orchestration frameworks, or AI pipelines into production systems.
  • Strong experience with Kubernetes and cloud-native architectures.
  • Hands-on experience in hybrid environments (on-prem + cloud) across Linux and Windows.
  • Expertise in API development, event-driven systems, and microservices architecture.
  • Familiarity with CI/CD pipelines, infrastructure-as-code, and DevOps practices.
  • Strong problem-solving skills with a focus on automation and innovation.
  • Excellent communication skills to articulate complex technical solutions.

Nice to have

  • Passion for emerging technologies, particularly AI in observability and operations.

What the JD emphasized

  • agentic AI
  • observability
  • AI-driven automation
  • AIOps
  • LLMs
  • autonomous agents
  • hybrid environments

Other signals

  • agentic AI solutions
  • autonomous agents
  • AI orchestration frameworks
  • AIOps frameworks
  • LLMs