Principal Platform Software Engineer

Oracle Oracle · Enterprise · BENGALURU, KARNATAKA, India

Seeking a Principal Engineer to design and build Agentic AI capabilities for Oracle Cloud Infrastructure's Developer Tools. This role involves creating AI agents to assist developers with code, debugging, automation, and reasoning over cloud systems, utilizing LLMs, RAG, tool use, and orchestration. The position requires strong distributed systems experience, hands-on LLM application development, and leadership in designing safe and reliable AI execution patterns, including evaluation frameworks.

What you'd actually do

  1. Design, build, and operate Agentic AI-powered developer tools for the OCI Developer Tools organization.
  2. Develop AI agents that assist with code authoring, debugging, test generation, build failure analysis, deployment guidance, infrastructure automation, cloud diagnostics, and developer workflow optimization.
  3. Build systems that combine LLMs, retrieval-augmented generation, tool calling, workflow orchestration, code intelligence, structured outputs, and OCI service APIs.
  4. Create agent workflows that can reason across source code, SDKs, APIs, CLI commands, documentation, build logs, telemetry, repositories, deployment artifacts, and cloud resource metadata.
  5. Design safe and reliable agent execution patterns, including human-in-the-loop approval, guardrails, access control, audit logging, tool-use constraints, error recovery, and policy-aware automation.

Skills

Required

  • Bachelor's or Master’s degree in Computer Science, Computer Engineering, Artificial Intelligence, Machine Learning, or a related technical field, with 10+ years experience.
  • Strong professional experience designing and building large-scale distributed systems, developer platforms, cloud services, or enterprise software products.
  • Hands-on experience building applications using large language models, including prompt design, structured outputs, function calling, tool use, retrieval-augmented generation, or AI workflow orchestration.
  • Practical understanding of Agentic AI patterns, including planning, reasoning loops, task decomposition, tool invocation, memory, context management, agent state, and autonomous or semi-autonomous execution.
  • Strong programming experience in one or more languages such as Java, Python, Go, or similar.
  • Experience building developer-facing tools such as CLIs, SDKs, APIs, IDE extensions, build systems, CI/CD platforms, testing frameworks, observability tools, infrastructure-as-code tooling, or cloud development platforms.
  • Strong understanding of modern software development workflows, including source control, code review, testing, build automation, deployment pipelines, release management, and production operations.
  • Experience with cloud-native architecture, including microservices, APIs, containers, distributed systems, asynchronous workflows, authentication, authorization, and service observability.
  • Familiarity with AI/ML infrastructure components such as embedding models, vector databases, model serving, model evaluation, telemetry, and experimentation frameworks.
  • Ability to reason about risks in AI-powered developer tools, including incorrect code generation, hallucinated APIs, prompt injection, unsafe tool execution, data leakage, permission misuse, and unreliable automation.
  • Demonstrated ability to lead complex technical projects independently, influence architecture across teams, and deliver high-quality production systems.
  • Strong written and verbal communication skills, with the ability to explain complex technical decisions to engineering, product, and leadership audiences.

Nice to have

  • Experience building AI coding assistants, developer copilots, autonomous debugging agents, test generation systems, build failure analyzers, cloud troubleshooting agents, or AI-powered DevOps tools.
  • Experience with agent frameworks or orchestration technologies such as LangChain, LangGraph, CrewAI, or custom agent runtimes.
  • Experience with commercial or open-source LLM

What the JD emphasized

  • Agentic AI capabilities
  • design and build
  • large-scale distributed systems
  • developer platforms
  • cloud services
  • enterprise software products
  • large language models
  • prompt design
  • structured outputs
  • function calling
  • tool use
  • retrieval-augmented generation
  • AI workflow orchestration
  • Agentic AI patterns
  • planning
  • reasoning loops
  • task decomposition
  • tool invocation
  • memory
  • context management
  • agent state
  • autonomous or semi-autonomous execution
  • developer-facing tools
  • source control
  • code review
  • testing
  • build automation
  • deployment pipelines
  • release management
  • production operations
  • cloud-native architecture
  • microservices
  • APIs
  • containers
  • distributed systems
  • asynchronous workflows
  • authentication
  • authorization
  • service observability
  • embedding models
  • vector databases
  • model serving
  • model evaluation
  • telemetry
  • experimentation frameworks
  • risks in AI-powered developer tools
  • incorrect code generation
  • hallucinated APIs
  • prompt injection
  • unsafe tool execution
  • data leakage
  • permission misuse
  • unreliable automation
  • lead complex technical projects independently
  • influence architecture across teams
  • deliver high-quality production systems
  • AI coding assistants
  • developer copilots
  • autonomous debugging agents
  • test generation systems
  • build failure analyzers
  • cloud troubleshooting agents
  • AI-powered DevOps tools
  • agent frameworks
  • orchestration technologies

Other signals

  • Agentic AI capabilities
  • LLMs
  • RAG
  • tool use
  • workflow orchestration
  • enterprise-grade safety controls
  • evaluation frameworks