Software Developer 5

Oracle Oracle · Enterprise · Nashville, TN +1

Software Developer 5 role at Oracle focused on building and operating state-of-the-art observability platforms for Oracle Cloud Infrastructure (OCI). The role involves designing, developing, and optimizing large-scale distributed systems for telemetry ingestion, processing, storage, and querying, serving both internal OCI services and external customers. Key responsibilities include leading architectural decisions, driving technical strategy, mentoring engineers, and ensuring reliability, scalability, and cost-efficiency of observability solutions.

What you'd actually do

  1. Lead the design, development, and operation of cloud-scale observability platforms supporting metrics, logs, traces, and related telemetry data.
  2. Architect and implement highly scalable, resilient, and cost-efficient telemetry collection, ingestion, processing, storage, and query systems.
  3. Drive the evolution of end-to-end observability pipelines, from instrumentation and data collection through real-time analytics and long-term retention.
  4. Design and optimize distributed systems capable of ingesting and processing massive volumes of telemetry data with stringent latency and availability requirements.
  5. Develop scalable storage and indexing solutions for high-cardinality metrics, large-scale log analytics, and distributed tracing workloads.

Skills

Required

  • Java
  • Go
  • C
  • C++
  • Python
  • Cloud scale products and services
  • Mutli-tenant services
  • Concurrent Programming
  • Open source technologies for development and management
  • Cloud technologies
  • Full product/service development and operations lifecycle
  • Strong communication and analytical skills
  • Able to adapt to fast changing requirements

Nice to have

  • Observability Solutions (metrics, logs, traces)
  • terraform

What the JD emphasized

  • cloud-scale observability platforms
  • massive scale
  • distributed systems
  • high-throughput telemetry ingestion
  • large-scale data processing
  • cost-efficient storage
  • low-latency query execution
  • multi-tenant reliability
  • operational excellence
  • cloud-native observability platforms
  • highly scalable, resilient, and cost-efficient telemetry collection, ingestion, processing, storage, and query systems
  • massive volumes of telemetry data with stringent latency and availability requirements
  • scalable storage and indexing solutions for high-cardinality metrics, large-scale log analytics, and distributed tracing workloads
  • Build and enhance query, search, and retrieval services that deliver fast, reliable, and intuitive access to observability data.
  • Identify and resolve performance bottlenecks across the observability stack, including ingestion, storage, indexing, aggregation, and query execution.
  • Design systems with a strong focus on reliability, fault tolerance, scalability, security, and operational excellence.
  • Drive technical strategy and architectural decisions for observability services operating at hyperscale cloud environments.