Senior Core Infrastructure Engineer

Oracle Oracle · Enterprise · HYDERABAD, TELANGANA, India

Senior Core Infrastructure Engineer at Oracle, Hyderabad, India. Focuses on building and improving scalable distributed systems for data processing, ensuring reliability, performance, and security. Responsibilities include coding, rigorous testing, troubleshooting production issues, on-call support, platform security, infrastructure automation, and project management within a cloud environment. The role involves implementing core components of horizontally and vertically scalable distributed systems for an Access Governance product, optimizing code and system performance for large-scale data processing, and leveraging data plane platforms. Collaboration across engineering teams, fault-tolerant component design, recovery-oriented computing principles, and building monitoring/alerting systems are key. The role also includes diagnosing and resolving complex issues, participating in operational support, designing automation scripts, and applying security measures. Project timeline management, cross-team collaboration, and continuous learning are also emphasized.

What you'd actually do

  1. Build and improve scalable distributed systems capable of processing massive amounts of data.
  2. Write efficient code, run rigorous performance tests, and create highly reliable features that stay online during network issues and system updates.
  3. Troubleshoot live production problems, participate in on-call support duties, and set up proactive dashboards and alerts to catch errors early.
  4. Maintain platform security and infrastructure automation while independently managing project timelines, collaborating across teams, and continuously improving engineering processes.
  5. Implement and develop core components of horizontally and vertically scalable distributed systems powering a robust Access Governance product.

Skills

Required

  • distributed systems
  • data processing
  • performance testing
  • reliability engineering
  • scalability
  • fault tolerance
  • incident response
  • infrastructure automation
  • cloud infrastructure
  • security
  • access governance

Nice to have

  • recovery-oriented computing
  • circuit breakers
  • retries
  • timeouts
  • telemetry systems
  • fault-injection
  • brown-outs
  • Infrastructure as Code
  • change management protocols