Principal Core Infrastructure Engineer

Oracle · Enterprise · HYDERABAD, TELANGANA, India

Lead the architecture and development of highly scalable distributed systems for an Access Governance product, focusing on performance, fault tolerance, zero-downtime upgrades, system monitoring, resilience testing, and security compliance. This role involves complex problem-solving, automation, and mentoring.

What you'd actually do

Lead the architecture and development of horizontally and vertically scalable distributed systems powering a robust Access Governance product capable of handling hyper-scale data processing.
Optimize code and infrastructure for high-throughput data storage and retrieval.
Build fault-tolerant platforms designed to withstand network disruptions and support zero-downtime maintenance through redundancy, data replication, and automatic failover.
Establish comprehensive key performance indicators and build sophisticated telemetry, dashboards, and proactive alerting mechanisms to continuously monitor system health.
Proactively diagnose and resolve complex production issues while providing expert guidance during on-call incident response and root cause investigations.

Skills

Required

Architecture and development of scalable distributed systems
Optimization of code and infrastructure for high-throughput data storage and retrieval
Building fault-tolerant platforms
Zero-downtime maintenance strategies
Performance and load testing
System monitoring and telemetry
Incident response and root cause analysis
Automation tools and cloud infrastructure scripting
Security measures (encryption, access controls)
Compliance with industry standards and regulations
Project management and delegation
Mentoring junior engineers

Nice to have

Access Governance product development
Elastic computing environments
Fault-injection and brown-out testing
Load-shedding, throttling, rate-limiting
Cross-functional collaboration

What the JD emphasized

highly scalable distributed systems
massive workloads
complex software programs
high-performance platforms
unpredictable network failures
complex engineering challenges
heavy data traffic
seamless, zero-downtime upgrades
deep system monitoring
rigorous resilience testing
hands-on problem-solving
horizontally and vertically scalable distributed systems
hyper-scale data processing
high-throughput data storage and retrieval
elastic computing environments
rigorous performance and load testing
dynamic system demands
fault-tolerant platforms
network disruptions
zero-downtime maintenance
redundancy
data replication
automatic failover
advanced traffic management strategies
load-shedding
throttling
rate-limiting
strict service level objectives
comprehensive key performance indicators
sophisticated telemetry
dashboards
proactive alerting mechanisms
system health
complex testing scenarios
fault-injection
brown-outs
complex production issues
on-call incident response
root cause investigations
automation tools
cloud infrastructure scripts
safe patching
updates
seamless rollbacks
robust security measures
encryption
access controls
multi-tenant environments
strict compliance with industry standards and regulations
complex project timelines
efficiently delegating tasks
prioritizing workloads
multiple engineering initiatives
cross-functional stakeholders
technical solutions
core business objectives
continuous improvement
engineering workflows
advanced problem-solving strategies
elevate overall team capabilities
mentoring junior engineers
sharing industry best practices
actively participating in candidate evaluations
high-performing talent pipeline

Read full job description

Lead the architecture and development of highly scalable distributed systems built for massive workloads. Design, develop, troubleshoot, and debug complex software programs spanning databases, applications, tools, and network infrastructure. Engineer secure, high-performance platforms that remain operational even during unpredictable network failures. Tackle complex engineering challenges from optimizing heavy data traffic to ensuring seamless, zero-downtime upgrades. Drive operational excellence by championing deep system monitoring, rigorous resilience testing, and hands-on problem-solving, while mentoring the engineering team and setting technical standards.

As a member of the software engineering division, lead the development and architecture of horizontally and vertically scalable distributed systems powering a robust Access Governance product capable of handling hyper-scale data processing. Optimize code and infrastructure for high-throughput data storage and retrieval. Design elastic computing environments and execute rigorous performance and load testing to seamlessly meet dynamic system demands.

Build fault-tolerant platforms designed to withstand network disruptions and support zero-downtime maintenance through redundancy, data replication, and automatic failover. Implement advanced traffic management strategies, including load-shedding, throttling, and rate-limiting, to guarantee strict service level objectives. Establish comprehensive key performance indicators and build sophisticated telemetry, dashboards, and proactive alerting mechanisms to continuously monitor system health. Validate system correctness and data integrity through complex testing scenarios, such as fault-injection and brown-outs.

Proactively diagnose and resolve complex production issues while providing expert guidance during on-call incident response and root cause investigations. Develop and maintain automation tools and cloud infrastructure scripts to enable safe patching, updates, and seamless rollbacks. Implement robust security measures, including encryption and access controls, to protect data in multi-tenant environments while ensuring strict compliance with industry standards and regulations.

Manage complex project timelines by efficiently delegating tasks and prioritizing workloads across multiple engineering initiatives. Collaborate seamlessly with cross-functional stakeholders to ensure technical solutions directly align with core business objectives. Drive continuous improvement in engineering workflows and advanced problem-solving strategies. Elevate overall team capabilities by mentoring junior engineers, sharing industry best practices, and actively participating in candidate evaluations to build a high-performing talent pipeline.

Career Level - IC4