Senior Site Reliability Engineer

Caterpillar · Industrial · Kosice, Slovakia

Senior Site Reliability Engineer responsible for ensuring the reliability, availability, and performance of Cat Digital Mission Critical web-based systems and infrastructure. This role involves delivering technical roadmaps, analyzing business requirements, conducting risk assessments, and communicating client feedback. Requires experience in SRE, DevOps, QA, AWS, IaC, CI/CD, and programming languages like Python or JavaScript. Experience with monitoring, alerting, and QA testing tools is a plus.

What you'd actually do

  1. Deliver technical roadmaps on business processes and applications to support business objectives, standards and processes.
  2. Analyze business requirements, current technical gaps needed to support business strategies and expectations.
  3. Play a key role in conducting risk assessments, root cause analysis, corrective actions, quality assurance processes, and routine issue resolution.
  4. Communicate client feedback to technology teams in order to improve deliverables and meet business requirements.

Skills

Required

  • site reliability engineering
  • DevOps
  • QA
  • AWS infrastructure and services
  • IaC solutions like CloudFormation and Terraform
  • CI/CD solutions - Github, Azure DevOps
  • Python
  • JavaScript
  • containerization technologies, such as Docker and Kubernetes

Nice to have

  • monitoring and alerting solutions such as Thousand Eyes, Grafana, AppDynamics, Datadog, New Relic, Dynatrace
  • QA testing principles and tools - SonarQube, Firebase, JMeter, K6, Selenium, Playwright, Lighthouse
  • ITIL and/or ITSM process
  • node.js, next.js and similar headless services
  • tracing for modern applications – opentelemetry and similar
  • AWS certification associate or higher level