Staff Engineer – Reliability Engineering (hybrid)

GEICO GEICO · Insurance · Bethesda, MD +1

Staff Engineer focused on reliability engineering for large-scale systems, including cloud-native applications, monitoring, and incident management. The role involves coding, system design, and mentoring, with a focus on ensuring the quality, performance, and usability of engineering solutions. Experience with programming languages, container orchestration, cloud platforms, and observability tools is required. Knowledge in ML and AI technologies is a plus.

What you'd actually do

  1. Focus on multiple areas and provide strategic and technical guidance
  2. Utilize programming languages like Go, Python, Java, .Net or other object-oriented languages, SQL, and NoSQL databases.
  3. Work with container orchestration tools such as Docker and Kubernetes (K8S), OpenStack and a variety of Azure tools and services.
  4. Architect and develop cloud-native applications using Azure Services.
  5. Collaborate with product managers, team members, customers, and other engineering teams to solve our toughest problems

Skills

Required

  • Go
  • Python
  • Java
  • .NET
  • SQL
  • NoSQL
  • Docker
  • Kubernetes
  • Azure
  • system design
  • data pipeline
  • cloud platforms
  • distributed systems
  • monitoring
  • logging
  • observability tools
  • Terraform
  • Ansible
  • CI/CD
  • Infrastructure as Code
  • cloud-based deployments
  • software development life cycle
  • problem-solving mindset

Nice to have

  • HTML
  • CSS
  • JavaScript
  • MySQL
  • PostgreSQL
  • ONgDB
  • Cassandra
  • MongoDB
  • Elasticsearch
  • GCP
  • AWS
  • Helm
  • Puppet
  • Spinnaker
  • ML
  • AI technologies