Sr Staff Software Engineer - (Platform Engineering/Java/Temporal/SRE/Open Source) - *Hybrid*

GEICO · Insurance · Renton, WA +1

This role is for a Sr Staff Software Engineer focused on platform engineering, modernizing systems, and building highly scalable, available, and reliable platforms with a strong emphasis on DevOps, hybrid cloud, and incident management. The role involves driving technical roadmaps, leading design and architecture, championing best practices, and mentoring other engineers. While the company is GEICO, an insurance company that may use AI, this role's core responsibilities are in traditional software and platform engineering, not AI/ML model development or research.

What you'd actually do

  1. Develop and drive the overall technical roadmap for Digital Personal Auto Sales, aligning it with the organization's business goals and objectives.
  2. Work closely with executive leadership, tech teams, and other cross-discipline stakeholders to build an optimal strategy for delivering platform services.
  3. Leverage technical and domain expertise to influence partners and leadership to create a force multiplier in achieving milestones in the team’s technical roadmap.
  4. Provide thought leadership in GIA Platform, staying ahead of industry trends and emerging technologies to create an effective strategy that balances modernizing legacy platform components against minimizing business disruption.
  5. Lead the design and architecture of resilient and scalable platform services, considering both on-premises and cloud-based solutions.

Skills

Required

  • Fluency in at least one modern language (Go is preferred, Java is a plus)
  • Proven track record of designing, implementing, and maintaining highly scalable, available, and reliable systems in production
  • Understanding of security best practices and data encryption technology
  • Understanding of SQL and NoSQL databases, including stateful services management and storage
  • Understanding of networking, caches, key/value stores, load balancing, global load balancing, queues, DNS and CDN
  • Deep knowledge of DevOps practices, methodologies, and principles, along with a solid understanding of on-prem and public cloud-based network, compute, and storage technologies
  • In-depth knowledge of hybrid cloud architecture, IaaS and PaaS technologies, container orchestration platforms (e.g., Kubernetes), cloud efficiency, and observability
  • Strong background in incident management
  • Ability to create incident response playbooks, runbooks, incident triaging strategies, and post-incident analysis to drive continuous improvement in system reliability and availability
  • Experience with open-source management and monitoring tools
  • Experience with infrastructure automation, tooling, and configuration management frameworks
  • Hands-on experience with the Kubernetes container management system
  • Familiarity with cloud security best practices and compliance standards
  • Excellent leadership skills with a passion for mentoring and fostering professional growth
  • Strong problem-solving and analytical abilities, with a keen eye for detail and a passion for driving operational excellence
  • Visionary thinker with the ability to anticipate future challenges and opportunities
  • Exceptional leadership and communication skills
  • Strong analytical and problem-solving capabilities
  • Proven track record of successfully leading and building software in large and complex organizations

Nice to have

  • Go
  • Java

What the JD emphasized

  • high-performance, low-maintenance, zero-downtime platforms
  • highly scalable, available, and reliable systems
  • effective observability
  • resilient and scalable platform services
  • safe deployment processes
  • continuous, incremental delivery
  • incident response plans
  • operational excellence
  • highly scalable, available and reliable system in production
  • incident management
  • system reliability and availability
  • infrastructure automation