Software Engineer Iii, Infrastructure, Chronicle Soar

Google Google · Big Tech · Ramat Gan, Israel

Software Engineer III, Infrastructure, Chronicle SOAR at Google. This role focuses on enhancing the reliability, performance, and security of Chronicle SOAR by building, automating, and maintaining systems and tools that operate at Google scale. Responsibilities include designing and building software for reliability and performance, creating automation for operational workflows, developing infrastructure configurations in Go, implementing monitoring/logging/tracing, and analyzing incidents to build preventative software solutions.

What you'd actually do

  1. Design, build, and maintain software and systems that enhance the reliability, availability, and performance of Chronicle SOAR.
  2. Create and improve software tools, platforms, and services that automate complex operational workflows, reduce manual toil, and improve the efficiency of managing Chronicle SOAR at scale.
  3. Develop and manage infrastructure configurations and policies using Go to ensure secure, consistent, and auditable management of GCP resources for Chronicle SOAR.
  4. Design and implement sophisticated monitoring, logging, and tracing solutions.
  5. Analyze past incidents and proactively develop software solutions to prevent recurrence. Build automation to accelerate incident detection, diagnosis and resolution, and contribute to blameless post-mortems to identify and implement systemic improvements.

Skills

Required

  • software development
  • developing large-scale infrastructure
  • distributed systems
  • networks
  • compute technologies
  • storage
  • hardware architecture
  • Go
  • Python
  • Java
  • C++
  • C#

Nice to have

  • designing, analyzing, and troubleshooting large-scale distributed systems
  • container orchestration technologies (e.g., Kubernetes, GKE)
  • infrastructure automation
  • configuration management tools (e.g., Terraform, Ansible)
  • incident management
  • on-call rotations
  • security principles and practices
  • access management
  • compliance
  • monitoring, logging, and alerting best practices and tools