Lead System Administrator

Braze Braze · Enterprise · Chicago, IL · Engineering

This role is for a Lead System Administrator at Braze, focusing on the reliability, security, and operational excellence of IT services. Responsibilities include technical escalation, incident response, root cause analysis, system improvement through automation, and mentoring. The role requires experience with various SaaS platforms, scripting, automation, and IT operations best practices. It is not directly related to AI/ML development or research.

What you'd actually do

  1. Serve as the primary escalation point for the Service Desk to investigate and resolve complex technical issues
  2. Own the maintenance, configuration, availability, and business continuity of core IT services
  3. Act as Incident Manager or partner closely with Incident Management during service outages, and security incidents, ensuring clear and timely communication to the business
  4. Identify recurring issues, define corrective actions, and implement long-term solutions
  5. Provide advanced support for Google Workspace, including email delivery, permissions, security issues, and service integrations

Skills

Required

  • Supporting SaaS platforms such as Google Workspace, Slack, Okta, Iru, and other enterprise IT services, including API-based administration
  • Scripting and automation using tools such as Bash, Python, and/or Ruby
  • Designing, implementing, and improving IT services
  • IT operations best practices, including security, storage, data protection, and disaster recovery
  • Networking fundamentals, including familiarity with the OSI model
  • Software development lifecycle principles

Nice to have

  • ITIL Foundation (or higher) certification
  • Managing cloud infrastructure in AWS, Azure, or Google Cloud Platform