Software Engineer, DevInfra

Mixpanel · Data AI · San Francisco, CA · Engineering

The Software Engineer, DevInfra role at Mixpanel focuses on building and supporting the software development lifecycle. It involves deploying AI tools for engineers, building agents for tasks like on-call response and bug fixing, and managing core infrastructure on Google Cloud Platform (GCP) and Kubernetes.

What you'd actually do

  1. We partner with a wide variety of teams to optimize their software development lifecycle for speed, safety, and reliability. We support teams writing front-end UI code as well as teams maintaining our highly stateful storage systems deep in our stack.
  2. We are responsible for deploying AI tools like Claude Code to our entire engineering team. We are building agents into our platform to do things like augment on-call response and automatically one-shot bugs that come through a team’s triage queue.
  3. We support systems that ingest more than 1 trillion user-generated events every month while maintaining low end-to-end query latency. Mixpanel queries typically scan more than 1 quadrillion events over the span of a month. More details in [this blog post](https://engineering.mixpanel.com/under-the-hood-of-mixpanels-infrastructure-0c7682125e9b?source=collection_home---4------3-----------------------).
  4. Mixpanel runs entirely on Google Cloud Platform and Google Kubernetes Engine. DevInfra is the overall admin for our cloud environment, so we wear many hats. We are responsible for things like Terraform, cost management, networking, and security best practices.
  5. We are the overall service owner for Kubernetes. Individual teams are responsible for maintaining and monitoring their own clusters and workloads. We are responsible for setting standards for deployment, observability, and developer experience. We facilitate efforts like upgrading Kubernetes versions and helping teams adopt new Google Kubernetes Engine features.

Skills

Required

  • 3+ years of industry experience
  • Experience with at least one of: Go, Python, or JavaScript/TypeScript
  • Production experience working with Kubernetes
  • Production experience managing infrastructure in a major cloud provider such as Google Cloud Platform, Amazon Web Services, or Azure
  • Knowledgeable about coding agents like Claude Code

Nice to have

  • Experience with observability solutions like OpenTelemetry, Prometheus, or distributed tracing for application monitoring and performance analysis
  • Experience with service mesh
  • You write excellent docs
  • Proficient with GitHub Actions and the GitHub ecosystem
  • Experience building deployment pipelines
  • Bazel expertise
  • Experience with Terraform
  • Experience with cloud Identity and Access Management (IAM) systems and knowledgeable about security best practices
  • Experience implementing Internal Developer Platforms like Backstage
  • Working knowledge of site reliability engineering (SRE) principles such as implementing Service Level Objectives (SLOs)
  • You ❤️ open source

What the JD emphasized

  • solving tooling pain points
  • automating away toil
  • supporting their fellow engineers
  • good taste and leverages AI to great effect without generating slop
  • Frequent forays into uncharted territory require our team to be highly adaptable and always ready to learn
  • Knowledgeable about coding agents like Claude Code

Other signals

  • AI tools like Claude Code
  • building agents into our platform
  • augment on-call response
  • automatically one-shot bugs