Senior Manager, Software Engineering (ml Platform)

Affirm Affirm · Fintech · United States · Remote · Checkout

Senior Manager to lead ML Platform engineering at Affirm, focusing on building and operating infrastructure for feature computation, model training, and model serving at scale. The role involves technical strategy, roadmap ownership, team leadership, and staying ahead of AI/ML trends.

What you'd actually do

  1. Own the technical strategy and roadmap for ML Platform, covering real-time and batch feature computation, model training infrastructure, and model serving at scale.
  2. Lead and grow a team of engineering managers, while staying hands-on with the technical direction and maintaining close partnership with ICs.
  3. Continuously evolve the platform to stay ahead of the frontier — anticipating where AI and ML are heading and building the infrastructure that makes those capabilities possible at Affirm before they become urgent needs. This includes large-scale training and serving of transformer-based models, GPU compute, reinforcement learning, and whatever comes next.
  4. Partner with ML modeling, product, and infrastructure leadership to ensure the platform accelerates Affirm's most critical ML initiatives.
  5. Establish engineering excellence across the organization: reliability, observability, developer experience, and operational rigor.

Skills

Required

  • Software Engineering Management
  • ML Platform Infrastructure
  • Distributed Systems
  • Large-scale Training
  • Model Serving
  • Feature Stores
  • Data Pipelines
  • Deep Neural Networks
  • Transformer Architectures
  • Reinforcement Learning
  • GPU Compute
  • Systems Thinking
  • Technical Strategy
  • Roadmap Development
  • Engineering Excellence
  • Reliability
  • Observability
  • Developer Experience
  • Operational Rigor
  • Recruiting
  • Team Leadership

Nice to have

  • Applied ML modeling experience

What the JD emphasized

  • lead our ML Platform engineering organization
  • builds and operates the critical infrastructure
  • technically demanding role
  • stay deeply connected to the engineers building the platform
  • continuously evolve the platform to stay ahead of the frontier
  • large-scale training and serving of transformer-based models
  • accelerates Affirm's most critical ML initiatives
  • 12+ years of industry experience
  • significant hands-on software engineering experience
  • 4+ years managing engineering managers
  • Deep expertise in building and operating large-scale ML infrastructure
  • Strong systems thinking
  • Track record of building platforms that meaningfully accelerate the productivity and impact of ML teams

Other signals

  • ML Platform Engineering
  • Distributed Systems
  • Infrastructure
  • Scale
  • Leadership