Performance & Capacity Engineer - Planning Optimization

Meta Meta · Big Tech · Bellevue, WA +2

Meta is looking for a Performance & Capacity Engineer to optimize site-wide performance and capacity across all Meta products and physical infrastructure. This role involves building and optimizing capacity planning processes and tools, working cross-functionally with engineering and business teams, and informing strategic decisions at the executive level. The engineer will design and build software-driven solutions, connect business strategy with technical execution, and partner with various teams including Finance and Production Engineering. Experience with AI tools for workflow optimization and responsible AI practices is required.

What you'd actually do

  1. Own both technical as well as business outcomes for capacity planning for all of Meta: all software products/services and plans for how to scale server and data center resources most efficiently
  2. Build automated, scalable data and analytics solutions by developing advanced automation, mathematical optimization, and/or AI models
  3. Use the tools you build to own the business outcomes: develop and analyze variety of business and technical scenarios to inform executive decision-making around infrastructure/product, up to the CxO level
  4. Design and help build software systems to build scalable, reliable planning systems to connect business strategy with detailed technical execution including regional and temporal bin-packing, optimal service placement, traffic shifts and service migrations, efficient hardware refresh, etc
  5. Partner across the engineering technical landscape to optimize at the intersection of hardware, infrastructure, and software.

Skills

Required

  • 8+ years of experience in any coding language and designing software systems, OR 4+ years experience with a PhD
  • 8+ years of experience in capacity, performance, software, or reliability engineering, OR 4+ years experience with a PhD
  • Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)
  • Experience adhering to and implementing responsible, ethical AI practices (e.g., risk assessment, bias mitigation, quality and accuracy reviews)
  • Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration) and staying current with emerging AI technologies
  • Practical experience and demonstrated success in Capacity Planning for a major private or public cloud
  • Experience with planning for large-scale technical infrastructure and distributed systems
  • Experience working with variety of technical and business teams
  • Experience with mathematical optimization
  • Experience and interest in building "Zero to One" - building systems and process from scratch with ambiguous requirements and goals
  • MS or PhD in Computer Science, Operations Research, or other technical field

Nice to have

  • AI models

What the JD emphasized

  • AI tools to optimize/redesign workflows
  • responsible, ethical AI practices
  • AI skill development
  • Capacity Planning for a major private or public cloud
  • building systems and process from scratch with ambiguous requirements and goals