Performance & Capacity Engineer - Planning Optimization

Meta Meta · Big Tech · Bellevue, WA +2

Meta is looking for a Performance & Capacity Engineer to optimize site-wide performance and capacity across all Meta products and infrastructure. The role involves building and optimizing capacity planning processes and tools, using AI models and optimization techniques, and partnering with various engineering and business teams to inform strategic decisions and drive efficiency. The role emphasizes software-driven solutions and ROI.

What you'd actually do

  1. Own both technical as well as business outcomes for capacity planning for all of Meta: all software products/services and plans for how to scale server and data center resources most efficiently
  2. Build automated, scalable data and analytics solutions by developing advanced automation, mathematical optimization, and/or AI models
  3. Use the tools you build to own the business outcomes: develop and analyze variety of business and technical scenarios to inform executive decision-making around infrastructure/product, up to the CxO level
  4. Design and help build software systems to build scalable, reliable planning systems to connect business strategy with detailed technical execution including regional and temporal bin-packing, optimal service placement, traffic shifts and service migrations, efficient hardware refresh, etc
  5. Partner across the engineering technical landscape to optimize at the intersection of hardware, infrastructure, and software.

Skills

Required

  • 8+ years experience in any coding language and designing software systems
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 8+ years experience in capacity, performance, software, or reliability engineering
  • Experience navigating ambiguous problem spaces and ramping up on new technical and business domains to deliver results
  • Experience with mathematical optimization
  • Experience with planning for large-scale technical infrastructure and distributed systems
  • Experience working with variety of technical and business teams

Nice to have

  • MS or PhD in Computer Science, Operations Research, or other technical field
  • Practical experience and demonstrated success in Capacity Planning for a major private or public cloud

What the JD emphasized

  • Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)
  • Experience adhering to and implementing responsible, ethical AI practices (e.g., risk assessment, bias mitigation, quality and accuracy reviews)
  • Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration) and staying current with emerging AI technologies