Production Engineer

Meta Meta · Big Tech · Bellevue, WA +2

Production Engineers (PEs) at Meta are specialized software engineers who develop the underlying infrastructure for all of Meta's products and services, forming the backbone of every major engineering effort that keeps our platforms running smoothly and scaling efficiently. PEs work across Meta’s product and infrastructure teams to ensure our services are reliable, performant, and capable of supporting billions of users. This means writing high‑quality code, solving complex problems in live production, and tackling challenges that impact over 2 billion people worldwide. The role involves owning back-end services, leading engineering teams, writing and reviewing code, debugging complex systems, and participating in on-call rotations. Qualifications include extensive experience in *nix, network fundamentals, coding in standard languages, configuring infrastructure applications, and knowledge of web technologies. A demonstrated ability to integrate AI tools to optimize workflows and drive impact, along with experience in responsible AI practices and ongoing AI skill development, is also required.

What you'd actually do

  1. Own back-end services which handle fleet management, front-end services such as WhatsApp / Instagram / Facebook / Meta Ads, infrastructure components that drive Meta’s advances in AI, core services which are used by every team at Meta, the world’s largest MySQL deployments, networking systems and everything in between
  2. Lead your engineering team by example, mentor and help others around you grow, be a force multiplier of impact
  3. Write and review code, develop documentation and capacity plans, and debug the hardest problems, live, on some of the largest and most complex systems in the world
  4. Together with your engineering team, you will share an on-call rotation and be an escalation contact for service incidents
  5. Partner alongside the best engineers in the industry working on the coolest stuff around, the code and systems you work on will be in production and used by billions of people all around the world

Skills

Required

  • software
  • infrastructure
  • backend services
  • fleet management
  • front-end services
  • AI infrastructure
  • core services
  • MySQL
  • networking systems
  • code quality
  • complex problem solving
  • production systems
  • mentoring
  • documentation
  • capacity planning
  • debugging
  • on-call rotation
  • service incidents
  • nix (Linux or UNIX-like OS)
  • Network fundamentals
  • Java
  • Python
  • C++
  • PHP/Hack
  • Rust
  • Go
  • Kubernetes
  • Terraform
  • MySQL
  • web technologies
  • Internet service architectures
  • CDN
  • Load Balancing
  • capacity planning
  • urgent capacity augmentation
  • AI tools integration
  • workflow optimization
  • ethical AI practices
  • risk assessment
  • bias mitigation
  • quality and accuracy reviews
  • prompt engineering
  • context engineering
  • agent orchestration
  • emerging AI technologies

Nice to have

  • BS or MS in Computer Science

What the JD emphasized

  • 10+ years of experience in *nix (Linux or another UNIX-like OS) and Network fundamentals
  • 10+ years of coding experience in an industry-standard language (e.g. Java, Python, C++, PHP/Hack, Rust, Go)
  • Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)
  • Experience adhering to and implementing responsible, ethical AI practices (e.g., risk assessment, bias mitigation, quality and accuracy reviews)
  • Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration) and staying current with emerging AI technologies