Production Engineer

Meta · Big Tech · New York, NY

Production Engineer role at Meta focused on developing, designing, and testing software services for optimal performance and capacity. Responsibilities include owning back-end data warehouse and front-end services, infrastructure components, writing and reviewing code, developing documentation and capacity plans, and debugging complex software systems in real time. The role also involves serving as an escalation contact for service incidents.

What you'd actually do

  1. Develop, design, create, modify, and/or test software services to ensure optimal performance and capacity for growth.
  2. Own back-end data warehouse services, front-end services like Messenger and Newsfeed, and infrastructure components to ensure services run without incident.
  3. Write and review code, develop documentation and capacity plans, and debug problems in real time in highly complex software systems.
  4. Serve as an escalation contact for service incidents.

Skills

Required

  • Bachelor's degree (or foreign degree equivalent) in Computer Science, Engineering, Information Systems, Analytics, Mathematics, Physics, Applied Sciences, or a related field
  • Completion of a graduate-level course, research project, or internship involving UNIX or Linux operating system fundamentals
  • TCP/IP network fundamentals
  • Coding in at least one of the following higher-level programming languages: PHP, Python, C++, or Java
  • Software frameworks and APIs
  • Performing 'guerrilla capacity planning' for internet service architectures
  • Internet service architectures (such as load balancing, LAMP, or CDNs)
  • Configuring and maintaining applications using at least one of the following: web servers, load balancers, relational databases, storage systems, or messaging systems
  • Relational Databases including MySQL
  • Network protocols including at least one of the following: NFS, DHCP, NTP, SSH, DNS, or SNMP
  • Maintaining web-based applications using at least one of the following: Apache, Memcached, or Squid
  • Storage Systems including NFS
  • Network Management tools like DHCP, NTP, SSH, DNS, or SNMP
  • Diagnosing and troubleshooting issues ranging from low-level hardware issues to large scale failures within datacenter clusters
  • Experience utilizing high performance query engines (Presto or Spark) for big data