Software Engineer, Systems ML

Meta Meta · Big Tech · Sunnyvale, CA +1

Software Engineer focused on AI Infrastructure, optimizing ML systems and hardware acceleration for Meta's products. Responsibilities include system design, data-driven analysis, cross-team collaboration, and mentoring. Requires expertise in ML infrastructure, C++/Python, distributed systems, and ethical AI practices.

What you'd actually do

  1. Apply relevant AI infrastructure and hardware acceleration techniques to build & optimize our intelligent ML systems that improve Meta’s products and experiences
  2. Goal setting related to project impact, AI system design, and infrastructure/developer efficiency
  3. Directly or influencing partners to deliver impact through deep, thorough data-driven analysis
  4. Drive large efforts across multiple teams
  5. Define use cases, and develop methodology & benchmarks to evaluate different approaches

Skills

Required

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Specialized experience in one or more of the following machine learning/deep learning domains: Hardware accelerators architecture, GPU architecture, machine learning compilers, or ML systems, AI infrastructure, high performance computing, performance optimizations, or Machine learning frameworks (e.g. PyTorch), numerics and SW/HW co-design
  • Experience developing AI-System infrastructure or AI algorithms in C/C++ or Python
  • Master/PhD degree in Computer Science, Computer Engineering
  • Technical leadership experience
  • Experience with distributed systems or on-device algorithm development
  • Experience with recommendation and ranking models
  • Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)
  • Experience adhering to and implementing responsible, ethical AI practices (e.g., risk assessment, bias mitigation, quality and accuracy reviews)
  • Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration) and staying current with emerging AI technologies

Nice to have

  • on-device algorithm development
  • prompt/context engineering

What the JD emphasized

  • AI Infrastructure
  • hardware acceleration
  • ML systems
  • performance optimizations
  • distributed systems
  • recommendation and ranking models
  • ethical AI practices
  • agent orchestration

Other signals

  • AI Infrastructure
  • Hardware acceleration
  • ML systems
  • Performance optimizations
  • Distributed systems
  • Recommendation and ranking models