Business Support Engineer

Meta Meta · Big Tech · Menlo Park, CA

Meta is seeking a Business Support Engineer to support partners in integrating and utilizing AI-driven business solutions, specifically focusing on Llama and other LLMs. The role involves providing engineering support, troubleshooting distributed systems, building and optimizing AI solutions, developing monitoring systems, and collaborating with cross-functional teams. The engineer will also be responsible for creating documentation and coaching peers on AI/ML expertise.

What you'd actually do

  1. Provide proactive and reactive engineering support for partners, independently managing complex outages to ensure high partner satisfaction
  2. Troubleshoot large-scale distributed systems and partner integrations, maintaining high code quality and operational standards
  3. Leverage AI tools to accelerate troubleshooting, automate repetitive tasks, and scale your impact with an 'AI native' mindset
  4. Build, launch, and optimize AI solutions using Llama and other LLMs, owning the full lifecycle from prototype to production
  5. Develop performance monitoring systems for partner integrations to ensure high availability; leverage metrics to proactively identify issues and drive improvements across teams

Skills

Required

  • Software Engineering or Site Reliability Engineering
  • API development on cloud-based infrastructures
  • Debugging and root cause analysis
  • Full web stack, REST APIs, Python, PHP/Hack, and JavaScript/React development
  • Fine-tuning and optimizations of PyTorch models
  • Experience with at least one LLM (LLaMA, GPT, Claude, Falcon, etc.)
  • Communicating with technical and business audiences
  • Writing technical documentation
  • Assessing, analyzing, and resolving operational issues using data analysis (SQL)
  • Integrating AI tools to optimize/redesign workflows
  • Open Source cloud stacks (Kubernetes, Kubeflow, Docker containers)
  • Responsible, ethical AI practices (risk assessment, bias mitigation, quality and accuracy reviews)
  • AI skill development (prompt/context engineering, agent orchestration)
  • Partner-facing or customer-centric engineering roles
  • Building and deploying solutions on cloud platforms (AWS, GCP, Azure)
  • Working with large language models and AI agents
  • Cross-cultural engineering environments
  • Data transformation, model selection/training/optimization, and deployment at scale

Nice to have

  • distributed systems
  • API troubleshooting
  • improving the end-to-end support experience
  • AI-driven business solutions
  • track industry advancements and partner experiences
  • evaluating their impact and influencing the product's strategic roadmap
  • performance monitoring systems
  • proactively identify issues and drive improvements
  • 24/7 oncall support coverage
  • Platform and Infrastructure teams collaboration
  • Create clear documentation, specs, guides, and presentations
  • scaling the team's knowledge internally and externally
  • Drive end-to-end execution
  • sound judgment to manage stakeholder expectations
  • ensuring clear alignment
  • Build recognized AI/ML expertise
  • actively coach and mentor peers on technical troubleshooting and project execution

What the JD emphasized

  • demonstrated experience in distributed systems and API troubleshooting
  • owning the full lifecycle from prototype to production
  • Experience with the full web stack, REST APIs, Python, PHP/Hack, and JavaScript/React development, along with debugging and bug management
  • Knowledge on fine-tuning and optimizations of PyTorch models and with at least one LLM such as LLaMA, GPT, Claude, Falcon, etc
  • Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact
  • Experience adhering to and implementing responsible, ethical AI practices
  • Demonstrated ongoing AI skill development and staying current with emerging AI technologies
  • Experience with data transformation, model selection/training/optimization, and deployment at scale

Other signals

  • Build, launch, and optimize AI solutions using Llama and other LLMs
  • Leverage AI tools to accelerate troubleshooting, automate repetitive tasks, and scale your impact
  • Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact