Business Support Engineer - Meta Business Agents

Meta Meta · Big Tech · Singapore

This role focuses on building, launching, and optimizing AI solutions, specifically using LLMs and AI agents, for business partners. It involves end-to-end ownership from prototype to production, including fine-tuning and optimization of models, and integrating AI tools to improve workflows. The role also requires supporting partners with distributed systems and API troubleshooting, and contributing to the product's strategic roadmap.

What you'd actually do

  1. Provide proactive and reactive engineering support for partners, independently managing complex outages to ensure high partner satisfaction
  2. Troubleshoot large-scale distributed systems and partner integrations, championing operational excellence and engineering craftsmanship
  3. Leverage AI tools to accelerate troubleshooting, automate repetitive tasks, and scale your impact with an 'AI native' mindset
  4. Build, launch, and optimize AI solutions using Llama and other LLMs, owning the full lifecycle from prototype to production
  5. Develop performance monitoring systems for partner integrations to ensure high availability; leverage metrics to proactively identify issues and drive improvements across teams

Skills

Required

  • Software engineering or Site Reliability Engineering background
  • Experience in API development on cloud-based infrastructures, being able to debug, identify root causes and resolve independently outages that impact Meta Partners
  • Experience with the full web stack, REST APIs, Python, PHP/Hack, and JavaScript/React development, along with debugging and bug management
  • Knowledge on fine-tuning and optimizations of PyTorch models and with at least one LLM, such as LLaMA, GPT, Claude, or Falcon, etc
  • Experience in communicating with technical and business audiences and writing technical documentation
  • Experience in assessing, analyzing, and resolving operational issues using data analysis (SQL)
  • Fluency in English is required as this role will work closely with internal stakeholders and external customers whose usual business language is English
  • Experience in building and deploying solutions on cloud platforms (e.g., AWS, GCP, Azure)
  • Experience adhering to and implementing responsible, ethical AI practices (e.g., risk assessment, bias mitigation, quality and accuracy reviews)
  • Experience with Open Source cloud stacks like Kubernetes, Kubeflow, Docker containers
  • Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration) and staying current with emerging AI technologies
  • Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)
  • Hands-on experience working with large language models and AI agents
  • Experience collaborating with engineering teams and stakeholders across multiple regions and time zones
  • Experience transforming data, model selection/training/optimization, and deployment at scale
  • Experience in partner-facing or customer-centric engineering roles

Nice to have

  • Provide 24/7 oncall support coverage via rotation schedule, including weekends
  • Collaborate with Platform and Infrastructure teams to investigate issues, align on fixes, and drive continuous product improvement
  • Create clear documentation, specs, guides, and presentations to communicate complex AI concepts to diverse audiences, scaling the team's knowledge internally and externally
  • Drive end-to-end execution, using sound judgment to manage stakeholder expectations and ensuring clear alignment.
  • Develop and share AI/ML knowledge, actively coach and mentor peers on technical troubleshooting and project execution

What the JD emphasized

  • Experience in partner-facing or customer-centric engineering roles
  • Experience with the full web stack, REST APIs, Python, PHP/Hack, and JavaScript/React development, along with debugging and bug management
  • Experience in assessing, analyzing, and resolving operational issues using data analysis (SQL)
  • Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration) and staying current with emerging AI technologies
  • Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)
  • Hands-on experience working with large language models and AI agents
  • Experience transforming data, model selection/training/optimization, and deployment at scale

Other signals

  • Build, launch, and optimize AI solutions using Llama and other LLMs, owning the full lifecycle from prototype to production
  • Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration)
  • Hands-on experience working with large language models and AI agents
  • Experience transforming data, model selection/training/optimization, and deployment at scale