Business Support Engineer - Meta Business Agents

This role focuses on building, launching, and optimizing AI solutions, specifically using LLMs and AI agents, for business partners. It involves end-to-end ownership from prototype to production, including fine-tuning and optimization of models, and integrating AI tools to improve workflows. The role also requires supporting partners with distributed systems and API troubleshooting, and contributing to the product's strategic roadmap.

What you'd actually do

Provide proactive and reactive engineering support for partners, independently managing complex outages to ensure high partner satisfaction
Troubleshoot large-scale distributed systems and partner integrations, championing operational excellence and engineering craftsmanship
Leverage AI tools to accelerate troubleshooting, automate repetitive tasks, and scale your impact with an 'AI native' mindset
Build, launch, and optimize AI solutions using Llama and other LLMs, owning the full lifecycle from prototype to production
Develop performance monitoring systems for partner integrations to ensure high availability; leverage metrics to proactively identify issues and drive improvements across teams

Skills

Required

Software engineering or Site Reliability Engineering background
Experience in API development on cloud-based infrastructures, being able to debug, identify root causes and resolve independently outages that impact Meta Partners
Experience with the full web stack, REST APIs, Python, PHP/Hack, and JavaScript/React development, along with debugging and bug management
Knowledge on fine-tuning and optimizations of PyTorch models and with at least one LLM, such as LLaMA, GPT, Claude, or Falcon, etc
Experience in communicating with technical and business audiences and writing technical documentation
Experience in assessing, analyzing, and resolving operational issues using data analysis (SQL)
Fluency in English is required as this role will work closely with internal stakeholders and external customers whose usual business language is English
Experience in building and deploying solutions on cloud platforms (e.g., AWS, GCP, Azure)
Experience adhering to and implementing responsible, ethical AI practices (e.g., risk assessment, bias mitigation, quality and accuracy reviews)
Experience with Open Source cloud stacks like Kubernetes, Kubeflow, Docker containers
Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration) and staying current with emerging AI technologies
Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)
Hands-on experience working with large language models and AI agents
Experience collaborating with engineering teams and stakeholders across multiple regions and time zones
Experience transforming data, model selection/training/optimization, and deployment at scale
Experience in partner-facing or customer-centric engineering roles

Nice to have

Provide 24/7 oncall support coverage via rotation schedule, including weekends
Collaborate with Platform and Infrastructure teams to investigate issues, align on fixes, and drive continuous product improvement
Create clear documentation, specs, guides, and presentations to communicate complex AI concepts to diverse audiences, scaling the team's knowledge internally and externally
Drive end-to-end execution, using sound judgment to manage stakeholder expectations and ensuring clear alignment.
Develop and share AI/ML knowledge, actively coach and mentor peers on technical troubleshooting and project execution

What the JD emphasized

Experience in partner-facing or customer-centric engineering roles
Experience with the full web stack, REST APIs, Python, PHP/Hack, and JavaScript/React development, along with debugging and bug management
Experience in assessing, analyzing, and resolving operational issues using data analysis (SQL)
Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration) and staying current with emerging AI technologies
Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements)
Hands-on experience working with large language models and AI agents
Experience transforming data, model selection/training/optimization, and deployment at scale

Other signals

Build, launch, and optimize AI solutions using Llama and other LLMs, owning the full lifecycle from prototype to production
Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration)
Hands-on experience working with large language models and AI agents
Experience transforming data, model selection/training/optimization, and deployment at scale

Read full job description

Meta recently launched its Business Agent, helping businesses of every size use AI to boost productivity and deliver more personalized customer experiences. Business Support Engineering will be at the forefront of this shift, and we’re looking for an engineer to play a pivotal role supporting Meta’s partners bringing demonstrated experience in distributed systems and API troubleshooting and a focus on improving the end-to-end support experience. As a Business Support Engineer, you will work closely with cross-functional teams and business partners across the globe, incorporating AI-driven business solutions into their service offerings. You will track industry advancements and partner experiences, evaluating their impact and influencing the product's strategic roadmap.

Responsibilities

Provide proactive and reactive engineering support for partners, independently managing complex outages to ensure high partner satisfaction Troubleshoot large-scale distributed systems and partner integrations, championing operational excellence and engineering craftsmanship Leverage AI tools to accelerate troubleshooting, automate repetitive tasks, and scale your impact with an 'AI native' mindset Build, launch, and optimize AI solutions using Llama and other LLMs, owning the full lifecycle from prototype to production Develop performance monitoring systems for partner integrations to ensure high availability; leverage metrics to proactively identify issues and drive improvements across teams Provide 24/7 oncall support coverage via rotation schedule, including weekends Collaborate with Platform and Infrastructure teams to investigate issues, align on fixes, and drive continuous product improvement Create clear documentation, specs, guides, and presentations to communicate complex AI concepts to diverse audiences, scaling the team's knowledge internally and externally Drive end-to-end execution, using sound judgment to manage stakeholder expectations and ensuring clear alignment. Develop and share AI/ML knowledge, actively coach and mentor peers on technical troubleshooting and project execution

Qualifications

Software engineering or Site Reliability Engineering background Experience in API development on cloud-based infrastructures, being able to debug, identify root causes and resolve independently outages that impact Meta Partners Experience with the full web stack, REST APIs, Python, PHP/Hack, and JavaScript/React development, along with debugging and bug management Knowledge on fine-tuning and optimizations of PyTorch models and with at least one LLM, such as LLaMA, GPT, Claude, or Falcon, etc Experience in communicating with technical and business audiences and writing technical documentation Experience in assessing, analyzing, and resolving operational issues using data analysis (SQL) Fluency in English is required as this role will work closely with internal stakeholders and external customers whose usual business language is English Experience in building and deploying solutions on cloud platforms (e.g., AWS, GCP, Azure) Experience adhering to and implementing responsible, ethical AI practices (e.g., risk assessment, bias mitigation, quality and accuracy reviews) Experience with Open Source cloud stacks like Kubernetes, Kubeflow, Docker containers Demonstrated ongoing AI skill development (e.g., prompt/context engineering, agent orchestration) and staying current with emerging AI technologies Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements) Hands-on experience working with large language models and AI agents Experience collaborating with engineering teams and stakeholders across multiple regions and time zones Experience transforming data, model selection/training/optimization, and deployment at scale Experience in partner-facing or customer-centric engineering roles