What you'd actually do

Lead the team through architecture, implementation, production launch, and fast iteration.

Stay hands-on: review designs and code, inspect traces, debug production behavior, evaluate prototypes, and help engineers make pragmatic tradeoffs.

Translate Agent OS strategy into concrete platform slices that ship quickly without creating one-off agent implementations.

Define platform contracts for role shells, capabilities, tools, actions, approvals, context, memory, evidence, and evaluation.

Build the distinction between what an agent can do and what it is authorized to do in a given tenant, role, workflow state, and risk context.

Skills

Required

8+ years of software engineering experience
4+ years leading engineering teams or major technical initiatives
Strong technical background as a builder
Recent hands-on technical leadership (reviewed design docs, read implementation details, inspected production traces/logs, or debugged system behavior in the last 6–12 months)
Experience shipping AI, ML, data, platform, infrastructure, workflow, automation, or developer-platform systems in production
Practical understanding of modern LLM application architecture
Strong instincts for production agent safety
Production-minded approach to evaluation
Strong engineering judgment across APIs, distributed systems, event-driven systems, data platforms, observability, reliability, security, and multi-tenant SaaS constraints
Strong data and context instincts
Ability to turn ambiguous strategy into sequenced roadmaps, measurable outcomes, and clear ownership
Clear communication with engineers, product lead

Nice to have

AI
ML
data
platform
infrastructure
workflow
automation
developer-platform systems
model gateways
prompt/context assembly
retrieval
tool calling
structured outputs
memory
agent workflows
human approval patterns
typed tools
scoped permissions
business invariants
precondition checks
approval thresholds
reversible actions
idempotency
audit trails
rollback
scenario design
behavioral evals
regression suites
trace review
simulation
offline/online metrics
monitoring for non-deterministic systems
distributed systems
event-driven systems
data platforms
observability
reliability
security
multi-tenant SaaS constraints
SQL
unstructured data
vector search
metadata
provenance
source authority
freshness
privacy boundaries

Ready to be a Titan?

ServiceTitan is building the Agent OS for the trades: a shared platform that powers role-specific AI experiences across Atlas, field, office, voice, mobile, and future product surfaces.

This is not a collection of chatbots. Agent OS is the runtime, context, memory, action, trust, and evaluation layer that lets AI agents help contractors run their businesses safely, observably, and at enterprise scale.

We are looking for a Senior Engineering Manager to lead a small, hands-on AI platform team building the core Agent OS. This is a builder-manager role. The right person can lead engineers, shape architecture, make high-quality technical decisions, and stay close enough to the work to unblock design, implementation, debugging, evaluation, and production delivery.

This is not a pure people-management, AI strategy, or research leadership role. We need someone who can earn credibility with senior engineers by improving the technical work, not just coordinating it. We do not expect one person to have built every part of an agent platform before. We do expect strong engineering judgment, production scars, hands-on curiosity, and the ability to learn fast while making high-quality technical decisions.

What You’ll Build

You will lead a compact AI platform engineering team responsible for foundational Agent OS capabilities, including:

Agent runtime and workflow execution: role-specific agents, planning, tool use, delegation, pause/resume, long-running workflows, durable checkpoints, and failure recovery.
Context and memory systems: retrieval, tenant-aware memory, transcripts, artifacts, tool results, provenance, freshness, and replayable evidence.
Capability platform: reusable domain capabilities that combine prompts, tools, context requirements, policies, evals, rollout controls, ownership, and rollback expectations.
Action and trust layer: typed action contracts, scoped permissions, business precondition checks, approval flows, reversibility, idempotency, audit trails, and human-in-the-loop controls.
Evaluation and observability harness: offline and online evals, scenario libraries, simulation, trajectory review, regression detection, quality metrics, cost/latency telemetry, and autonomy promotion gates.
ServiceTitan integration: secure access to systems of record, governed data sources, domain context, Atlas integration, and role-specific agent experiences for owners, CSRs, dispatchers, technicians, managers, accountants, and back-office teams.

What You’ll Do

Lead the team through architecture, implementation, production launch, and fast iteration.
Stay hands-on: review designs and code, inspect traces, debug production behavior, evaluate prototypes, and help engineers make pragmatic tradeoffs.
Translate Agent OS strategy into concrete platform slices that ship quickly without creating one-off agent implementations.
Define platform contracts for role shells, capabilities, tools, actions, approvals, context, memory, evidence, and evaluation.
Build the distinction between what an agent can do and what it is authorized to do in a given tenant, role, workflow state, and risk context.
Partner with Product, Design, Architecture, Security, Data Platform, Atlas, and domain engineering teams to create useful, safe, and measurable agent capabilities.
Drive evaluation as part of everyday engineering: scenario design, regression suites, trace review, simulation, production monitoring, quality gates, and rollout criteria.
Help the team make model and inference tradeoffs across latency, cost, quality, structured outputs, caching, fallback behavior, and provider choices.
Ensure live ServiceTitan systems of record remain authoritative while memory, retrieval, transcripts, and agent-generated artifacts are governed as contextual evidence.
Work through real agent failures with the team: wrong tool calls, stale context, missing permissions, unsafe actions, poor retrieval, bad recommendations, latency spikes, and cost regressions.
Hire, coach, and retain strong engineers who can build fast, reason deeply, and operate responsibly in a fast-moving AI environment.

What You’ll Bring

8+ years of software engineering experience, including 4+ years leading engineering teams or major technical initiatives in a product or platform organization.
Strong technical background as a builder. You may not write production code every day, but you can read code, review implementation plans, reason through distributed systems, and debug real behavior.
Recent hands-on technical leadership: you have personally reviewed design docs, read implementation details, inspected production traces/logs, or debugged system behavior in the last 6–12 months.
Experience shipping AI, ML, data, platform, infrastructure, workflow, automation, or developer-platform systems in production.
Practical understanding of modern LLM application architecture: model gateways, prompt/context assembly, retrieval, tool calling, structured outputs, memory, agent workflows, and human approval patterns.
Strong instincts for production agent safety: typed tools, scoped permissions, business invariants, precondition checks, approval thresholds, reversible actions, idempotency, audit trails, and rollback.
Production-minded approach to evaluation: scenario design, behavioral evals, regression suites, trace review, simulation, offline/online metrics, and monitoring for non-deterministic systems.
Strong engineering judgment across APIs, distributed systems, event-driven systems, data platforms, observability, reliability, security, and multi-tenant SaaS constraints.
Strong data and context instincts: SQL, unstructured data, vector search, metadata, provenance, source authority, freshness, and privacy boundaries.
Ability to turn ambiguous strategy into sequenced roadmaps, measurable outcomes, and clear ownership.
Clear communication with engineers, product leaders, architects, security partners, and executives.
Low-ego coaching style. You raise the technical bar while helping the team move faster.

Preferred Experience

Experience building or operating agent runtimes, workflow engines, evaluation platforms, model gateways, ML platforms, developer platforms, or internal control planes.
Experience with approval-gated automation, compliance-sensitive workflows, audit trails, policy engines, or governed writes to systems of record.
Experience integrating AI systems into complex enterprise products where permissions, tenant boundaries, data freshness, customer trust, and reliability are first-order concerns.
Background in SaaS, vertical software, field service, fintech, ERP, CRM, marketplace, operations, or other domains where software decisions affect real-world business outcomes.

What Success Looks Like

Success means the first Agent OS primitives are real, shipped, and usable by product teams.

Role-specific agents can be built on shared runtime, context, memory, trust, action, and evaluation foundations. Domain teams can publish governed capabilities without rebuilding infrastructure. Customers can see why an agent made a recommendation, what evidence it used, what it is allowed to do, and when a human approved it.

The best candidate will not only manage the team building this system. They will help build the system.

**Remote Location: **US and Canada remote candidates will be considered. Candidates based in Pacific Time or able to work significant Pacific Time overlap are highly preferred.

Be Human With Us:

Being human isn’t about checking every box on a list. It’s about the experiences we have, people we meet, and the perspectives we share. So, if you have the skills but are hesitant to apply because of your background, apply anyway. We need amazing people like you to help us challenge the conventional and think differently about the problems that we’re solving. We’re in this together. Come be human, with us.

Use of AI Technology:

We use technology, including automated and AI-assisted tools, to support certain aspects of our recruitment process. These tools are designed to improve efficiency and enhance the candidate experience. AI tools are not used to make hiring decisions; all hiring decisions are made by our hiring teams.

What We Offer: When you join our team, you’re not just accepting a job. You’re making a career move. Here’s how we’ll support you in doing some of the most impactful work of your career:

Flextime, recognition, and support for autonomous work: Flexible time off with ample learning and development opportunities to continue growing your career. We offer a comprehensive onboarding program, leadership training for Titans at all levels, and other programs and events. Great work is rewarded through Bonusly, peer-nominated awards, and more.
Holistic health and wellness benefits: Company-paid medical, dental, and vision (with 100% employer paid options and 90% coverage for dependents), FSA and HSA, 401k match, and telehealth options including memberships to One Medical.
Support for Titans at all stages of life: Parental leave and support, up to $20k in fertility services (i.e. IUI and IVF), surrogacy, and adoption reimbursement, on demand maternity support through Maven Maternity, free breast milk shipping through Maven Milk, pet insurance, legal advisory services, financial planning tools, and more.

At ServiceTitan, we celebrate individuality and uniqueness. We believe that the convergence of fresh perspectives and experiences from all walks of life is what makes our product and culture so great. We strongly encourage people from underrepresented groups to apply. We do not discriminate against employees based on race, color, religion, sex, national origin, gender identity or expression, age, disability, pregnancy (including childbirth, breastfeeding, or related medical condition), genetic information, protected military or veteran status, sexual orientation, or any other characteristic protected by applicable federal, state or local laws.

ServiceTitan is committed to fair and equitable compensation for all of our employees. We thoughtfully consider a wide range of factors when determining individual compensation.The expected salary range for this role for candidates residing in the United States is between $223,600 USD - $299,100 USD. Compensation for candidates residing outside the United States will vary by location and the specific salary range will be discussed during the hiring process. Actual compensation for an individual may vary depending on skills, performance over time, qualifications, experience, and location. In addition to the base salary, the total compensation package also includes an annual bonus, equity and a holistic suite of benefits.