What you'd actually do

Build real-time AI agent infrastructure: Design and operate the stateful, low-latency runtime that powers voice and chat AI agents — from LLM streaming and conversation state management to graceful recovery and multi-channel support.

Solve distributed systems problems: Own session management across scaled-out workers — including affinity, checkpointing, crash recovery, and consistency under concurrent access.

Build a function execution platform: Own a serverless-style runtime where customers deploy custom logic — build orchestration, container lifecycle, autoscaling, and versioned rollouts.

Own developer experience and test infrastructure: Build CLI tools, local development environments, and test execution frameworks that let engineers iterate quickly and ship with confidence.

Raise the bar on production quality: Drive observability, incident response, and engineering best practices across the team.

Skills

Required

5+ years of software engineering experience, with meaningful time spent on infrastructure, platform, or systems work.
Strong Python and Go
Deep understanding of distributed systems: consistency, fault tolerance, state management, concurrency.
Experience with Kubernetes and cloud-native infrastructure.
Experience building developer-facing tooling — CLIs, SDKs, local dev environments, or internal platforms.
Strong communicator who can drive technical decisions, write clear design docs, and mentor others.
High bar for code quality — thorough testing, thoughtful code review, and sustainable engineering practices.
Comfort operating what you build — on-call, incident response, and production ownership.

Nice to have

Experience with real-time voice or streaming media systems.
Hands-on with LLM integration — streaming inference, prompt orchestration, retrieval-augmented generation.
Experience building serverless or function-as-a-service platforms.
Workflow engines (Temporal, Argo, Airflow) for durable, long-running processes.
Experience in conversational AI or speech domains.
Infrastructure-as-code and GitOps workflows.

Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Our platform combines the best of AI and human intelligence to help contact centers discover customer insights and behavioral best practices, automate conversations and inefficient processes, and empower every team member to work smarter and faster. Born from the prestigious Stanford AI lab, Cresta's co-founder and chairman isSebastian Thrun, the genius behind Google X, Waymo, Udacity, and more. Our leadership also includes CEO,Ping Wu, the co-founder of Google Contact Center AI and Vertex AI platform,and co-founder, Tim Shi, an early member of Open AI.

Join us on this thrilling journey to revolutionize the workforce with AI. The future of work is here, and it's at Cresta.

About the role

Build real-time AI agent infrastructure: Design and operate the stateful, low-latency runtime that powers voice and chat AI agents — from LLM streaming and conversation state management to graceful recovery and multi-channel support.
Solve distributed systems problems: Own session management across scaled-out workers — including affinity, checkpointing, crash recovery, and consistency under concurrent access.
Build a function execution platform: Own a serverless-style runtime where customers deploy custom logic — build orchestration, container lifecycle, autoscaling, and versioned rollouts.
Own developer experience and test infrastructure: Build CLI tools, local development environments, and test execution frameworks that let engineers iterate quickly and ship with confidence.
Raise the bar on production quality: Drive observability, incident response, and engineering best practices across the team.

What we’re looking for

5+ years of software engineering experience, with meaningful time spent on infrastructure, platform, or systems work.
Strong Python and Go — both are core to this role, not one primary and one secondary.
Deep understanding of distributed systems: consistency, fault tolerance, state management, concurrency.
Experience with Kubernetes and cloud-native infrastructure.
Experience building developer-facing tooling — CLIs, SDKs, local dev environments, or internal platforms.
Strong communicator who can drive technical decisions, write clear design docs, and mentor others.
High bar for code quality — thorough testing, thoughtful code review, and sustainable engineering practices.
Comfort operating what you build — on-call, incident response, and production ownership.
AI-native workflow — you actively use LLMs and AI-assisted tools in your daily development, and can leverage them to move faster and tackle problems that would otherwise be impractical.

Nice-to-haves

Experience with real-time voice or streaming media systems.
Hands-on with LLM integration — streaming inference, prompt orchestration, retrieval-augmented generation.
Experience building serverless or function-as-a-service platforms.
Workflow engines (Temporal, Argo, Airflow) for durable, long-running processes.
Experience in conversational AI or speech domains.
Infrastructure-as-code and GitOps workflows.

Perks & Benefits:

We offer Cresta employees a variety of medical, dental, and vision plans, designed to fit you and your family’s needs
Paid parental leave to support you and your family
Monthly Health & Wellness allowance
Work from home office stipend to help you succeed in a remote environment
Lunch reimbursement for in-office employees
PTO: 3 weeks in Canada

Compensation for this position includes a base salary, equity, and a variety of benefits. Actual base salaries will be based on candidate-specific factors, including experience, skillset, and location, and local minimum pay requirements as applicable. We are actively hiring for this role in the US and Canada. Your recruiter can provide further details.

We have noticed a rise in recruiting impersonations across the industry, where scammers attempt to access candidates' personal and financial information through fake interviews and offers. All Cresta recruiting email communications will always come from the @cresta.ai domain. Any outreach claiming to be from Cresta via other sources should be ignored. If you are uncertain whether you have been contacted by an official Cresta employee, reach out to recruiting@cresta.ai

Join us on this thrilling journey to revolutionize the workforce with AI. The future of work is here, and it's at Cresta.

About the role

Build real-time AI agent infrastructure: Design and operate the stateful, low-latency runtime that powers voice and chat AI agents — from LLM streaming and conversation state management to graceful recovery and multi-channel support.
Solve distributed systems problems: Own session management across scaled-out workers — including affinity, checkpointing, crash recovery, and consistency under concurrent access.
Build a function execution platform: Own a serverless-style runtime where customers deploy custom logic — build orchestration, container lifecycle, autoscaling, and versioned rollouts.
Own developer experience and test infrastructure: Build CLI tools, local development environments, and test execution frameworks that let engineers iterate quickly and ship with confidence.
Raise the bar on production quality: Drive observability, incident response, and engineering best practices across the team.

What we’re looking for

5+ years of software engineering experience, with meaningful time spent on infrastructure, platform, or systems work.
Strong Python and Go — both are core to this role, not one primary and one secondary.
Deep understanding of distributed systems: consistency, fault tolerance, state management, concurrency.
Experience with Kubernetes and cloud-native infrastructure.
Experience building developer-facing tooling — CLIs, SDKs, local dev environments, or internal platforms.
Strong communicator who can drive technical decisions, write clear design docs, and mentor others.
High bar for code quality — thorough testing, thoughtful code review, and sustainable engineering practices.
Comfort operating what you build — on-call, incident response, and production ownership.
AI-native workflow — you actively use LLMs and AI-assisted tools in your daily development, and can leverage them to move faster and tackle problems that would otherwise be impractical.

Nice-to-haves

Experience with real-time voice or streaming media systems.
Hands-on with LLM integration — streaming inference, prompt orchestration, retrieval-augmented generation.
Experience building serverless or function-as-a-service platforms.
Workflow engines (Temporal, Argo, Airflow) for durable, long-running processes.
Experience in conversational AI or speech domains.
Infrastructure-as-code and GitOps workflows.

Perks & Benefits:

We offer Cresta employees a variety of medical, dental, and vision plans, designed to fit you and your family’s needs
Paid parental leave to support you and your family
Monthly Health & Wellness allowance
Work from home office stipend to help you succeed in a remote environment
Lunch reimbursement for in-office employees
PTO: 3 weeks in Canada

Senior Software Engineer, Backend (ai Agent Runtime)

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

About the role

About the role