Staff Software Engineer, Backend

Cresta Cresta · Vertical AI · AB, Canada, Canada · Remote · Engineering

Staff Software Engineer, Backend role at Cresta, focusing on designing and evolving backend systems for an AI platform that powers conversational AI agents. The role involves architecting scalable, distributed systems, collaborating with ML engineers to operationalize AI capabilities, and building runtime infrastructure and orchestration systems for AI agents. It emphasizes leadership, scalable systems design, AI/ML collaboration, data solutions, production excellence, and technical mentorship.

What you'd actually do

  1. Lead the architecture and evolution of large-scale distributed backend systems, driving cross-team technical initiatives from design through production
  2. Build high-performance, fault-tolerant backend services supporting real-time AI agents, conversation intelligence, and enterprise integrations
  3. Partner with ML engineers to operationalize AI capabilities, building the runtime infrastructure and orchestration systems that power AI agents at scale
  4. Design and optimize data storage and retrieval systems for real-time analytics and insights
  5. Drive continuous improvements in system performance, cost efficiency, and scalability to handle high-volume production workloads

Skills

Required

  • 8+ years building scalable backend systems in production environments
  • Proven experience leading large-scale distributed system architecture with deep understanding of consistency, fault tolerance, state management, and concurrency
  • Strong proficiency in at least one major programming language (Python, Go, Java, C++, or similar)
  • Strong API design expertise (REST, gRPC) and service-oriented architectures
  • Experience with database design and optimization across SQL and NoSQL systems
  • Hands-on experience with containerization and orchestration (Kubernetes, Docker)
  • Cloud platform expertise (AWS, GCP, or Azure) with strong security knowledge
  • Experience building developer-facing tooling (CLIs, SDKs, testing frameworks)
  • Strong expertise in system reliability, observability, monitoring, and production ownership (including on-call)
  • Excellent communication skills for technical decision-making, cross-functional collaboration, and mentorship
  • High standards for code quality, testing, and engineering best practices

Nice to have

  • Effective use of AI-assisted coding tools (Claude Code, OpenAI Codex, Cursor, etc.) to significantly boost development productivity
  • Deep understanding of real-time communication systems, low-latency design, and high-availability architectures
  • Proven track record with third-party enterprise platform integrations
  • Experience with Temporal workflow orchestration platform
  • Hands-on experience with Postgres, ClickHouse, and Elasticsearch
  • Experience with Contact Center as a Service (CCaaS) platforms or contact center systems
  • Familiarity with AI Agents, LLM-based systems, or AI orchestration platforms
  • Real-time voice or streaming media systems background
  • LLM integration experience (streaming inference, prompt orchestration, RAG)
  • Infrastructure-as-code and GitOps workflows

What the JD emphasized

  • backend systems
  • AI platform
  • AI agents
  • runtime infrastructure
  • orchestration systems
  • large-scale distributed backend systems
  • real-time AI agents
  • AI/ML Collaboration
  • AI Agents
  • LLM-based systems
  • AI orchestration platforms
  • LLM integration experience

Other signals

  • AI platform
  • AI agents
  • conversation intelligence
  • backend systems
  • runtime infrastructure
  • orchestration systems