Staff Machine Learning Engineer - Integrations & Solutions Group (au Remote)

Canva Canva · Enterprise · Melbourne, VIC, Australia · Information Technology

Staff Machine Learning Engineer to drive the design, evolution, and architecture of AI integration layers, agent systems, and evaluation frameworks connecting Canva's design capabilities with leading AI assistants. The role involves defining standards for LLM interaction, pioneering agent communication, owning evaluation pipelines, building observability systems, and influencing integration strategy with partners like OpenAI and Google.

What you'd actually do

  1. Drive the design and evolution of AI-ready tools and APIs that enable LLM platforms (ChatGPT, Claude, Gemini and others) to reliably interact with Canva's design capabilities — defining the patterns and standards that other teams adopt for tool descriptions, payload structures, and intent-based interfaces. Pioneer agent-to-agent communication approaches.
  2. Own and evolve evaluation frameworks that systematically measure tool-use accuracy across platforms — defining what "good" looks like for proxy-based fast evals and real-client production evals, and ensuring these frameworks scale as we add platforms and capabilities.
  3. Shape Canva's agent architecture — making strategic technical decisions about where intelligence should live (in external LLMs vs Canva-hosted agents), building the orchestration layers that allow third-party providers to invoke Canva's design tools at scale, and driving automation of complex workflows like marketing campaigns.
  4. Define and build observability systems that give multiple teams visibility into how AI assistants consume Canva's tools in production — identifying failure patterns, setting quality benchmarks, and closing the loop between production data and continuous improvement.
  5. Work across team and platform boundaries — proactively identifying problems not yet defined, understanding behavioural quirks across LLM platforms, and driving solutions that span the AI Integrations, API Capabilities, and Workflow Integrations teams.

Skills

Required

  • Python
  • ML frameworks
  • TypeScript/Node.js
  • Cloud infrastructure (Cloudflare Workers, AWS, or similar)
  • LLM-powered systems production experience
  • Evaluation pipelines
  • Technical standards setting
  • Cross-team and partner-facing work

Nice to have

  • Agent-to-agent communication
  • Orchestration layers

What the JD emphasized

  • quantify your impact
  • improving tool-call accuracy
  • reducing agent error rates
  • cutting latency
  • measurably improving user outcomes
  • find and solve problems others haven't defined yet
  • built or owned evaluation pipelines end-to-end
  • set technical standards that others follow
  • connect external ecosystem changes to internal strategy

Other signals

  • AI integration layer
  • agent architecture
  • evaluation frameworks
  • tool use accuracy
  • LLM platforms