Senior Machine Learning Engineer

Cloudflare · Enterprise · Austin, TX · Business Intelligence

Cloudflare is seeking a Senior Machine Learning Engineer to architect and build the next generation of its unified AI/ML platform, powering traditional ML models, generative AI, LLMs, and autonomous agent frameworks. The role involves owning the end-to-end technical strategy, design, and execution of scalable backend services and data pipelines, driving vision from requirements to global deployment and long-term ownership. Responsibilities include architecting a multi-tenant AI/ML platform, designing and implementing AI Agents and Multi-Agent Systems, building high-throughput backends, acting as a technical anchor for Data Science, and evaluating AI infrastructure tools.

What you'd actually do

  1. Architect and evolve a highly scalable, multi-tenant AI/ML platform that seamlessly unifies traditional ML (classification, regression, forecasting) and Generative AI/LLM orchestration.
  2. Design and implement robust production-grade AI Agents and Advanced Chatbots. Build reliable execution environments for Multi-Agent Systems, including state management, long-term memory architectures, and Model Context Protocol (MCP) server integrations.
  3. Build high-throughput, low-latency application backends and orchestration layers. Partner closely with data, platform, and full-stack engineers to ensure seamless feature delivery and reliable production operations.
  4. Act as a technical anchor for the Data Science team – enforcing rigorous engineering standards, leading design and security reviews, evaluating build-vs-buy decisions, and mapping business requirements to robust technical designs.
  5. Evaluate trade-offs and drive adoption of modern AI infrastructure tools, optimized embedding pipelines, vector databases, and serverless compute paradigms (such as Workers AI).

Skills

Required

  • Extensive experience as a Senior or Lead ML Engineer
  • Proven track record of architecting and operating production-grade ML platforms, services and distributed backends
  • Strong competency in Traditional ML lifecycles (feature stores, training pipelines, model monitoring)
  • Deep experience in Generative AI patterns (RAG pipelines, context engineering, fine-tuning, guardrailing, and agentic AI systems)
  • Mastery of Python
  • Robust experience with modern backend ecosystems
  • 3+ years of dedicated ML Engineering experience within a large-scale, enterprise environment
  • Proven ability to architect, scale, and secure reliable, highly observable distributed systems
  • Experience mentoring engineers
  • Leading by example through high-quality code and rigorous design reviews
  • Fostering a culture of technical excellence
  • Strong problem-solving skills
  • Demonstrated ability to independently drive complex projects through ambiguous spaces
  • Ability to collaborate cross-functionally with data engineers, full-stack teams, and analysts
  • Hands-on proficiency in building production-grad

Nice to have

  • Familiarity with (or willingness to collaborate on) full-stack technologies like React and TypeScript

What the JD emphasized

  • architecting and operating production-grade ML platforms, services and distributed backends
  • shaping your own technical roadmap
  • taking extreme ownership of system reliability, costs, and model performance
  • architect, scale, and secure reliable, highly observable distributed systems
  • drive complex projects through ambiguous spaces

Other signals

  • AI/ML platform architecture
  • Generative AI and LLM orchestration
  • Production-grade AI Agents and Advanced Chatbots
  • Multi-Agent Systems
  • Scalable backend services and data pipelines