Software Engineer: Distributed Systems (infrastructure)

Cloudflare · Enterprise · Austin, TX; Washington, DC · Engineering

Software Engineer role focused on building a 'Maintenance Coordination System' powered by an infrastructure dependency graph to enable intelligent, autonomous systems for network orchestration. This is a foundational step towards next-generation data-driven infrastructure, involving complex, globally distributed systems.

What you'd actually do

  1. Work as part of a Resiliency Organization responsible for the core services that power Cloudflare’s global operations
  2. Join the Infrastructure Intelligence team and shape the transition toward model-driven network orchestration
  3. Build a cutting-edge 'Maintenance Coordination System', powered by an infrastructure dependency graph of one of the world's largest physical networks
  4. Take a foundational step toward designing intelligent, autonomous systems that will transform the orchestration of Cloudflare’s network
  5. Create the robust primitives for global coordination today, enabling the next generation of data-driven infrastructure at Cloudflare

Skills

Required

  • Programming experience in Go or similar languages
  • Experience in designing and implementing secure and highly available distributed systems
  • Experience with (and love for) debugging to ensure the system works in all cases
  • Experience with a continuous integration workflow and using source control (we use git)
  • Experience with continuous delivery and deployment of a Kubernetes-hosted application
  • Understanding of security issues and responsibilities
  • Experience with monitoring, alerting, and debugging high-volume production systems
  • Fluency in analyzing data sets such as logs
  • Strong oral and written English communication skills
  • Experience designing and building APIs
  • At least 4 years of hands-on software development experience on meaningfully complex systems
  • An understanding of the systems architecture required to scale machine learning model-driven decision engines in a production environment

Nice to have

  • Experience with the Cloudflare development stack
  • Experience with graph theory and building services for graph generation, storage and retrieval
  • Experience building both backend systems and frontend widgets
  • Ability to contribute to planning, development, and execution to meet commitments and deliver with predictability
  • Experience implementing tools, processes, internal instrumentation, and methodologies
  • Comfortable working on projects with tight deadlines and short release cycles
  • Experience with DCIM, CMDB, IPAM, and other data center and asset lifecycle management tools
  • Experience with data ingestion and analysis, such as pulling metrics from hundreds of edge data centers

What the JD emphasized

  • model-driven network orchestration
  • intelligent, autonomous systems
  • data-driven infrastructure
