Software Engineer - Resiliency

Cloudflare · Enterprise · India · Remote · Infrastructure

Software Engineer focused on Resiliency within Cloudflare's Infrastructure Team. The role involves developing and maintaining the systems that manage Cloudflare's infrastructure at scale, and extending the automation used to provision, expand, repair, and decommission servers. The engineer is also expected to integrate AI tools such as LLMs and predictive analytics into their workflow for tasks like synthesizing partner intelligence, monitoring regulatory shifts, and automating regional reporting, with the aim of reducing manual effort and supporting company growth.

What you'd actually do

  1. develop and maintain the systems that manage Cloudflare’s infrastructure at scale
  2. expand and evolve the suite of automations that allow our Infrastructure Operations partners to provision, expand, repair, and decommission our rapidly growing fleet of servers
  3. integrate AI into your workflow, using LLMs and predictive analytics to synthesize partner intelligence, monitor regulatory shifts, and automate regional reporting
  4. collaborate with the team to understand business needs and develop technical solutions
  5. work closely with internal customers to understand their requirements

Skills

Required

  • 5+ years of hands-on software development experience on meaningfully complex systems
  • Excellent proficiency in one of Python, Go, or Rust, with deep understanding of the language and its best practices
  • Experience working with Linux systems
  • Basic networking knowledge (VLANs, routing, DNS, etc.)
  • Experience building both backend systems and frontend widgets
  • Ability to contribute to planning, development, and execution to meet commitments and deliver with predictability
  • Experience implementing tools, processes, internal instrumentation, and methodologies
  • Comfortable working on projects with tight deadlines and short release cycles
  • Excellent verbal and written English language skills

Nice to have

  • Experience with DCIM, CMDB, IPAM, and other Data Center and Asset Lifecycle Management tools

What the JD emphasized

  • AI-native curiosity to create a solution using the latest tools
  • AI is a tool for strategic mastery
  • integrate AI into your workflow