Database Reliability Engineer (remote, Aus)

CrowdStrike CrowdStrike · Enterprise · Melbourne, Australia, Australia +3 · Remote

CrowdStrike is seeking an experienced Database Reliability Engineer to manage and automate large-scale cloud-based critical systems, including Cassandra, ElasticSearch/OpenSearch, Kafka, Redis, Valkey, MySQL, and PostgreSQL, within Kubernetes environments. The role involves designing Kubernetes-native solutions, developing infrastructure services, troubleshooting production issues, and ensuring the safety, security, and availability of petabytes of data.

What you'd actually do

  1. Maintain a deep understanding of the data components - including Cassandra, ElasticSearch/OpenSearch, Kafka, Zookeeper, MySQL and PostgreSQL, and use that understanding to operate and automate properly configured clusters.
  2. Operate and manage databases in Kubernetes (K8s) environments, including cluster orchestration, container image management, and workload deployment.
  3. Design and implement Kubernetes-native solutions for data services, including StatefulSets, persistent volumes, and resource management for stateful applications.
  4. Develop infrastructure services to support the CrowdStrike engineering team's pursuit of a full devops model.
  5. Work closely with Engineering and Customer Support to troubleshoot time-sensitive production issues, regardless of when they happen.

Skills

Required

  • Configuration management (Chef)
  • Scripting in Python and bash
  • Experience with large scale datastores using technologies like Cassandra, ElasticSearch, Kafka, Zookeeper, and MySQL.
  • Hands-on experience operating and managing databases in Kubernetes environments, including container orchestration, StatefulSets, persistent storage, and resource optimization.
  • Proficiency with Kubernetes tools and concepts: kubectl, Helm, YAML manifests, networking, storage classes, and cluster administration.
  • Experience with large-scale, business-critical Linux environments
  • Experience operating within the cloud, preferably Amazon Web Services, GCP and OCI
  • Proven ability to work effectively with both local and remote teams
  • Track record of making great decisions, particularly when it matters most
  • Excellent communication skills, both verbal and written
  • A combination of confidence and independence with the prudence to know when to ask for help from the rest of the team
  • Bachelor's degree in an applicable field, such as CS, CIS or Engineering

What the JD emphasized

  • large scale datastores using technologies like Cassandra, ElasticSearch, Kafka, Zookeeper, and MySQL
  • operating and managing databases in Kubernetes environments
  • large-scale, business-critical Linux environments
  • Track record of making great decisions, particularly when it matters most