Sr. Software Engineer - Data Infrastructure

CrowdStrike · Enterprise · New York, NY

Build and manage CrowdStrike's hyper-scale data lake that stops breaches and finds threats. This role involves architecting and operating a big data ecosystem (Kubernetes, Flink, Kafka, Spark, MinIO, HDFS, Trino, Hive, Pinot, Ray) across multiple cloud providers and on-premises data centers, and leading development of state-of-the-art Flink/Spark/Ray platforms on Kubernetes. The primary focus is on data infrastructure to enable analytics, machine learning, and threat hunting at petabyte scale.

What you'd actually do

  1. Build and manage large-scale distributed data processing systems; own Flink/Spark infrastructure on Kubernetes/AWS/GCP/OCI
  2. Ensure availability, reliability, and security; solve complex distributed systems problems at scale
  3. Design and own APIs, automation, and cluster health dashboards
  4. Deploy DevOps tools (Kubernetes, Terraform, Chef, Ansible, CI/CD); enable engineers and data scientists to run jobs and extract insights
  5. Work with query systems (Trino/Presto, Jupyter Notebooks)

Skills

Required

  • Spark
  • Flink
  • Kubernetes
  • Kafka
  • AWS
  • Terraform
  • S3/MinIO
  • Java
  • Kotlin
  • Scala
  • Go
  • Distributed systems
  • Docker

Nice to have

  • open source committer
  • real-time data streaming
  • event-driven architectures
  • data governance
  • compliance
  • security best practices
  • Contributions to open-source projects
  • cybersecurity
  • high-stakes production environments
  • site reliability engineering
  • DevOps practices
  • Advanced degree in Computer Science

What the JD emphasized

  • Expert-level Spark and/or Flink with Kubernetes
  • Strong Kubernetes/Docker experience
  • Deep understanding of distributed systems (Iceberg, Pinot, Hive, Kafka)
  • Proficient in Java, Kotlin, Scala or Go
  • Experience with Kafka, AWS, Terraform, S3/MinIO or similar storage systems
  • Proven success with distributed teams
  • Strong communication skills