Software Engineer, Data Infrastructure

Cohere · AI Frontier · New York, NY · Agentic Platform

Build and maintain the high-performance data layer for AI training and evaluation workloads, working on petabyte-scale storage infrastructure and distributed data processing.

What you'd actually do

  1. Work directly on petabyte-scale storage infrastructure and the networking and performance challenges that come with it.
  2. Collaborate daily with researchers and engineers who are among the best in the world at what they do.

Skills

Required

  • Python
  • Kubernetes
  • Distributed data processing frameworks
  • S3
  • GCS
  • POSIX

Nice to have

  • BigQuery
  • Airflow
  • dbt

What the JD emphasized

  • 4+ years of experience working on data storage infrastructure
  • Comfort operating at the edge of what's known, with a desire to build something genuinely new rather than optimize what already exists

Other signals

  • petabyte-scale storage infrastructure
  • high-performance data layer
  • training and evaluation jobs