Member of Technical Staff (software Engineer)

Cerebras Cerebras · Semiconductors · Headquarters +1 · Software

Software Engineer role focused on developing and maintaining high-performance, low-latency inference infrastructure, deploying and optimizing scalable inference services using Kubernetes and Docker, and collaborating with ML engineers to ensure reliable production-ready ML infrastructure.

What you'd actually do

  1. Implement infrastructure to support high-performance, low-latency inference service.
  2. Deploy and configure Kubernetes services to ensure scalability and reliability of inference workloads.
  3. Optimize resource allocation and auto-scaling policies to handle variable inference demand while minimizing operational costs.
  4. Integrate inference services with containerized environments using Docker and Kubernetes for orchestration.
  5. Ensure high availability and fault tolerance by implementing multi-region deployments and disaster recovery strategies.

Skills

Required

  • Docker
  • Kubernetes
  • Java
  • C++
  • ActiveMQ
  • Kafka
  • Python
  • Groovy
  • JavaScript
  • TypeScript
  • Linux
  • SQL
  • OracleDB
  • Redis
  • Git

What the JD emphasized

  • high-performance, low-latency inference infrastructure
  • deploying and optimizing scalable inference services
  • Kubernetes
  • Docker

Other signals

  • Deploy and optimize scalable inference services
  • High-performance, low-latency inference infrastructure
  • Kubernetes and Docker for orchestration