Sr. Member of Technical Staff

Cerebras · Semiconductors · Headquarters · Software

This role focuses on developing and maintaining cloud-based deployment workflows for AI inference software, using containerization and orchestration technologies such as Docker and Kubernetes. Responsibilities include ensuring system resiliency and high availability, optimizing performance for low-latency inference, and debugging, monitoring, and documenting inference services, with a strong emphasis on infrastructure-as-code and CI/CD practices.

What you'd actually do

  1. Design and develop software features that support system resiliency and high availability, including automated recovery mechanisms and fault-tolerant architecture across distributed environments.
  2. Develop and maintain cloud-based deployment workflows for AI inference software using AWS tools and services to support low-latency and scalable system performance.
  3. Develop Python-based scripts and APIs to streamline data preprocessing, inference execution, and post-processing for real-time inference tasks.
  4. Develop inference software in Docker containers and define Kubernetes orchestration strategies that ensure software reliability and efficient scaling.
  5. Triage and resolve defects in the software service by analyzing logs, metrics, and distributed traces using tools like AWS CloudWatch, Grafana, or custom Python scripts.
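Duties 3 and 4 above can be pictured as a small HTTP inference service: a minimal sketch in Flask (which appears in the skills list), assuming a hypothetical preprocess → infer → postprocess pipeline and a `/healthz` endpoint for Kubernetes liveness/readiness probes. The model call is a placeholder; all route and field names are illustrative, not from the JD.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

def preprocess(payload):
    # Hypothetical: normalize the raw JSON body into a token list.
    return payload.get("text", "").split()

def run_inference(tokens):
    # Placeholder for the real model call; here we just count tokens.
    return {"token_count": len(tokens)}

def postprocess(result):
    # Hypothetical: attach a status field for downstream consumers.
    return {"status": "ok", **result}

@app.route("/healthz")
def healthz():
    # Target for a Kubernetes liveness/readiness probe.
    return jsonify({"healthy": True})

@app.route("/infer", methods=["POST"])
def infer():
    tokens = preprocess(request.get_json(force=True))
    return jsonify(postprocess(run_inference(tokens)))
```

In a containerized deployment like the one the JD describes, this process would run behind a WSGI server inside a Docker image, with the `/healthz` route wired to the pod's probe configuration.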

Skills

Required

  • Terraform
  • AWS CloudFormation
  • AWS CDK
  • Ansible
  • Docker
  • Kubernetes
  • AWS EKS
  • AWS Elastic Container Service (ECS)
  • AWS Fargate
  • Helm
  • AWS EC2
  • AWS Lambda
  • Auto Scaling Groups
  • AWS CloudWatch
  • AWS X-Ray
  • ELK (Elasticsearch, Logstash, Kibana)
  • Prometheus
  • Grafana
  • Python
  • Node.js
  • JavaScript
  • Flask
  • PostgreSQL
  • Redis
  • NFS
  • Jenkins
  • Git

What the JD emphasized

  • low-latency
  • scalable system performance
  • real-time inference tasks
  • efficient scaling
  • distributed traces

Other signals

  • Develop and maintain cloud-based deployment workflows for AI inference software
  • Develop inference software in Docker containers and define Kubernetes orchestration strategies
  • Debug issues related to model deployment, container orchestration, networking configurations
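The JD's triage duty mentions analyzing logs with "custom Python scripts" and watching latency. A minimal sketch of that idea, assuming a hypothetical structured log format (one JSON object per line with `route` and `latency_ms` fields) and a nearest-rank p99 approximation; the threshold and field names are illustrative.

```python
import json
from statistics import mean

def p99(latencies):
    # Nearest-rank approximation of the 99th percentile.
    s = sorted(latencies)
    return s[int(0.99 * (len(s) - 1))]

def triage(log_lines, threshold_ms=200):
    # Group per-request latencies by route, then flag routes whose
    # tail latency exceeds the threshold.
    by_route = {}
    for line in log_lines:
        rec = json.loads(line)
        by_route.setdefault(rec["route"], []).append(rec["latency_ms"])
    return {
        route: {
            "mean_ms": mean(v),
            "p99_ms": p99(v),
            "slow": p99(v) > threshold_ms,
        }
        for route, v in by_route.items()
    }
```

In practice the same aggregation is usually done in CloudWatch Logs Insights or Grafana; a script like this is the fallback when logs are pulled locally for a specific incident.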