Senior / Staff Software Engineer, Data Infrastructure

Suno Suno · Multimodal · Boston, MA · Engineering

Suno is seeking a Senior/Staff Software Engineer to build and scale their data infrastructure. This role involves designing and operating pipelines for ingesting, processing, and serving large volumes of data, supporting analytics, experimentation, and machine learning. The engineer will focus on reliability, scalability, and observability of the data platform, collaborating with various teams to ensure data quality and timely delivery for product development and improvement.

What you'd actually do

  1. Design, build, and maintain scalable data ingestion pipelines using AWS services such as Kinesis, Firehose, Lambda, S3, and Glue.
  2. Own the architecture and reliability of core data infrastructure, including both streaming and batch pipelines.
  3. Build and maintain data workflows and orchestration using Dagster, ensuring pipelines are modular, testable, and observable.
  4. Partner with backend and product engineers to define event schemas, data contracts, and ingestion patterns from production systems.
  5. Improve the reliability and observability of data pipelines through monitoring, alerting, lineage tracking, and automated recovery mechanisms.

Skills

Required

  • Python
  • AWS Kinesis
  • AWS Firehose
  • AWS Lambda
  • AWS Glue
  • Snowflake
  • Dagster
  • dbt
  • Terraform
  • Data Warehousing
  • Data Modeling
  • Pipeline Design
  • Distributed Systems
  • Streaming Data Processing
  • Batch Data Processing
  • Observability
  • Monitoring
  • Alerting
  • Logging
  • Tracing

Nice to have

  • Airflow

What the JD emphasized

  • 5-7+ years of experience as a Software Engineer, Data Engineer, or in a similar infrastructure-focused role
  • Strong programming skills (Python preferred) with experience building production-grade distributed systems
  • Deep experience building and operating data pipelines, including both real-time (streaming) and batch processing systems
  • Hands-on experience with AWS data services, including Kinesis (Streams), Firehose, Lambda, and Glue
  • Strong experience with Snowflake, including data modeling, performance optimization, and large-scale data processing
  • Experience with orchestration frameworks such as Dagster (or similar tools like Airflow), with an emphasis on building maintainable, observable workflows
  • Proficiency with dbt for managing transformations, testing, and modular data modeling
  • Experience designing systems for reliability and observability (monitoring, logging, alerting, tracing)
  • Familiarity with infrastructure as code (e.g., Terraform) and managing cloud resources programmatically
  • Strong understanding of data warehousing, data modeling, and pipeline design best practices
  • Experience collaborating cross-functionally with data, ML, and product engineering teams
  • Ability to operate in ambiguous environments and make pragmatic, high-quality technical decisions
  • A strong sense of ownership and a track record of delivering scalable infrastructure systems