Principal Software Engineer

Autodesk · Enterprise · Singapore

The Principal Software Engineer owns architecture and technical design for a large-scale, API-first streaming platform built on Apache Flink, Apache Kafka, and AWS. The platform powers near-real-time data ingestion, processing, and delivery for internal and product-facing use cases. Responsibilities include defining the end-to-end architecture, designing API-first boundaries, shaping multi-tenant patterns, evolving the AWS-native deployment, building platform services, improving developer experience, and engineering for scale, reliability, and cost. The role requires expert-level experience with Kafka and Flink, strong Java, Spring Boot, and AWS skills, and proven experience with CI/CD and infrastructure management.

What you'd actually do

  1. Architect and design the streaming platform
  2. Design API-first boundaries (REST and event-driven patterns as appropriate) so streams, jobs, schemas, and operational workflows are manageable and safe at scale
  3. Shape multi-tenant or multi-team patterns: quotas, isolation, naming, observability, and upgrade strategies for long-running Flink workloads
  4. Evolve AWS-native deployment and operations (networking, IAM, scaling, resilience patterns) aligned with organizational security and compliance requirements
  5. Lead design of Spring Boot-based services that implement the platform control plane, metadata, orchestration hooks, and integration with Kafka/Flink operations

Skills

Required

  • Apache Kafka
  • Apache Flink
  • Java
  • Spring Boot
  • AWS
  • CI/CD
  • Infrastructure as Code
  • distributed systems
  • API design
  • event-driven patterns
  • observability
  • testing
  • container orchestration

Nice to have

  • schema governance
  • serialization formats
  • AWS managed streaming services (MSK, Kinesis)
  • observability stacks tailored to Kafka and Flink
  • technical standards, guilds, or cross-org architecture forums

What the JD emphasized

  • Deep hands-on ownership of distributed systems in production
  • Expert-level experience with Apache Kafka (topics, partitioning, consumer groups, reliability, operations) and Apache Flink (state, checkpoints, rescaling, fault tolerance, performance tuning)
  • Sustained production ownership, not occasional use