What you'd actually do

Build & Scale Data Pipelines: Design, develop, and maintain robust batch and real-time data pipelines (using Python, Go, Spark, Kafka) that ingest and transform massive multi-modal data—text, audio, and video—to train and run AI models.

Power Intelligent Features: Collaborate with ML engineers to implement data structures and APIs for new, exciting features like PPT-to-video automation and interactive AI avatars that require low-latency data fetching.

Data Lakehouse Infrastructure: Architect and manage data lakehouse solutions (e.g., Snowflake, Databricks, Apache Iceberg) to store and query unstructured media data efficiently, enhancing storage and computation efficiency.

Data Reliability & Observability: Implement data quality checks, data contracts, and monitoring to ensure high reliability of data, preventing downtime in production video generation.

Productize Data: Transform raw data into structured, actionable data products that can be easily consumed by front-end applications, API endpoints, and AI agents.

About HeyGen

At HeyGen, our mission is to make visual storytelling accessible to all. Over the last decade, visual content has become the preferred method of information creation, consumption, and retention. But the ability to create such content, in particular videos, continues to be costly and challenging to scale. Our ambition is to build technology that equips more people with the power to reach, captivate, and inspire audiences. Learn more at www.heygen.com. Visit our Mission and Culture doc here.

Position Summary

A Software Engineer with data engineering responsibilities to bridge the gap between core application development and large-scale data infrastructure. You will help build the data foundational layers for our next-generation features. This role is not just about moving data—it’s about enabling AI models to function in real-time, building robust pipelines for multimedia, and powering engaging user experiences. This team is currently working on cutting-edge features including PPT-to-video converters and interactive, conversational video capabilities.

Core Responsibilities

Build & Scale Data Pipelines: Design, develop, and maintain robust batch and real-time data pipelines (using Python, Go, Spark, Kafka) that ingest and transform massive multi-modal data—text, audio, and video—to train and run AI models.
Power Intelligent Features: Collaborate with ML engineers to implement data structures and APIs for new, exciting features like PPT-to-video automation and interactive AI avatars that require low-latency data fetching.
Data Lakehouse Infrastructure: Architect and manage data lakehouse solutions (e.g., Snowflake, Databricks, Apache Iceberg) to store and query unstructured media data efficiently, enhancing storage and computation efficiency.
Data Reliability & Observability: Implement data quality checks, data contracts, and monitoring to ensure high reliability of data, preventing downtime in production video generation.
Productize Data: Transform raw data into structured, actionable data products that can be easily consumed by front-end applications, API endpoints, and AI agents.

Qualifications

Bachelor’s/Master’s degree in Computer Science, Engineering, or a related field.
3-5+ years of experience as a Backend Software Engineer with heavy data processing responsibilities.
Strong proficiency in Python (for ETL/scripting) and SQL (for data modeling).
Experience with cloud platforms (AWS/GCP) and data technologies like Kafka, Spark, and Snowflake/Databricks.
Experience or interest in Computer Vision/Generative AI data processing.
Proactive, "owner" mindset; ability to operate in a fast-paced, startup environment.

What HeyGen Offers

Competitive salary and benefits package.
Dynamic and inclusive work environment focused on innovation and creativity.
Opportunities for professional growth and skill development.
Collaborative culture that values teamwork and employee input.
Access to state-of-the-art technologies and tools.

Salary Range $180,000 – $220,000 + equity + benefits Please note that the salary information is a general guideline only. HeyGen considers factors such as scope and responsibilities of the position, candidate's work experience, education/training, key skills, and internal equity, as well as location, market and business considerations when extending an offer. As part of our total rewards package, HeyGen offers comprehensive benefits including equity, a 401k plan, health benefits, generous PTO, a parental leave program and emotional health resources.

HeyGen is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Join us at HeyGen and be part of a team that's reshaping the world of video creation through innovative technology!

About HeyGen

Position Summary

Core Responsibilities

Data Reliability & Observability: Implement data quality checks, data contracts, and monitoring to ensure high reliability of data, preventing downtime in production video generation.

Productize Data: Transform raw data into structured, actionable data products that can be easily consumed by front-end applications, API endpoints, and AI agents.

Qualifications

Bachelor’s/Master’s degree in Computer Science, Engineering, or a related field.

3-5+ years of experience as a Backend Software Engineer with heavy data processing responsibilities.

Strong proficiency in Python (for ETL/scripting) and SQL (for data modeling).

Experience with cloud platforms (AWS/GCP) and data technologies like Kafka, Spark, and Snowflake/Databricks.

Experience or interest in Computer Vision/Generative AI data processing.

Proactive, "owner" mindset; ability to operate in a fast-paced, startup environment.

What HeyGen Offers

Competitive salary and benefits package.

Dynamic and inclusive work environment focused on innovation and creativity.

Opportunities for professional growth and skill development.

Collaborative culture that values teamwork and employee input.

Access to state-of-the-art technologies and tools.

HeyGen is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Join us at HeyGen and be part of a team that's reshaping the world of video creation through innovative technology!

Software Engineer, Data

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

About HeyGen

About HeyGen