Staff Data Engineer

MongoDB MongoDB · Enterprise · Gurgaon, India · Data & Platform

Staff Data Engineer role focused on building ETL pipelines and an Internal Data Platform, with a specific responsibility to design and build AI agents for task automation. Requires extensive data engineering experience and thorough AI knowledge, particularly in codegen tools and agentic frameworks.

What you'd actually do

  1. Design and build AI agents that can help automate many of the common development and support tasks that the team performs
  2. Guide the Data Engineering team on building highly performance ETL pipelines using Spark and other Big Data technologies
  3. Help design the architecture of our Internal Data Platform to support the implementation of a robust medallion architecture
  4. Provide thought leadership on ways to achieve infrastructure cost savings on Cloud hyperscalers
  5. Work with Security and Compliance teams to ensure that datasets have appropriate permissions and regulations in place

Skills

Required

  • Spark
  • Python
  • AWS or GCP
  • enterprise data lakes/warehouses
  • codegen tools
  • agentic frameworks

Nice to have

  • real-time or streaming data technologies
  • Hive
  • Iceberg
  • Glue
  • Parquet
  • Avro
  • JSON

What the JD emphasized

  • 10+ years experience working on enterprise data lakes/warehouses
  • 5+ years of Spark and Python experience
  • 5+ years of direct hands-on experience working with AWS or GCP
  • Thorough AI knowledge, particularly with codegen tools and agentic frameworks

Other signals

  • Design and build AI agents that can help automate many of the common development and support tasks that the team performs
  • Thorough AI knowledge, particularly with codegen tools and agentic frameworks