Principal Specialist, Senior Data Engineer

RTX RTX · Aerospace · bengaluru, Karnātaka, India · Digital Technology

This role is for a Senior Data Engineer at RTX, a large aerospace and defense company. The primary responsibilities include designing and implementing data pipelines for ingesting, processing, and transforming data, developing and optimizing ELT processes, implementing data quality checks, and establishing patterns for scalable ETL/ELT work in a cloud environment. The role also involves data governance and collaborating with global business teams to design scalable solutions. Requires 7+ years of experience as a Data Engineer with strong skills in Python, Pyspark, Databricks, SQL, and GitHub, along with experience in data warehousing and ETL tools.

What you'd actually do

  1. Collaborate with cross-functional teams to understand data requirements and design data pipelines to ingest, process, and transform data from various sources.
  2. Develop, optimize, and maintain ELT (Extract, Load, Transform) processes to ensure efficient data extraction and loading.
  3. Implement data quality checks and monitoring mechanisms to ensure data accuracy, quality, and consistency.
  4. Develop ETL pipelines in cloud environment for general business consumption, establishing patterns for repeatable, scalable ETL/ELT work that others can leverage in their work.
  5. Implement data governance and master data management principles across specific project execution.

Skills

Required

  • Data Science
  • Engineering
  • Mathematics
  • Statistics
  • Data Engineer
  • data warehousing technologies
  • ETL tools
  • Python
  • YAML
  • Pyspark
  • databricks
  • GitHub
  • Agile methodology
  • SQL
  • English
  • interpersonal skills
  • collaboration
  • problem-solving

Nice to have

  • Unity catalog
  • CICD pipeline
  • Aerospace & Defense Processes
  • gas turbine engine fundamentals
  • Data Governance
  • Architecture
  • Platforms

What the JD emphasized

  • 7+ years of experience working as a Data Engineer
  • Hands-on experience with data warehousing technologies and ETL tools
  • Hands on experience with Python, YAML, Pyspark and databricks
  • Strong database knowledge in SQL.