Associate Data Solutions Architect

Boeing · Aerospace · Seattle, WA +3

The Associate Data Solutions Architect will design, build, and operate data pipelines and workloads on cloud platforms (GCP, AWS) to support analytics and AI use cases in supply chain. Responsibilities include implementing ETL/ELT processes, developing data transformations, building data workflows, and collaborating with data scientists and analysts.

What you'd actually do

  1. Implement, test, and maintain Extract, Transform, Load / Extract, Load, Transform (ETL/ELT) pipelines to ingest data from on-prem systems and cloud sources into data lakes/warehouses
  2. Develop data transformations and data models to support reporting, analytics, and Machine Learning (ML) workloads
  3. Build and maintain batch and streaming data workflows using orchestration tools (Airflow, Cloud Composer, Prefect, etc.) and data integration platforms (Informatica, Talend, Fivetran, etc.)
  4. Work with on-prem technologies (databases, file shares, middleware) and cloud services (BigQuery, Redshift, Cloud Storage, S3, Dataflow, Glue) to move and transform data
  5. Collaborate with data owners to profile data, identify quality issues, and implement data validation and cleansing rules

Skills

Required

  • 1+ years of experience building data pipelines and ETL/ELT processes and/or performing data engineering tasks
  • 1+ years of experience with SQL
  • Experience designing queries and data models for analytics
  • Experience with scripting languages (Python or R) for data processing and automation
  • Experience with orchestration tools (Airflow, Prefect, Cloud Composer)
  • Experience with relational databases (Oracle, SQL Server, Postgres)
  • Experience with cloud data services (BigQuery, Redshift, Snowflake, and/or equivalent)
  • Experience with data quality, basic data governance, and logging/monitoring concepts

Nice to have

  • Bachelor’s degree in Computer Science, Information Systems, Engineering, or related field and/or equivalent experience
  • 3+ years of experience with data engineering across on-prem and cloud environments
  • Strong troubleshooting skills and a collaborative mindset
  • Good written and verbal communication skills
  • Experience with GCP and/or AWS cloud platforms (BigQuery, Cloud Storage, Dataflow, Pub/Sub, Dataproc, S3, Redshift, Glue, Lambda)
  • Experience building streaming and/or near-real-time pipelines (Kafka, Pub/Sub, Kinesis, Spark Streaming)
  • Experience building ETL pipelines with data integration platforms (Informatica, Talend, Fivetran, etc.)
  • Experience with infrastructure-as-code tooling (Terraform, CloudFormation) and CI/CD practices for data workflows
  • Experience implementing data cataloging/lineage tools (Data Catalog, Amundsen, Collibra, Alation) and data quality frameworks (Great Expectations, AWS Glue Data Quality)
  • Experience with containerization (Docker) and deploying microservices and/or data jobs on Kubernetes
  • Experience tuning SQL and data processing jobs for cost optimization on cloud platforms
  • Experience working in supply chain, manufacturing, and/or aerospace environments