Senior Data Engineer

CVS Health CVS Health · Healthcare · Work at Home, IL +50 · Innovation and Technology

Senior Data Engineer at CVS Health responsible for designing and implementing data pipelines for analytical capabilities, focusing on ETL/ELT processes, performance optimization, data quality, and data modeling to support analytics, reporting, and machine learning requirements. The role involves cloud environments and collaboration with data scientists and analysts.

What you'd actually do

  1. Design and build ETL/ELT data pipelines to ingest, process, and transform large datasets from multiple sources.
  2. Implement best practices for performance tuning, partitioning, and clustering to optimize data queries and reduce costs.
  3. Establish and enforce data quality standards, data governance frameworks, and security policies for data storage and access.
  4. Develop and optimize data models and schemas to support analytics, reporting, and machine learning requirements.
  5. Collaborate with data scientists and analysts to design data solutions that integrate with BI tools and machine learning models.

Skills

Required

  • 5+ years of applicable work experience
  • Proficiency in Python, specifically with ETL pipelines.
  • Strong proficiency in SQL and experience in developing complex queries.
  • Familiarity with pySpark, DBT, or other similar frameworks.
  • Experience deploying data pipelines in a cloud environment (Azure, AWS, GCP).
  • Understanding of data warehousing concepts, dimensional modeling, and building data marts.
  • Excellent communication and interpersonal skills, with the ability to collaborate effectively with data scientists, analysts, and product owners.

Nice to have

  • Knowledge of data governance best practices in a cloud environment.
  • Experience with data design in BigQuery
  • Experience working with the Epic data model.
  • Experience working with healthcare data (Claims and Admissions)