Senior Data Specialist

Caterpillar Caterpillar · Industrial · Bangalore, Karnataka

AWS Data Engineer responsible for designing, building, and maintaining data pipelines and connectors on AWS, with a focus on data quality, reliability, and validation for enterprise-scale ingestion pipelines supporting search and content experiences.

What you'd actually do

  1. Design, develop, and maintain scalable data pipelines on AWS (batch and near‑real‑time).
  2. Own data quality validation across pipelines, including Data completeness, freshness, consistency, and accuracy checks Schema validation and anomaly detection
  3. Implement automated data quality checks, alerts, and monitoring for pipeline failures or data issues.
  4. Collaborate with search and platform teams to ensure high‑quality indexed data for Coveo sources.
  5. Contribute to CI/CD pipelines and follow enterprise SDLC and security standards.

Skills

Required

  • Data Engineer working on AWS
  • AWS Glue
  • BedRock
  • Lambda
  • Step Functions
  • S3
  • SNS/SQS
  • Python
  • PySpark
  • data quality concepts
  • ETL/ELT patterns
  • data modeling
  • logging
  • monitoring
  • alerting for data pipelines
  • CI/CD practices
  • Git

Nice to have

  • Coveo
  • search platforms
  • content indexing pipelines
  • Snowflake