Data Engineer

Caterpillar Caterpillar · Industrial · Chicago, IL +1

Data Engineer responsible for developing Python data pipelines that build business data objects used to support applications. This role involves performing programming, project management, and development assignments without close supervision, working on complex application/technical problem identification and resolution, interpreting design requirements, building and deploying CICD pipelines, implementing source to target mapping as pipeline code, and performing unit and integration testing. The candidate should have experience with software development, the SDLC, software product design/architecture, technical knowledge of software products, and testing. Experience with Python, Git, CI/CD tools, AWS components, Snowflake, and GenAI developer tools is preferred.

What you'd actually do

  1. As a Data Engineer, you will be responsible for developing Python data pipelines that build business data objects used to support applications.
  2. Responsibilities are across functional lines with individuals assigned in new program development
  3. Performs all programming, project management, and development assignments without close supervision; normally assigned the more complex aspects of systems work
  4. Works directly on complex application/technical problem identification and resolution
  5. Interpreting design requirements for engineering implementation

Skills

Required

  • Software Development
  • Software Development Life Cycle
  • Software Product Design/Architecture
  • Software Product Technical Knowledge
  • Software Product Testing

Nice to have

  • Software development experience and solid knowledge of OOP principles
  • Experience in designing and developing software applications in Python
  • Experience working with Git version control on medium to large teams
  • Deploying software using CI/CD tools such as Jenkins, Github Actions, Azure Devops etc.
  • Experience with AWS components such as Lambda, Kinesis, Event Bridge, CloudWatch, CloudFormation, Dynamo, Fargate, S3, Sage Maker, IAM and RDS
  • Experience working with Snowflake
  • Hands-on experience with GenAI developer tools
  • Experience with tools such as GitHub Copilot or Claude Code, or similar AI coding assistants to improve developer productivity, code quality, and delivery speed

What the JD emphasized

  • Python data pipelines
  • CICD pipelines
  • Unit and Integration testing
  • AWS components
  • Snowflake
  • GenAI developer tools
  • GitHub Copilot or Claude Code, or similar AI coding assistants