Data Engineer, Cat Digital Data & AI

Caterpillar Caterpillar · Industrial · Irving, TX

Data Engineer responsible for developing Python data pipelines to build business data objects, leveraging GenAI developer tools and frameworks like LangChain for enhanced productivity and problem-solving in an enterprise AI context.

What you'd actually do

  1. Competent to perform all programming, project management, and development assignments without close supervision; normally assigned the more complex aspects of systems work.
  2. Works directly on complex application/technical problem identification and resolution.
  3. Interpreting design requirements for engineering implementation
  4. Building and deploying CICD pipelines
  5. Implementing source to target mapping as pipeline code

Skills

Required

  • Software Development
  • Software Development Life Cycle
  • Software Product Design/Architecture
  • Software Product Technical Knowledge
  • Software Product Testing
  • Python
  • Git
  • AWS
  • Snowflake
  • GenAI developer tools
  • Lang Chain
  • Lang Graph

Nice to have

  • GitHub Copilot or Claude Code
  • AWS SageMaker

What the JD emphasized

  • Hands-on experience with GenAI developer tools
  • Applied knowledge of GenAI tools in real use cases
  • Ability to learn and adapt in a rapidly evolving space

Other signals

  • Hands-on experience with GenAI developer tools
  • Applied knowledge of GenAI tools in real use cases
  • Understanding of Lang Chain and Lang Graph frameworks