Lead Agentic Data Systems Engineer

Salesforce Salesforce · Enterprise · Mexico City, Mexico

Lead Agentic Data Systems Engineer responsible for architecting and maintaining a private ecosystem of 10+ autonomous agents for ETL, synthetic data generation, automated QA, and predictive modeling. The role involves designing multi-step reasoning architectures, verification protocols, and governance for agents, as well as developing MCP servers for data access. Requires strong Python, SQL, dbt, Airflow, Spark, Snowflake, and agentic framework expertise.

What you'd actually do

  1. Architect and maintain a private ecosystem of 10+ autonomous agents specialized in ETL, synthetic data generation, automated QA, and predictive modeling.
  2. Design multi-step reasoning architectures and verification protocols to ensure agents autonomously validate and peer-review their own outputs.
  3. Transform high-level, ambiguous business requirements into production-ready data products independently, bypassing the need for mid-level project management.
  4. Use domain knowledge to ensure deployed tools are well governed. Governance as code for data pipelines and Agentic development. Context aware Agent development.
  5. Develop and maintain Model Context Protocol (MCP) servers to provide agents with secure, deep-link access to Snowflake, Salesforce, AWS, and proprietary internal data catalogs.

Skills

Required

  • Python
  • dbt
  • Airflow
  • SQL
  • Apache Spark
  • Snowflake
  • Prompt Engineering
  • LangGraph
  • chain-of-thought prompting
  • self-correction loops
  • iterative reasoning paths
  • Docker
  • Kubernetes
  • serverless compute environments
  • 5+ years of experience in high-stakes Data Engineering, Architecture, or Data Science
  • generative AI to accelerate personal and departmental output
  • end-to-end data strategy for a business unit with minimal supervision
  • analytical judgment

Nice to have

  • Cursor
  • Codex
  • Claude Code
  • Salesforce Core and Data 360
  • Data Mesh
  • Data-as-a-Product (DaaP)
  • Event-Driven Architectures
  • Semantic layer
  • Knowledge Graphs

What the JD emphasized

  • production-grade data products
  • end-to-end
  • make a data product actually work, at quality, every day
  • production-ready data products
  • documented history of using generative AI to accelerate personal and departmental output by orders of magnitude
  • function as a "Domain Data Officer," managing end-to-end data strategy for a business unit with minimal supervision
  • Superior analytical judgment—the ability to identify subtle logic errors or hallucinations in agentic output before they reach production.

Other signals

  • building autonomous agents
  • multi-agent ecosystems
  • agentic orchestration
  • governance as code for agentic development
  • AI-native development environments