Software Engineering Lmts - Hyderabad (java + Spark + Ai/ml)

Salesforce Salesforce · Enterprise · Hyderabad, India

This role focuses on engineering the next generation of autonomous Data Governance and management solutions within Salesforce's IDMC team. It involves a full-stack backend mindset, combining data expertise with Spark, modern Data Lake architectures, and Generative AI. The responsibilities include end-to-end SDLC ownership, technical leadership, scalable backend development using Java/Python and Apache Spark, data architecture for Data Lakes, data modeling, and implementing automated testing and CI/CD pipelines. While the core role is in data engineering and backend development, there's a preferred skill in AI/GenAI, specifically LLMs, prompt engineering, and RAG agents for data classification or metadata discovery.

What you'd actually do

  1. Drive the full SDLC: requirements, system design, automated testing, deployment, and performance tuning
  2. Provide architectural guidance and mentorship; lead code reviews, design sessions, and technical roadmaps for Data Governance features
  3. Design and implement high-performance backend services using Java/Python and Apache Spark for massive dataset processing within IDMC
  4. Build and optimize modern Data Lake solutions for seamless data movement and governance across cloud environments
  5. Architect complex data models and optimize queries across distributed storage systems

Skills

Required

  • Java
  • Python
  • Apache Spark
  • RDBMS
  • Git
  • CI/CD
  • Agile/Scrum
  • Automated testing frameworks
  • B.S., B.E., or B.Tech in Computer Science or a related technical field

Nice to have

  • Lakehouse patterns
  • Apache Iceberg
  • Graph Databases
  • Neo4j
  • Elasticsearch
  • LLMs
  • prompt engineering
  • RAG agents
  • AWS
  • Azure
  • GCP
  • Docker
  • Kubernetes
  • Data Cataloging
  • Data Quality
  • Metadata Management
  • Salesforce Data Cloud
  • M.S. in Computer Science

What the JD emphasized

  • 6+ years of professional software development in an enterprise environment
  • Strong proficiency in Java (Multithreading, JVM tuning) or Python
  • Hands-on experience with Apache Spark for large-scale data processing and transformation
  • Deep understanding of RDBMS concepts and distributed storage systems