Sr. Sw Engineer – Pyspark | Spark Sql | Scala | Big Data | Etl

Visa Visa · Fintech · Bengaluru, India, IN

Visa is seeking a Senior Software Engineer for their Corporate Information Technology, Billing & Incentives Platforms team. The role focuses on designing, developing, and optimizing scalable data pipelines using PySpark, Spark SQL, and Scala for large-scale data processing. Responsibilities include building and optimizing ETL workflows, developing Spark jobs, writing efficient Spark SQL queries, architecting big data solutions, ensuring data quality, and performing performance tuning. The role also involves collaborating with stakeholders, creating documentation, following engineering best practices, and mentoring junior engineers. Exposure to AI/GenAI enabled data engineering solutions is a plus.

What you'd actually do

  1. Design, develop, and maintain scalable data pipelines using PySpark, Spark SQL, and Scala
  2. Build and optimize complex ETL workflows for extracting, transforming, and loading data across multiple systems such as Oracle, PostgreSQL, Hive, Hadoop, and cloud-based data platforms
  3. Develop and optimize Spark jobs for large-scale batch and, where applicable, streaming data processing
  4. Write efficient Spark SQL queries and optimize data transformations for performance and scalability
  5. Implement data processing strategies including partitioning, caching, parallel processing, file format optimization, and job tuning

Skills

Required

  • PySpark
  • Spark SQL
  • Scala
  • ETL/ELT data pipelines
  • Apache Spark architecture
  • Spark performance tuning
  • SQL
  • Hive
  • Hadoop
  • HDFS
  • Delta Lake
  • Parquet
  • ORC
  • Oracle
  • PostgreSQL
  • SQL Server
  • MySQL
  • Python
  • distributed computing
  • data partitioning
  • data modeling
  • large-scale data processing patterns
  • data quality checks
  • reconciliation
  • validation
  • exception handling
  • audit controls
  • CI/CD pipelines
  • Git/version control
  • code reviews
  • deployment processes
  • Agile/Scrum

Nice to have

  • AI/GenAI concepts
  • integrating AI/LLM APIs
  • data pipelines supporting ML/AI model development
  • AI tools for ETL code generation
  • data validation
  • anomaly detection
  • data profiling
  • workflow automation
  • AI-assisted development tools