Principal Data Engineer

Oracle Oracle · Enterprise · BENGALURU, KARNATAKA, India

This Principal Data Engineer role focuses on designing and optimizing data pipelines, infrastructure, and database architectures. It involves hands-on statistical analysis and predictive modeling, including regression, trend forecasting, and time-series modeling, to extract actionable insights from complex, high-volume data. The role requires collaboration with data scientists and business stakeholders to ensure high-quality datasets for strategic analysis and decision-making within Oracle's ADW and OCI environments.

What you'd actually do

  1. design scalable data pipelines
  2. optimize data infrastructure
  3. ensure the availability of high-quality datasets for strategic analysis
  4. hands-on involvement in statistical analysis and predictive modeling
  5. mentor junior team members and share best practices in data analysis, modeling, and domain expertise

Skills

Required

  • data engineering
  • analytics
  • designing scalable database architectures
  • building and optimizing data pipelines
  • applying statistical analysis
  • big data frameworks (Apache Spark, Apache Flink, Apache Airflow, Presto, Kafka)
  • data warehouse solutions
  • cloud platform collaboration (compute, networking, search, store)
  • database structure design and optimization (Oracle ADW, OCI)
  • ETL processes
  • data quality assessments
  • data integrity
  • SQL

Nice to have

  • statistical methods
  • hypothesis testing
  • data distribution
  • regression analysis
  • probability
  • Python for data analysis and statistical modeling
  • pandas
  • NumPy
  • SciPy
  • anomaly detection
  • data validation processes
  • visualization tools (Tableau, Power BI, Oracle Analytics Cloud)
  • Matplotlib
  • Seaborn
  • relational database management
  • Oracle Data Integrator (ODI)
  • predictive modeling
  • time-series modeling

What the JD emphasized

  • 7+ years of experience in data engineering and analytics
  • design scalable database architectures
  • building and optimizing data pipelines
  • applying statistical analysis to deliver strategic insights across complex, high-volume data environments