Principal Big Data Software

AT&T AT&T · Telecom · Plano, TX

This role focuses on building and maintaining batch and real-time data pipelines for network and NLP use cases, supporting data scientists and developers. It involves working with Big Data technologies (Hadoop, NoSQL, text mining) in Azure and on-prem environments, managing infrastructure (Hadoop, NiFi, Databricks), and automating processes using CICD/Ansible. The role also supports AI teams and builds Data and AI applications in Azure Cloud.

What you'd actually do

  1. Produce batch and real time data pipelines for our network and NLP based use cases for our data scientists and other developers.
  2. Analyze, design, program, debug, and modify software enhancements and/or new products used in distributed, large-scale analytics, and visualization solutions.
  3. Work on Hadoop administration, Apache NiFi support, and Databricks workspace administration.
  4. Responsible for automation of infrastructure using CICD/ansible or comparable technologies in Azure cloud.
  5. Build Data and AI applications in Azure Cloud.

Skills

Required

  • Hadoop
  • NoSQL
  • text mining
  • Databricks
  • Azure cloud administration
  • CICD
  • Ansible
  • UNIX administration
  • Agile Scrums
  • Azure Delta Lake
  • Snowflake

Nice to have

  • NLP
  • modeling
  • cloud-based environment architecting
  • software development
  • distributed computing
  • 5G Architecture/3GPP

What the JD emphasized

  • Big Data technologies such as Hadoop, NoSQL, text mining, and other distributed environment technologies in azure and on-prem
  • Utilize Hadoop, NoSQL, text mining.
  • Utilize Databricks workspaces.
  • Utilize Azure cloud administration and subscription management.
  • Utilize Agile Scrums.
  • Build Data and AI applications in Azure Cloud.
  • Utilize Databricks, Azure Delta Lake, and Snowflake.