Data Scientist, Network Fabric Engineering

Amazon Amazon · Big Tech · NSW, Australia +1 · Data Science

Data Scientist role focused on defining and driving the data science strategy for network operations automation, including agentic systems. The role involves defining metrics, building risk and reliability models, and evaluating the performance of automation and AI systems to improve network availability. It emphasizes statistical rigor and evidence-based decision-making within a team of network and software engineers.

What you'd actually do

  1. Define and drive the data science strategy for the program — the metrics, the experiments, and what counts as evidence that a change worked
  2. Lead the design and deployment of predictive risk and reliability models for network availability, using device failures, alarm telemetry, ticket data, and traffic signals
  3. Own the evidence behind program decisions: where availability is at risk, where automation is ready to expand, where engineering effort has the highest leverage. Defend recommendations to senior technical and business audiences
  4. Design and own the operational analytics and dashboards (Amazon QuickSight, Amazon CloudWatch, Python) used by senior leadership to track network health and the impact of operational change
  5. Design and run experiments to evaluate the automation we are rolling out — including agentic systems supporting engineers on incidents — measuring whether each rollout improved availability

Skills

Required

  • 2+ years of data scientist experience
  • 3+ years of data querying languages (e.g. SQL), scripting languages (e.g. Python) or statistical/mathematical software (e.g. R, SAS, Matlab, etc.) experience
  • 3+ years of machine learning/statistical modeling data analysis tools and techniques, and parameters that affect their performance experience
  • 1+ years of working with or evaluating AI systems experience

Nice to have

  • Bachelor's degree or above in Science, Technology, Engineering, or Mathematics (STEM)
  • Knowledge of machine learning concepts and their application to reasoning and problem-solving
  • Experience in Python, Perl, or another scripting language
  • Experience in a ML or data scientist role with a large technology company
  • Experience in defining and creating benchmarks for assessing GenAI model performance

What the JD emphasized

  • evaluating AI systems
  • agentic systems
  • evaluate the automation

Other signals

  • agentic systems
  • automation
  • risk and reliability models
  • evaluations