Data Scientist II - Amz9774210

Amazon Amazon · Big Tech · NY +1 · Corporate Operations

Design and implement scalable and reliable approaches to support or automate decision making throughout the business. Apply a range of data science techniques and tools combined with subject matter expertise to solve difficult business problems and cases in which the solution approach is unclear. Acquire data by building the necessary SQL / ETL queries. Import processes through various company specific interfaces for accessing Oracle, RedShift, and Spark storage systems. Build relationships with stakeholders and counterparts. Analyze data for trends and input validity by inspecting univariate distributions, exploring bivariate relationships, constructing appropriate transformations, and tracking down the source and meaning of anomalies. Build models using statistical modeling, mathematical modeling, econometric modeling, network modeling, social network modeling, natural language processing, machine learning algorithms, genetic algorithms, and neural networks. Validate models against alternative approaches, expected and observed outcome, and other business defined key performance indicators. Implement models that comply with evaluations of the computational demands, accuracy, and reliability of the relevant ETL processes at various stages of production.

What you'd actually do

  1. Design and implement scalable and reliable approaches to support or automate decision making throughout the business.
  2. Acquire data by building the necessary SQL / ETL queries.
  3. Analyze data for trends and input validity by inspecting univariate distributions, exploring bivariate relationships, constructing appropriate transformations, and tracking down the source and meaning of anomalies.
  4. Build models using statistical modeling, mathematical modeling, econometric modeling, network modeling, social network modeling, natural language processing, machine learning algorithms, genetic algorithms, and neural networks.
  5. Validate models against alternative approaches, expected and observed outcome, and other business defined key performance indicators.

Skills

Required

  • building statistical models and machine learning models using large datasets from multiple resources
  • writing SQL scripts for analysis and data migration
  • applying specialized modelling software including R, Python, or MATLAB

Other signals

  • building statistical models
  • machine learning algorithms
  • large datasets