Data Scientist II - Amz27028.1

Amazon Amazon · Big Tech · Houston, TX · Corporate Operations

Data Scientist II at Amazon Web Services (AWS) in Houston, TX. This role involves designing and implementing scalable data science approaches, acquiring and analyzing data using SQL/ETL, and building models with statistical and machine learning techniques. The position requires a Master's degree or equivalent experience and proficiency in SQL, R, Python, or MATLAB for model building and data analysis.

What you'd actually do

  1. Design and implement scalable and reliable approaches to support or automate decision making throughout the business.
  2. Apply a range of data science techniques and tools combined with subject matter expertise to solve difficult business problems and cases in which the solution approach is unclear.
  3. Acquire data by building the necessary SQL / ETL queries.
  4. Import processes through various company specific interfaces for accessing Oracle, RedShift, and Spark storage systems.
  5. Analyze data for trends and input validity by inspecting univariate distributions, exploring bivariate relationships, constructing appropriate transformations, and tracking down the source and meaning of anomalies.
  6. Build models using statistical modeling, mathematical modeling, econometric modeling, network modeling, social network modeling, natural language processing, machine learning algorithms, genetic algorithms, and neural networks.
  7. Validate models against alternative approaches, expected and observed outcome, and other business defined key performance indicators.
  8. Implement models that comply with evaluations of the computational demands, accuracy, and reliability of the relevant ETL processes at various stages of production.

Skills

Required

  • Master's degree or foreign equivalent degree in Statistics, Applied Mathematics, Economics, Engineering, Computer Science, or a related field and one year of experience in the job offered or a related occupation. Employer will accept a Bachelor’s degree or foreign equivalent degree in Statistics, Applied Mathematics, Economics, Engineering, Computer Science, or a related field and five years of progressive post-baccalaureate experience in the job offered or a related occupation as equivalent to the Master’s degree and one year of experience.
  • building statistical models and machine learning models using large datasets from multiple resources
  • writing SQL scripts for analysis and data migration
  • applying specialized modelling software including R, Python, or MATLAB

What the JD emphasized

  • building statistical models and machine learning models using large datasets from multiple resources

Other signals

  • build models using statistical modeling, mathematical modeling, econometric modeling, network modeling, social network modeling, natural language processing, machine learning algorithms, genetic algorithms, and neural networks
  • Acquire data by building the necessary SQL / ETL queries
  • Apply a range of data science techniques and tools combined with subject matter expertise to solve difficult business problems