Software Development Engineer, Open Data Analytics - Engines

Amazon · Big Tech · Arlington, VA · Software Development

Software Development Engineer role focused on open data analytics and large-scale data processing frameworks like Spark, Trino, Flink, Hive, Hadoop, and HBase within AWS EMR. The role involves designing and building next-generation technologies, working on complex distributed systems and query engines, and contributing to open-source projects.

What you'd actually do

  1. Design and build the next-generation technologies that will make EMR the best environment to run large-scale data processing workloads.
  2. Work on complex problems in distributed systems and query engines.
  3. Translate complex functional and technical requirements into detailed architecture and design.
  4. Deliver systems and features with top-notch quality, on time.
  5. Own the software development process end-to-end, including: working with engineers and product managers to develop requirements; designing, architecting, planning, implementing, and testing new systems and features; and deploying and operating the production EMR systems.

Skills

Required

  • Experience programming with at least one software programming language
  • 1+ years of experience contributing to the architecture and design (architecture, design patterns, reliability, and scaling) of new and current systems
  • 2+ years of non-internship professional software development experience
  • 2+ years of non-internship experience in the design or architecture (design patterns, reliability, and scaling) of new and existing systems
  • 3+ years of experience with a Master's degree in Computer Science or equivalent, OR 5+ years of experience with a Bachelor's degree in Computer Science or equivalent, in software development, including design, implementation, debugging, and support
  • Very strong Computer Science fundamentals in algorithm design, data structures, problem solving, and complexity analysis
  • Strong proficiency in developing object-oriented software, with deep experience in one or more relevant languages (Java, C, C++, C#)

Nice to have

  • 3+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
  • Experience in query processing, query engines, and/or query optimization
  • Thorough understanding of parallel algorithms, concurrency, and asynchronous architectures
  • Proficiency in high-performance, multi-threaded programming
  • Knowledge of, and contributions to, the Hadoop ecosystem