Software Engineer, Ray Data

Anyscale Anyscale · Data AI · Bengaluru, KA, India · Engineering

Software Engineer to build, optimize, and scale Ray's Datasets library and data processing capabilities, focusing on performance, integration with ML libraries, and stability of distributed systems for machine learning.

What you'd actually do

  1. Develop high quality open source software to simplify distributed programming (Ray)
  2. Identify, implement, and evaluate architectural improvements to Ray core and Datasets
  3. Improve the testing process for Ray to make releases as smooth as possible
  4. Communicate your work to a broader audience through talks, tutorials, and blog posts

Skills

Required

  • Algorithms
  • Data structures
  • System design
  • Building scalable and fault-tolerant distributed systems
  • Data processing
  • Database internals
  • Spark
  • Dask

Nice to have

  • Streaming data processing

What the JD emphasized

  • At least 5 years of relevant work experience
  • Experience in building scalable and fault-tolerant distributed systems
  • Experience with data processing, database internals including Spark or Dask (streaming is a plus)

Other signals

  • Ray Datasets library
  • data processing capabilities
  • distributed computing
  • scalable machine learning