Senior Python Engineer - ML and Data Science

ClickHouse · Data AI · Product & Engineering

This role focuses on enhancing ClickHouse's Python ecosystem to improve the experience for data scientists and ML engineers. The engineer will work on data ingestion, transformation, and analysis features, optimize Python integrations, and contribute to open-source initiatives, ensuring compatibility with data science toolkits and efficient query execution.

What you'd actually do

  1. Improve ClickHouse's Data Science Python experience: Design and implement features that simplify data ingestion, transformation, and analysis for data scientists using Python.
  2. Contribute to ClickHouse's Python Integrations: Enhance ClickHouse's existing Python integrations to provide a seamless data science experience.
  3. Work with the Open-Source Ecosystem: Contribute to ClickHouse's open-source repositories, ensuring compatibility with popular data science toolkits.
  4. Performance Optimization: Ensure efficient query execution and data handling when interfacing with Python.
  5. Collaborate with Internal & External Teams: Work closely with product managers, engineers, and the data science community to gather feedback and refine features.

Skills

Required

  • Python
  • Pandas
  • Polars
  • Scikit-learn
  • PyTorch
  • databases
  • query engines
  • analytical tools
  • distributed systems
  • OLAP databases
  • data engineering workflows
  • integrating databases with machine learning or analytics pipelines
  • open-source projects
  • cross-functional collaboration
  • problem-solving skills
  • communication skills

Nice to have

  • ClickHouse
  • cloud-based OLAP analytics systems
  • ML workloads
  • experience at a developer-first or database technology company

What the JD emphasized

  • data scientists
  • machine learning teams
  • data science
  • ML engineers
  • analytics professionals
  • data exploration
  • data ingestion
  • transformation
  • analysis
  • data science workflows
  • data science toolkits
  • machine learning
  • analytics pipelines
  • data science applications

Other signals

  • Enhance ClickHouse's Python ecosystem for data scientists and ML engineers
  • Optimize Python libraries and data science workflows
  • Integrate databases with machine learning or analytics pipelines