Senior Python Engineer - ML and Data Science

ClickHouse ClickHouse · Data AI · Product & Engineering

This role focuses on enhancing the Python ecosystem for ClickHouse, aiming to improve the experience for data scientists and ML engineers. The engineer will work on data ingestion, transformation, and analysis features, optimize Python integrations, and contribute to open-source projects, ensuring compatibility with data science toolkits and efficient query execution.

What you'd actually do

  1. Improve ClickHouse's Data Science Python experience: Design and implement features that simplify data ingestion, transformation, and analysis for data scientists using Python.
  2. Contribute to ClickHouse's Python Integrations: Enhance existing ClickHouse’s Python integrations to provide a seamless data science experience.
  3. Work with Open-Source Ecosystem: Contribute to ClickHouse’s open-source repositories, ensuring compatibility with popular data science toolkits.
  4. Performance Optimization: Ensure efficient query execution and data handling when interfacing with Python.
  5. Collaborate with Internal & External Teams: Work closely with product managers, engineers, and the data science community to gather feedback and refine features.

Skills

Required

  • Python
  • Pandas
  • Polars
  • Scikit-learn
  • PyTorch
  • databases
  • query engines
  • analytical tools
  • distributed systems
  • OLAP databases
  • data engineering workflows
  • integrating databases with machine learning or analytics pipelines
  • open-source projects
  • cross-functional team
  • problem-solving skills
  • communication skills

Nice to have

  • ClickHouse
  • cloud-based OLAP analytics systems
  • ML workloads
  • integration with analytical databases
  • developer-first or database technology company

What the JD emphasized

  • Python
  • data scientists
  • ML engineers
  • data ingestion
  • transformation
  • analysis
  • Python libraries
  • data science workflows
  • Python integrations
  • data science experience
  • data science toolkits
  • data science applications