Machine Learning Engineer

Cognite Cognite · Industrial · India · Engineering

Machine Learning Engineer role focused on building and deploying AI models and pipelines for industrial digitalization. The role involves data processing, fine-tuning foundational models (NLP, Vision-Language), creating APIs for ML capabilities, and ensuring production-grade code quality and system design.

What you'd actually do

  1. Build, test, and maintain robust data pipelines for large-scale industrial datasets.
  2. Assist in deploying data science prototypes to production.
  3. Write high-quality, testable Python code, primarily creating RESTful/gRPC APIs for ML capabilities.
  4. Containerize applications (Docker) and navigate Linux to handle massive volumes of unstructured industrial data.
  5. Run experiments, fine-tune existing foundational models (NLP, Vision-Language), and evaluate ML libraries to enhance document parsing and entity matching.

Skills

Required

  • Python
  • PyTorch
  • TensorFlow
  • Hugging Face
  • LangChain
  • SQL
  • PySpark
  • Pandas
  • Dask
  • MLOps
  • Docker
  • AWS
  • Azure
  • GCP
  • Computer Science fundamentals
  • data structures
  • algorithms

Nice to have

  • Vision-Language Models (VLMs)
  • Computer Vision
  • document layout analysis
  • RAG pipelines
  • Vector Databases
  • Pinecone
  • Milvus
  • Qdrant
  • agentic frameworks
  • LangGraph
  • columnar data
  • Parquet
  • data lakes
  • Delta Lake
  • Apache Iceberg
  • unstructured data
  • PDF extraction
  • OCR
  • FastAPI
  • Flask

What the JD emphasized

  • engineering-first role
  • production-grade code
  • system design
  • ML models as software components
  • building durable software
  • treating ML code like production software

Other signals

  • industrial digitalization
  • AI agents
  • transforming unstructured industrial data
  • Deep Learning
  • Generative AI
  • Computer Vision
  • MLOps
  • model deployment