Principal ML Engineer

Cognite Cognite · Industrial · India · Engineering

Principal ML Engineer to design, build, and deploy scalable ML systems for industrial digitalization, focusing on transforming unstructured data into actionable intelligence using Deep Learning, Generative AI, and Computer Vision. The role involves engineering production-grade code, optimizing inference, and integrating AI into existing systems, with a focus on scaling and real-time applications.

What you'd actually do

  1. Study, transform, and deploy data science prototypes into production environments, ensuring they are scalable, efficient, and maintainable.
  2. Architect and write high-quality, scalable, and testable production code (Python, Scala, C++, or Java). Build the robust APIs, distributed big data pipelines, and orchestration layers required to integrate AI into existing industrial master data systems.
  3. Navigate complex deployment environments. You will optimize inference bottlenecks, manage containerized deployments, and ensure our systems can handle massive volumes of unstructured industrial data reliably.
  4. Research and implement advanced ML algorithms (e.g., in NLP, Computer Vision) and extend existing ML libraries and frameworks.
  5. Run tests, perform statistical analysis, and monitor deployed models for performance, drift, and accuracy, implementing retraining strategies as needed.

Skills

Required

  • Python
  • C++
  • Java
  • PyTorch
  • TensorFlow
  • Keras
  • Scikit-learn
  • MLOps
  • AWS
  • Azure
  • GCP
  • data structures
  • algorithms
  • software architecture

Nice to have

  • fine-tuned multimodal models
  • Agentic AI
  • multi-step reasoning
  • Graph RAG
  • Knowledge Graphs
  • lakehouses
  • Delta Lake
  • Apache Iceberg
  • large language models
  • on-premises deployment
  • edge environments

What the JD emphasized

  • production-grade code
  • scalable ML systems
  • inference bottlenecks
  • low latency
  • Agentic AI
  • multimodal models
  • Graph RAG

Other signals

  • building AI
  • low-code AI agents
  • transforming unstructured industrial data
  • Deep Learning
  • Generative AI
  • Computer Vision
  • scalable ML systems
  • production-grade code
  • distributed systems
  • inference bottlenecks
  • containerized deployments
  • NLP
  • multimodal models
  • Agentic AI
  • Graph RAG
  • Knowledge Graphs
  • large language models
  • low latency