Senior Software Engineer(ai/ml Platform)

Autodesk Autodesk · Enterprise · Pune, India

Senior Software Engineer focused on designing and implementing scalable AI/ML serving systems within a hybrid cloud architecture, ensuring low latency and efficient model deployment and management.

What you'd actually do

  1. Design and Implement Scalable AI/ML Serving Systems: Develop scalable and efficient systems for serving AI/ML models, ensuring that these systems can handle varying loads and perform with low latency across diverse environments
  2. Hybrid Cloud Architecture Management: Architect and manage a hybrid cloud environment that uses both on-premises resources and multiple cloud platforms (e.g., AWS, Azure, GCP) to optimise performance, cost, and scalability
  3. Model Deployment and Versioning: Oversee the deployment of AI/ML models into production, including the setup of CI/CD pipelines for model deployment and versioning, ensuring smooth and reliable model updates and rollbacks
  4. Performance Monitoring and Optimization: Implement monitoring tools and practices to track the performance of AI/ML models in production, identifying bottlenecks and optimizing system and model performance for better efficiency and reduced costs
  5. Security and Compliance: Ensure that the AI/ML serving systems follow industry standards and regulatory requirements for data security and privacy, including the management of data encryption, access controls, and audit trails

Skills

Required

  • 5+ years of experience in software development and engineering
  • BS or MS in Computer Science, or equivalent practical experience
  • Hands-on experience with AI/ML frameworks (such as TensorFlow, PyTorch)
  • Familiarity with the lifecycle of AI/ML model development, from training to deployment
  • Strong coding skills in Python
  • Experience with designing and managing systems on hybrid cloud architectures
  • Working knowledge of cloud service providers like Azure
  • Familiarity with containerization technologies (e.g., Docker)
  • Familiarity with orchestration systems (e.g., Kubernetes)
  • Knowledge of CI/CD pipelines
  • Knowledge of infrastructure as code
  • Deep understanding of performance metrics
  • Deep understanding of latency optimization techniques

Nice to have

  • Cloud Certifications (AWS, GCP, Azure)
  • Experience with Big Data Technologies (Hadoop, Spark, Kafka)
  • Familiarity with AI/ML Model Monitoring Tools (MLflow, Kubeflow, TensorBoard)

What the JD emphasized

  • AI/ML serving platform
  • scalable
  • efficient systems
  • model serving
  • inference
  • hybrid cloud architecture
  • low latency
  • production systems and services
  • AI/ML frameworks
  • deployment
  • cloud architectures
  • containerization
  • orchestration
  • performance metrics
  • latency optimization

Other signals

  • AI/ML serving platform
  • scalable, efficient systems for model serving and inference
  • hybrid cloud architecture
  • low latency