AI Frameworks Engineer (OpenVINO, GenAI)

at Intel · Industrial · Leixlip, Ireland

AI Frameworks Engineer focused on optimizing generative AI models for efficient inference using the OpenVINO toolkit across Intel hardware, from edge to cloud. This role involves deep dives into generative model architectures and the OpenVINO ecosystem, implementing new features, and optimizing performance for state-of-the-art inference solutions.

What you'd actually do

  1. Spearhead the development and enhancement of innovative use cases for OpenVINO GenAI.
  2. Dive deep into the architecture of generative models and the OpenVINO ecosystem.
  3. Implement new features and optimize performance to deliver state-of-the-art inference solutions.
  4. Empower developers worldwide by optimizing deep learning models for maximum performance across diverse Intel hardware.

Skills

Required

  • Bachelor's or Master's degree in Computer Science, Computer Engineering, Mathematics, or a related field
  • Deep Learning architectures, particularly Generative AI models (e.g., Transformers, Diffusion models)
  • PyTorch, TensorFlow, and the Hugging Face Transformers library
  • C/C++, Python
  • multithreading concepts and practical experience developing thread-safe code
  • Strategic problem-solver
  • Impactful communicator and cross-functional collaborator
  • Effective verbal and written communication skills in English

Nice to have

  • model optimization techniques (e.g., quantization, pruning)
  • OpenVINO Toolkit or other deep learning inference runtimes (ONNX Runtime, LiteRT, ExecuTorch)
  • contributing to open-source projects

What the JD emphasized

  • minimum of 5 years of hands-on development experience utilizing C/C++, Python
  • Proven experience in Deep Learning architectures, particularly Generative AI models
  • Familiarity with popular AI frameworks such as PyTorch, TensorFlow, and the Hugging Face Transformers library

Other signals

  • optimizing deep learning models for maximum performance
  • efficient and powerful generative AI pipeline deployments
  • implementing new features and optimizing performance to deliver state-of-the-art inference solutions

Job Details:

Job Description:

Join Intel's OpenVINO team, a critical force in accelerating AI inference from the edge to the cloud. Our mission is to empower developers worldwide by optimizing deep learning models for maximum performance across diverse Intel hardware. You'll be an integral part of the OpenVINO GenAI team, which is at the forefront of enabling efficient and powerful generative AI pipeline deployments. As an AI Frameworks Engineer, you will spearhead the development and enhancement of innovative use cases for OpenVINO GenAI. This exciting role involves diving deep into the architecture of generative models and the OpenVINO ecosystem, implementing new features, and optimizing performance to deliver state-of-the-art inference solutions.

Qualifications:

  • Bachelor's or Master's degree in Computer Science, Computer Engineering, Mathematics, or a related field.
  • Proven experience in Deep Learning architectures, particularly Generative AI models (e.g., Transformers, Diffusion models).
  • Familiarity with popular AI frameworks such as PyTorch, TensorFlow, and the Hugging Face Transformers library.
  • Minimum of 5 years of hands-on development experience utilizing C/C++, Python.
  • Demonstrated understanding of multithreading concepts and practical experience developing thread-safe code.
  • Strategic problem-solver: You possess the ability to dissect challenging technical problems and devise practical solutions that align with project goals and constraints.
  • Impactful communicator and cross-functional collaborator: You excel at conveying technical insights clearly and building strong partnerships, thriving in both independent work and collaborative environments.
  • Effective verbal and written communication skills in English (intermediate level or higher).

Preferred Qualifications (A Plus):

  • Experience with model optimization techniques (e.g., quantization, pruning).
  • Knowledge of the OpenVINO Toolkit or other deep learning inference runtimes (ONNX Runtime, LiteRT, ExecuTorch).
  • Experience contributing to open-source projects.

Job Type:

Experienced Hire

Shift:

Shift 1 (Ireland)

Primary Location:

Ireland, Leixlip

Additional Locations:

Business group:

The Software Team drives customer value by enabling differentiated experiences through leadership AI technologies and foundational software stacks, products, and services. The group is responsible for developing the holistic strategy for client and data center software in collaboration with OSVs, ISVs, developers, partners and OEMs. The group delivers specialized NPU IP to enable the AI PC and GPU IP to support all of Intel's market segments. The group also has HW and SW engineering experts responsible for delivering IP, SOCs, runtimes, and platforms to support the CPU and GPU/accelerator roadmap, inclusive of integrated and discrete graphics.

Posting Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust

N/A

Work Model for this Role

This role will be eligible for our hybrid work model, which allows employees to split their time between working on-site at their assigned Intel site and off-site.

* Job posting details (such as work model, location, or time type) are subject to change.