Job Details:

Job Description:

Responsibilities include:

Develop the Intel Neural Compressor product and related tools (auto-round), optimizing for Intel AI platforms, including CPUs, GPUs, and AI accelerators
Research and implement quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models
Track and explore cutting-edge directions in efficient model deployment and inference/fine-tuning acceleration

Qualifications:

Master's or PhD degree in computer science or a related field
Solid understanding of deep learning, deep learning frameworks, and large language model (LLM) fundamentals
Familiarity with model compression techniques such as quantization and pruning
Proficiency in Python/C++ or other programming languages commonly used for deep learning development
Strong sense of teamwork and group collaboration
Good oral and written English skills

Preferred Qualifications

Strong self-motivation and problem-solving skills
Passion for technological innovation and practical engineering, with a drive for continuous exploration and improvement
Experience in model fine-tuning, inference optimization, or related tool development

Job Type:

College Grad

Shift:

Shift 1 (China)

Primary Location:

PRC, Shanghai

Additional Locations:

Business group:

The Sales and Marketing Group (SMG) leverages the product portfolio to drive Intel's revenue growth and market expansion, blending strategic initiatives with dynamic sales efforts to capture and retain customers. SMG is responsible for empowering the sales force with tools and insights needed to close deals and build lasting customer relationships. Sales analytics and market research ensure strategies are both targeted and impactful. In SMG, disciplined execution, creativity, and ambition are celebrated, providing ample opportunities for career advancement and skill development.

Posting Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust

N/A

Work Model for this Role

This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.

ADDITIONAL INFORMATION: Intel is committed to Responsible Business Alliance (RBA) compliance and ethical hiring practices. We do not charge any fees during our hiring process. Candidates should never be required to pay recruitment fees, medical examination fees, or any other charges as a condition of employment. If you are asked to pay any fees during our hiring process, please report this immediately to your recruiter.

AI Frameworks Software Engineer – Model Compression Algorithm