Repository for final project of High Performance Machine Learning

hariya99/quantization-hpml

Quantization-HPML

Authors: Harish Pal Chauhan, Vishnuram Hariharan

Summary: We created a quantized version of an LSTM language model through Quantization Aware Training, using Python and PyTorch. The quantized model is 25% of the original model's size and achieves the same accuracy.
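To illustrate the idea behind Quantization Aware Training, here is a minimal sketch of "fake quantization": values are rounded to an 8-bit integer grid and mapped back to floats, so a model can see the rounding error during training. This is not the code from main.ipynb. The function name `fake_quantize`, the affine scheme, and the sample weights are assumptions made for illustration.

```python
def fake_quantize(values, num_bits=8):
    """Quantize floats to signed integers, then map them back to floats.

    Hypothetical helper for illustration; not taken from this repository.
    """
    qmin, qmax = -(2 ** (num_bits - 1)), 2 ** (num_bits - 1) - 1
    # The representable range must include 0 so that 0.0 maps exactly.
    lo, hi = min(min(values), 0.0), max(max(values), 0.0)
    # Fall back to a scale of 1.0 when every value is zero.
    scale = (hi - lo) / (qmax - qmin) or 1.0
    zero_point = round(qmin - lo / scale)
    quantized = [
        max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values
    ]
    dequantized = [(q - zero_point) * scale for q in quantized]
    return quantized, dequantized, scale


# Made-up sample weights, chosen only to exercise the function.
weights = [-0.62, -0.05, 0.0, 0.31, 0.87]
q, dq, scale = fake_quantize(weights)
max_error = max(abs(w - d) for w, d in zip(weights, dq))
print(q)
print(max_error, scale)
```

For these sample values, every integer code fits in the int8 range and the round-trip error stays within half a quantization step.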

Prerequisites

pip install -r requirements.txt

Code execution

Run the main.ipynb notebook.

Results

Original Model Execution

The quantized model performs at the same level as the original model.

Quantized Model Execution

The quantized model is 75% smaller than the original model.

Memory Footprints
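The 75% reduction matches what one would expect from storing weights as 8-bit integers instead of 32-bit floats. The sketch below works through that arithmetic for an LSTM. The layer sizes are hypothetical and not taken from this repository, and the helper `lstm_param_count` is introduced here only for illustration. It assumes a unidirectional LSTM with the parameter layout of `torch.nn.LSTM` (four gates, two bias vectors per layer) and ignores the small overhead of scales and zero points.

```python
def lstm_param_count(input_size, hidden_size, num_layers=1):
    """Count parameters of a unidirectional LSTM (4 gates, 2 bias vectors per layer)."""
    total = 0
    for layer in range(num_layers):
        # Layers after the first take the previous layer's hidden state as input.
        layer_input = input_size if layer == 0 else hidden_size
        total += 4 * hidden_size * (layer_input + hidden_size + 2)
    return total


# Hypothetical sizes, chosen only for the example.
params = lstm_param_count(input_size=256, hidden_size=512, num_layers=2)
fp32_bytes = params * 4  # 32-bit floats: 4 bytes per parameter
int8_bytes = params * 1  # 8-bit integers: 1 byte per parameter
ratio = int8_bytes / fp32_bytes
print(params, fp32_bytes, int8_bytes, ratio)
```

In practice the measured ratio can differ slightly from 0.25, since a saved model also stores quantization parameters and may keep some tensors in floating point.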
