Autoquant Config #368

HDCharles · 2024-06-14T23:32:14Z

Summary: this adds support for an autoquant config

#creating a AutoQuantConfig
aq_config = AutoQuantConfig(model)

#quantizing a model using hte AutoQuantConfig
aq_config.apply_to_model(model)

#save/load an AutoQuantConfig
aq_config.save(file_path)
aq_config.load(file_path) or AutoQuantConfig(file_path)

Test Plan:

python test_integration.py -k "test_autoquant_config"

Reviewers:

Subscribers:

Tasks:

Tags:

Summary: this adds support for an autoquant config aq_config = AutoQuantConfig(model) or aq_config = AutoQuantConfig(file_path) ... aq_config.apply_to_model(model) aq_config.save(file_path) Test Plan: python test_integration.py -k "test_autoquant_config" Reviewers: Subscribers: Tasks: Tags:

pytorch-bot · 2024-06-14T23:32:17Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/368

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit fa0ebfb with merge base bc2f8b7 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

msaroufim · 2024-06-14T23:45:59Z

torchao/quantization/autoquant.py

@@ -513,3 +513,42 @@ def clean_up_autoquant_hooks_and_attrs():
        model(*example_input)

    return model
+
+class AutoQuantConfig:


sorry I keep harping on names :P

But this sounds like it's a Cache instead?

In which case wouldn't something like this be clearer torch.autoquant(model, cache_path = "test.pkl") which would call apply_to_model() on behalf of the user?

And shouldn't you always save because most users would want shorter quantization times unless they're debugging cache issues?

The cache (https://github.com/pytorch/ao/blob/main/torchao/quantization/autoquant.py#L22) is something different. I still need to write automatic serialization for that.

unless you're saying this new thing is reall a Cache? How would you define the difference between a cache and a config?

I think of configs as a class representing a common set of input arguments to a function while a cache is something that saves the result of some search process

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 14, 2024

HDCharles requested review from msaroufim, cpuhrsch and jerryzh168 June 14, 2024 23:37

msaroufim reviewed Jun 14, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Autoquant Config #368

Autoquant Config #368

HDCharles commented Jun 14, 2024 •

edited

Loading

pytorch-bot bot commented Jun 14, 2024 •

edited

Loading

msaroufim Jun 14, 2024 •

edited

Loading

HDCharles Jun 15, 2024

msaroufim Jun 15, 2024

Autoquant Config #368

Are you sure you want to change the base?

Autoquant Config #368

Conversation

HDCharles commented Jun 14, 2024 • edited Loading

pytorch-bot bot commented Jun 14, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/368

✅ No Failures

msaroufim Jun 14, 2024 • edited Loading

Choose a reason for hiding this comment

HDCharles Jun 15, 2024

Choose a reason for hiding this comment

msaroufim Jun 15, 2024

Choose a reason for hiding this comment

HDCharles commented Jun 14, 2024 •

edited

Loading

pytorch-bot bot commented Jun 14, 2024 •

edited

Loading

msaroufim Jun 14, 2024 •

edited

Loading