-
Notifications
You must be signed in to change notification settings - Fork 23k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[quant] Add a new fused MovingAvg Obs + FakeQuant operator (GPU) #61589
Conversation
Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
💊 CI failures summary and remediationsAs of commit 29ef20f (more details on the Dr. CI page and at hud.pytorch.org/pr/61589):
🕵️ 1 new failure recognized by patternsThe following CI failures do not appear to be due to upstream breakages: pytorch_linux_xenial_cuda11_1_cudnn8_py3_gcc7_build (1/1)Step: "Build" (full log | diagnosis details | 🔁 rerun)
|
Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 56af3d77b4a309b132c75a9a3cb5a5d4c73a3d45 Pull Request resolved: #61589
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 9c8ae9a757add69ce4f937122c7eb940d1b337a7 Pull Request resolved: #61589
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 3efcb78e27aa1e27c6bdc74d5b59bedc95fe2ea7 Pull Request resolved: #61589
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 7f24c4d993cc6165db21ce848bf82f08e48b42e9 Pull Request resolved: #61589
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 014cccdbc36229ef30f0febd8620069be63e3bee Pull Request resolved: #61589
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 169406b47f75de4e0a2893185a7f2ebc099605b2 Pull Request resolved: #61589
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
… (GPU)" Summary: Custom GPU implementation that does the observer + calculate qparams calculation on GPU. It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step. Test Plan: python test/test_quantization.py TestFusedObsFakeQuant Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761) [ghstack-poisoned]
@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
This pull request has been merged in afdca41. |
Stack from ghstack:
Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.
Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant
Reviewers:
Subscribers:
Tasks:
Tags:
Differential Revision: D29682761