Nothing Special   »   [go: up one dir, main page]

Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[quant] Add a new fused MovingAvg Obs + FakeQuant operator (GPU) #61589

Closed
wants to merge 15 commits into from

Conversation

supriyar
Copy link
Contributor
@supriyar supriyar commented Jul 13, 2021

Stack from ghstack:

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: D29682761

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor
facebook-github-bot commented Jul 13, 2021

💊 CI failures summary and remediations

As of commit 29ef20f (more details on the Dr. CI page and at hud.pytorch.org/pr/61589):


  • 1/1 failures introduced in this PR

🕵️ 1 new failure recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See CircleCI build pytorch_linux_xenial_cuda11_1_cudnn8_py3_gcc7_build (1/1)

Step: "Build" (full log | diagnosis details | 🔁 rerun)

Jul 20 23:48:12 rm: cannot remove '/var/lib/jenkins/sccache_error.log': No such file or directory
Jul 20 23:48:12 ++++ extract_trap_cmd
Jul 20 23:48:12 ++++ printf '%s\n' ''
Jul 20 23:48:12 +++ printf '%s\n' cleanup
Jul 20 23:48:12 ++ trap -- '
Jul 20 23:48:12 cleanup' EXIT
Jul 20 23:48:12 ++ [[ pytorch-linux-xenial-cuda11.1-cudnn8-py3-gcc7-build != *pytorch-win-* ]]
Jul 20 23:48:12 ++ which sccache
Jul 20 23:48:12 ++ sccache --stop-server
Jul 20 23:48:12 ++ true
Jul 20 23:48:12 ++ rm /var/lib/jenkins/sccache_error.log
Jul 20 23:48:12 rm: cannot remove '/var/lib/jenkins/sccache_error.log': No such file or directory
Jul 20 23:48:12 ++ true
Jul 20 23:48:12 ++ [[ -n '' ]]
Jul 20 23:48:12 ++ [[ pytorch-linux-xenial-cuda11.1-cudnn8-py3-gcc7-build == *rocm* ]]
Jul 20 23:48:12 ++ SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log
Jul 20 23:48:12 ++ SCCACHE_IDLE_TIMEOUT=1200
Jul 20 23:48:12 ++ RUST_LOG=sccache::server=error
Jul 20 23:48:12 ++ sccache --start-server
Jul 20 23:48:12 sccache: Starting the server...
Jul 20 23:48:12 ++ sccache --zero-stats
Jul 20 23:48:12 Compile requests                      0

1 job timed out:

  • pytorch_linux_xenial_cuda11_1_cudnn8_py3_gcc7_build

Preview docs built from this PR

This comment was automatically generated by Dr. CI (expand for details).Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

supriyar added a commit that referenced this pull request Jul 13, 2021
Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 56af3d77b4a309b132c75a9a3cb5a5d4c73a3d45
Pull Request resolved: #61589
@supriyar supriyar marked this pull request as draft July 13, 2021 18:09
@supriyar supriyar removed the request for review from ezyang July 13, 2021 18:09
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
supriyar added a commit that referenced this pull request Jul 13, 2021
Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 9c8ae9a757add69ce4f937122c7eb940d1b337a7
Pull Request resolved: #61589
… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
supriyar added a commit that referenced this pull request Jul 14, 2021
Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 3efcb78e27aa1e27c6bdc74d5b59bedc95fe2ea7
Pull Request resolved: #61589
… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
supriyar added a commit that referenced this pull request Jul 14, 2021
Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 7f24c4d993cc6165db21ce848bf82f08e48b42e9
Pull Request resolved: #61589
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
supriyar added a commit that referenced this pull request Jul 14, 2021
Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 014cccdbc36229ef30f0febd8620069be63e3bee
Pull Request resolved: #61589
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
supriyar added a commit that referenced this pull request Jul 15, 2021
Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 169406b47f75de4e0a2893185a7f2ebc099605b2
Pull Request resolved: #61589
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@supriyar supriyar marked this pull request as ready for review July 16, 2021 18:31
… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

supriyar added 2 commits July 20, 2021 13:56
… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
… (GPU)"

Summary:
Custom GPU implementation that does the observer + calculate qparams calculation on GPU.
It calls the aten fake_quant_per_tensor/channel functions to perform the fake quant step.

Test Plan:
python test/test_quantization.py TestFusedObsFakeQuant

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D29682761](https://our.internmc.facebook.com/intern/diff/D29682761)

[ghstack-poisoned]
@supriyar
Copy link
Contributor Author

@supriyar has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in afdca41.

@facebook-github-bot facebook-github-bot deleted the gh/supriyar/238/head branch July 25, 2021 14:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants