fix: Check underlying model device type when moving 8-bit quantized models to GPU at eval #3622

jeffkinnison · 2023-09-16T20:09:37Z

Check self.model.model.device.type == 'cuda' rather than self.device.type when moving 8-bit models to GPU for #3606. In Trainer, self.device holds the string "cuda", so checking self.device.type raises an AttributeError. Since we work with self.model.model directly in the following code block, it makes sense to bypass the Trainer and LLM objects, and get the model device from the model itself.

github-actions · 2023-09-16T20:50:19Z

Unit Test Results

  4 files -   2   4 suites - 2 31m 42s ⏱️ - 10m 44s
31 tests ±  0 26 ✔️ ±  0   5 💤 ±0 0 ❌ ±0
62 runs - 20 52 ✔️ - 14 10 💤 - 6 0 ❌ ±0

Results for commit 5f85080. ± Comparison against base commit 4806254.

check underlying model device type

5f85080

jeffkinnison requested review from justinxzhao, tgaddair and arnavgarg1 September 16, 2023 20:09

tgaddair approved these changes Sep 16, 2023

View reviewed changes

jeffkinnison merged commit 02ffb06 into master Sep 16, 2023
14 of 16 checks passed

jeffkinnison deleted the 8bit-device-check-fix branch September 16, 2023 21:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Check underlying model device type when moving 8-bit quantized models to GPU at eval #3622

fix: Check underlying model device type when moving 8-bit quantized models to GPU at eval #3622

jeffkinnison commented Sep 16, 2023 •

edited

Loading

github-actions bot commented Sep 16, 2023

fix: Check underlying model device type when moving 8-bit quantized models to GPU at eval #3622

fix: Check underlying model device type when moving 8-bit quantized models to GPU at eval #3622

Conversation

jeffkinnison commented Sep 16, 2023 • edited Loading

github-actions bot commented Sep 16, 2023

Unit Test Results

jeffkinnison commented Sep 16, 2023 •

edited

Loading