Pulse · pytorch/torchchat

October 26, 2024 – November 2, 2024

Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.

Replace WeightOnlyInt8Linear with TorchAO int8_weight_only quantization
#1328 commented on Oct 31, 2024 • 3 new comments
RFC: Make quantization a first class feature
#1032 commented on Oct 30, 2024 • 0 new comments
RFC: Quantization Evaluation
#1325 commented on Oct 30, 2024 • 0 new comments
convert_hf_checkpoint only relies on model_name to resolve TransformerArgs
#1179 commented on Oct 30, 2024 • 0 new comments
AOTI Export ignores user --device flag - expected behavior?
#1278 commented on Nov 1, 2024 • 0 new comments
Out of memory AOTI using llama 3.1 8b on RTX 4090
#1302 commented on Nov 1, 2024 • 0 new comments
Add benchmarking scripts
#1030 commented on Oct 31, 2024 • 0 new comments
[aoti] Remove need for -l in cmake call
#1159 commented on Oct 31, 2024 • 0 new comments
Tokenizers tokenizer
#1261 commented on Oct 30, 2024 • 0 new comments
Install ET nightly and bump up ET version to 20241101
#1312 commented on Nov 1, 2024 • 0 new comments
Use training IR in torchchat export
#1319 commented on Nov 2, 2024 • 0 new comments