Stop requiring CacheConfig in GenerationConfig with StaticCache #35026

poedator · 2024-11-30T13:00:53Z

I have an observation that in some common use cases, configuring StaticCache during GenerationConfig initialisation is unnecessary.
it was introduced in #32830

Specifically, when the model is used with .generate, only the cache_implementation config option is relevant. The rest of the cache config is determined inside generate(), specifically in transformers.generation.utils::GenerationMixin._get_cache(). In that function, StaticCache is created based on the requested generation parameters, ignoring cache_config entirely.

Suggestion:

do not require StaticCache config in GenerationConfig.init
have some custom logic for ExecuTorch, that does not affect other use cases
have default arguments for StaticCache.init() so that it would quietly get created with some default parameters (not a good idea though).
have tests in transformers that set just cache_implementation="static" and then call .generate()

current workaround:

do not set cache_implementation in GenerationConfig constructor, but set it afterwards with:
model.generation_config.cache_implementation = "static"

system info
applies to transformers from september 2024 up to the current 4.46.3

Who can help?

@gante

The text was updated successfully, but these errors were encountered:

Rocketknight1 · 2024-12-02T13:42:29Z

cc @zucchini-nlp as well!

github-actions · 2024-12-31T08:03:35Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

poedator · 2024-12-31T13:42:49Z

up
@zucchini-nlp @gante

poedator added the bug label Nov 30, 2024

poedator mentioned this issue Nov 30, 2024

Make StaticCache configurable at model construct time #32830

Merged

4 tasks

qubvel added Generation Cache labels Jan 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stop requiring CacheConfig in GenerationConfig with StaticCache #35026

Stop requiring CacheConfig in GenerationConfig with StaticCache #35026

poedator commented Nov 30, 2024 •

edited

Loading

Rocketknight1 commented Dec 2, 2024

github-actions bot commented Dec 31, 2024

poedator commented Dec 31, 2024

Stop requiring CacheConfig in GenerationConfig with StaticCache #35026

Stop requiring CacheConfig in GenerationConfig with StaticCache #35026

Comments

poedator commented Nov 30, 2024 • edited Loading

current workaround:

Who can help?

Rocketknight1 commented Dec 2, 2024

github-actions bot commented Dec 31, 2024

poedator commented Dec 31, 2024

poedator commented Nov 30, 2024 •

edited

Loading