If you use the pipelines from your own script (not the CLI), __call__ isn't run under torch.inference_mode(). The text encoder then keeps ~37 GB of graph/activations alive, so after you drop it and load the transformer you OOM.
Fix: either apply @torch.inference_mode() in the relevant places (i.e. on each pipeline's __call__), or note in the README that callers should wrap pipeline calls in torch.inference_mode().
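A minimal sketch of the difference (using a stand-in nn.Linear as a hypothetical "text encoder", not this repo's actual pipeline): outside inference_mode, autograd tracks the forward pass and keeps the activation graph alive; inside it, no graph is built, so activations can be freed as soon as the call returns.

```python
import torch

# Stand-in "text encoder" for illustration only; the real one is the
# pipeline component that holds ~37 GB of graph/activations.
encoder = torch.nn.Linear(8, 8)
x = torch.randn(2, 8)

# Without inference_mode: autograd builds a graph through the call.
y_tracked = encoder(x)
print(y_tracked.requires_grad)  # True: activations are retained for backward

# With inference_mode: no graph, memory reclaimable after the call.
with torch.inference_mode():
    y_free = encoder(x)
print(y_free.requires_grad)  # False
```

The same effect applies whether you use the context manager at the call site or decorate the pipeline's __call__ with @torch.inference_mode().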