Releases: kddubey/cappr
v0.9.6 - fix Llama 3 tokenizer
Breaking changes
None
New features
None
Bug fixes
cappr.huggingface is compatible with Llama 3/3.1's tokenizer. It works around this issue using code from this PR (with small modifications). See the updated list of supported architectures here.
v0.9.5 - address deprecation of HF KV tuple
Breaking changes
None
New features
cappr.huggingface.classify internally passes in a DynamicCache object if possible. This change gets rid of a warning you might see when running previous versions of CAPPr:
We detected that you are passing `past_key_values` as a tuple and this is deprecated and will be removed in v4.43. Please use an appropriate `Cache` class (https://huggingface.co/docs/transformers/v4.41.3/en/internal/generation_utils#transformers.Cache)
Bug fixes
None
v0.9.4 - don't repeat KV if possible
Breaking changes
None
New features
cappr.huggingface.classify doesn't copy the prompt's KVs when broadcasting the prompt to completions if batch_size=1 or if you pass in a single prompt. Instead, it repeats a view of them. This change saves memory for tasks where there are many completions. For example, in the Banking 77 demo, peak reserved CUDA memory goes from 13.8 GB to 8.3 GB (~40% decrease), and peak allocated CUDA memory goes from 9.3 GB to 7.7 GB (~17% decrease).
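The difference between copying and repeating a view can be sketched with NumPy. This is only an analogy: CAPPr works on PyTorch tensors, and the array shapes and variable names below are made up for illustration.

```python
import numpy as np

# Hypothetical stand-in for one prompt's cached keys/values:
# (1 prompt, 4 tokens, 8 hidden dims). Shapes are illustrative only.
prompt_kv = np.zeros((1, 4, 8), dtype=np.float32)
num_completions = 77

# Copying: allocates num_completions times the original data.
copied = np.repeat(prompt_kv, num_completions, axis=0)

# Repeating a view: same shape, but it points at the original buffer.
viewed = np.broadcast_to(prompt_kv, (num_completions, 4, 8))

same_shape = copied.shape == viewed.shape
view_shares_memory = np.shares_memory(prompt_kv, viewed)
copy_shares_memory = np.shares_memory(prompt_kv, copied)
```

Both results have the same shape, but only the copy owns new memory, which is presumably why the saving grows with the number of completions.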
Bug fixes
None
v0.9.3 - log-probs is an array when possible
Breaking changes
None
New features
- The agg_log_probs function returns a numpy array instead of a list of numpy arrays if there is a constant number of completions.
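A rough sketch of the idea, assuming aggregated log-probs arrive as one row per prompt. The helper `stack_if_constant` is hypothetical and is not part of CAPPr; it only illustrates when a single array is possible.

```python
import numpy as np

def stack_if_constant(agg):
    """Hypothetical helper: one 2-D array if every prompt has the same
    number of completions, otherwise a list of 1-D arrays."""
    lengths = {len(row) for row in agg}
    if len(lengths) == 1:
        return np.array(agg)
    return [np.array(row) for row in agg]

# Two prompts, two completions each: can be a single (2, 2) array.
constant = stack_if_constant([[-1.2, -0.3], [-0.7, -2.1]])

# Two prompts with 2 and 3 completions: stays a list of arrays.
ragged = stack_if_constant([[-1.2, -0.3], [-0.7, -2.1, -0.5]])
```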
Bug fixes
None
v0.9.2 - arbitrary token log-prob aggregation
Breaking changes
None
New features
- The agg_log_probs function applies func instead of np.exp ∘ func. So if you want average token log-probabilities, set func=np.mean.
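To make the change concrete, here is a small NumPy sketch. The functions `old_style` and `new_style` are hypothetical stand-ins for the two behaviors and do not reproduce CAPPr's actual signature; the token probabilities are made up.

```python
import numpy as np

# Made-up token log-probabilities for a single completion.
token_log_probs = np.log(np.array([0.5, 0.25, 0.125]))

def old_style(log_probs, func):
    # Earlier behavior as described in the note: np.exp ∘ func.
    return np.exp(func(log_probs))

def new_style(log_probs, func):
    # Behavior as of this release: func alone, no exponentiation.
    return func(log_probs)

# Average token log-probability, as suggested in the note.
avg_log_prob = new_style(token_log_probs, np.mean)

# The old result can be recovered by exponentiating explicitly.
seq_prob = np.exp(new_style(token_log_probs, np.sum))
```

Here `seq_prob` matches `old_style(token_log_probs, np.sum)`, so under this sketch nothing is lost: callers who want probabilities apply `np.exp` themselves.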
Bug fixes
None
v0.9.1 - no setup.py
Breaking changes
- There's no setup.py file, in case you were relying on that.
New features
None
Bug fixes
None
v0.9.0 - don't require openai, tiktoken
Breaking changes
- pip install cappr will no longer install openai, tiktoken. Install them yourself, or install them using pip install "cappr[openai]". For previous versions of cappr, if you needed to install cappr without these dependencies, you had to run:
  python -m pip install \
  "numpy>=1.21.0" \
  "tqdm>=4.27.0" && \
  python -m pip install --no-deps cappr
- cappr.openai.api.Model no longer includes the deprecated text-* models
New features
None
Bug fixes
None
v0.8.8 - default axis for posterior prob
Breaking changes
None
New features
- The axis of posterior_prob defaults to the last one
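As an illustration of what a last-axis default means, here is a minimal sketch of a posterior computation. The function `posterior_sketch` is hypothetical and is not CAPPr's posterior_prob; it assumes likelihoods are laid out with one row per prompt and one column per completion.

```python
import numpy as np

def posterior_sketch(likelihoods, prior=None, axis=-1):
    """Hypothetical sketch: multiply by an optional prior, then
    normalize along `axis` (the last axis unless told otherwise)."""
    likelihoods = np.asarray(likelihoods, dtype=float)
    if prior is None:
        unnormalized = likelihoods
    else:
        unnormalized = likelihoods * np.asarray(prior, dtype=float)
    return unnormalized / unnormalized.sum(axis=axis, keepdims=True)

# Rows are prompts, columns are completions (made-up numbers).
likelihoods = [[0.2, 0.6], [0.1, 0.1]]

# No axis argument: each row is normalized to sum to 1.
probs = posterior_sketch(likelihoods)
```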
Bug fixes
None
v0.8.7 - Llama CPP no need for logits_all=True
Breaking changes
None
New features
- You no longer need to instantiate your Llama CPP model with logits_all=True
Bug fixes
None
v0.8.6 - support LongLLaMA
Breaking changes
- Setting the internal past attribute of the cache to None will now cause an error to be raised if you try to use it again. Please use the original model instead.
New features
- Support LongLLaMA
- repr for cached model
- Don't check logits from Llama CPP
Bug fixes
None