The Tokenize methods are now called Encode as that better expresses what they do.
The old methods still exist for now as redirects, marked as Obsolete.
Add new CreateBatchEnumerator and CreateAsyncBatchEnumerator APIs that support encoding inputs too long to fit into a single model input by splitting them into overlapping windows (overlap/stride).
Make the Encode overload that returns ReadOnlyMemory instances reuse its internal buffers.
Use FrozenDictionary on .NET 8.
Add support for reading configuration from tokenizer.json files.
Add a LoadFromHuggingFaceAsync method to ease getting started.
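
The overlap/stride idea behind the batch enumerators can be illustrated with a small, language-neutral sketch. This is not the library's implementation or API: the function name `split_with_overlap` and its parameters `max_len` and `overlap` are hypothetical, and the sketch ignores special tokens such as [CLS] and [SEP], which take up part of each window in a real model input.

```python
def split_with_overlap(token_ids, max_len, overlap):
    """Split a token id sequence into windows of at most max_len tokens.

    Consecutive windows share `overlap` tokens, so context at a window
    boundary is not lost. Hypothetical helper, for illustration only.
    """
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    if not 0 <= overlap < max_len:
        raise ValueError("overlap must satisfy 0 <= overlap < max_len")

    step = max_len - overlap  # how far each new window advances
    windows = []
    start = 0
    while True:
        windows.append(token_ids[start:start + max_len])
        if start + max_len >= len(token_ids):
            break  # this window already reaches the end of the input
        start += step
    return windows


if __name__ == "__main__":
    # Ten token ids, windows of four tokens, one token shared between neighbours.
    for window in split_with_overlap(list(range(10)), max_len=4, overlap=1):
        print(window)
```

Each window advances by `max_len - overlap` tokens, so a larger overlap preserves more shared context at the cost of producing more windows for the same input.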