Integrate KBNF grammar #815

EricLBuehler · 2024-10-02T00:53:47Z

No description provided.

github-actions · 2024-10-02T00:54:47Z

Code Metrics Report

  ===============================================================================
 Language            Files        Lines         Code     Comments       Blanks
===============================================================================
 C Header                2           35           28            0            7
 Dockerfile              1           34           25            0            9
 Happy                   1          442          369            0           73
 JSON                   12          105          104            0            1
 Python                 52         2268         1930           69          269
 TOML                   20          625          559            2           64
 YAML                    2           21           19            2            0
-------------------------------------------------------------------------------
 Jupyter Notebooks       4            0            0            0            0
 |- Markdown             2           77           32           31           14
 |- Python               2          196          169            1           26
 (Total)                            273          201           32           40
-------------------------------------------------------------------------------
 Markdown               38         2759            0         2093          666
 |- BASH                 6          103          100            0            3
 |- JSON                 1           12           12            0            0
 |- Python               5           92           82            0           10
 |- Rust                 9          322          274            0           48
 |- TOML                 2           75           63            0           12
 (Total)                           3363          531         2093          739
-------------------------------------------------------------------------------
 Rust                  260        75643        68177         1547         5919
 |- Markdown           123         1217           25         1117           75
 (Total)                          76860        68202         2664         5994
===============================================================================
 Total                 393        81932        71211         3713         7008
===============================================================================

EricLBuehler

@Dan-wanna-M I was wondering if you could perhaps review some of this code and take a look at some of the review comments I made?

EricLBuehler · 2024-10-02T00:54:46Z

examples/server/grammar.py

Are the grammars here correct? Perhaps we can have a simpler example.

They look mostly correct to me; I have added some comments that can make the grammar more familiar and/or simpler to general users. In terms of simpler example, I think we can use something within regular grammar's capacity but can be more clearly written in kbnf. I think markdown list and the kbnf for a concrete json schema both qualify. Which one do you prefer, or you would like some examples in other areas?

EricLBuehler · 2024-10-02T00:55:05Z

mistralrs-core/src/pipeline/sampling.rs

+ Ok(second_sampled)
+ }
+ KbnfGrammarBias::FinishedGeneration => {
+ todo!()


TODO: Need to mark the sequence as done.

EricLBuehler · 2024-10-02T00:56:51Z

mistralrs-core/src/kbnf.rs

I was wondering if this file generally looks correct? It is paired with the sampling routines in sampling.rs and initialized in engine/mod.rs.

Dan-wanna-M

@EricLBuehler I have reviewed the PR and suggested some changes.

Dan-wanna-M · 2024-10-02T20:01:07Z

examples/server/grammar.py

+(* JSON Grammar *)
+
+(* JSON text must contain a single JSON value *)
+start = value ;


Should be start ::= value;

Dan-wanna-M · 2024-10-02T20:02:04Z

examples/server/grammar.py

+ | "null" ;
+
+(* A JSON object is a collection of key/value pairs enclosed in curly braces *)
+object ::= "{" [ members ] "}" ;


You can use both [members] and members? to express optional nonterminal

Dan-wanna-M · 2024-10-02T20:04:18Z

examples/server/grammar.py

+
+(* A JSON object is a collection of key/value pairs enclosed in curly braces *)
+object ::= "{" [ members ] "}" ;
+members ::= pair { "," pair } ;


You can use both {members} and members* to express 0 or infinite repetition.

Dan-wanna-M · 2024-10-02T20:06:45Z

examples/server/grammar.py

+elements ::= value { "," value } ;
+
+(* A JSON string is a sequence of Unicode characters enclosed in double quotes *)
+string ::= "\"" { character } "\"" ;


You can use regex in kbnf, see https://docs.rs/kbnf/latest/kbnf/#regular-expression.

Dan-wanna-M · 2024-10-02T20:15:04Z

examples/server/grammar.py

They look mostly correct to me; I have added some comments that can make the grammar more familiar and/or simpler to general users. In terms of simpler example, I think we can use something within regular grammar's capacity but can be more clearly written in kbnf. I think markdown list and the kbnf for a concrete json schema both qualify. Which one do you prefer, or you would like some examples in other areas?

Dan-wanna-M · 2024-10-02T20:25:59Z

examples/server/grammar.py

+
+client = OpenAI(api_key="foobar", base_url="http://localhost:1234/v1/")
+
+JSON_KBNF = '''


You may also want to take a look at https://github.com/Dan-wanna-M/formatron/blob/master/src/formatron/formats/json.py where I put a json grammar definition. It also contains whitespace pattern that makes llm generate better json.

Dan-wanna-M · 2024-10-02T20:31:30Z

mistralrs-core/src/kbnf.rs

+
+impl KbnfGrammar {
+ pub fn new(grammar: &str, tokenizer: &Tokenizer) -> Result<Self> {
+ let tokenizer_vocab = tokenizer.get_vocab(true);


I think we still need to "unpreprocess" the vocabulary or the control ASCII characters(like \n) and all non-ASCII characters will not be recognized correctly.

Dan-wanna-M · 2024-10-02T20:38:49Z

mistralrs-core/src/kbnf.rs

+ let mut bias = vec![0f32; self.vocab_size];
+ match self.engine.mask_logits(&mut bias) {
+ Ok(()) => {
+ let new_logits = (logits.to_device(&Device::Cpu)?.to_dtype(DType::F32)?


It might not be a good idea to move logits back to CPU due to potential CUDA synchronization issues. I think we can benchmark this to check latency. If it does bring too much latency, we can follow a strategy like this file which essentially only moves the indices tensor(tends to be smaller than the whole logits) to GPU and operate there.

Dan-wanna-M · 2024-10-02T20:40:54Z

mistralrs-core/src/kbnf.rs

+ }
+ }
+
+ /// Add a token, also to the trie.


technically speaking the cache is a hashmap, not a trie.

EricLBuehler added 7 commits September 2, 2024 06:23

Add kbnf grammar mechanics

5e7ddae

Add example and integrate

9fee83b

Merge branch 'master' into kbnf

baca0da

Fix missing nonterminal

6a2143b

Merge branch 'master' into kbnf

d623a9f

Merge branch 'master' into kbnf

4fb4482

Complete merge

fb123ac

EricLBuehler mentioned this pull request Oct 2, 2024

[Bug?] Incompatible with Hugging Face Tokenizers Dan-wanna-M/kbnf#18

Open

EricLBuehler commented Oct 2, 2024

View reviewed changes

Dan-wanna-M suggested changes Oct 2, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate KBNF grammar #815

Integrate KBNF grammar #815

EricLBuehler commented Oct 2, 2024

github-actions bot commented Oct 2, 2024

EricLBuehler left a comment

EricLBuehler Oct 2, 2024

Dan-wanna-M Oct 2, 2024

EricLBuehler Oct 2, 2024

EricLBuehler Oct 2, 2024

Dan-wanna-M left a comment

Dan-wanna-M Oct 2, 2024

Dan-wanna-M Oct 2, 2024

Dan-wanna-M Oct 2, 2024

Dan-wanna-M Oct 2, 2024

Dan-wanna-M Oct 2, 2024

Dan-wanna-M Oct 2, 2024

Dan-wanna-M Oct 2, 2024

Dan-wanna-M Oct 2, 2024

Dan-wanna-M Oct 2, 2024


		client = OpenAI(api_key="foobar", base_url="http://localhost:1234/v1/")

		JSON_KBNF = '''

Integrate KBNF grammar #815

Are you sure you want to change the base?

Integrate KBNF grammar #815

Conversation

EricLBuehler commented Oct 2, 2024

github-actions bot commented Oct 2, 2024

EricLBuehler left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Dan-wanna-M left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment