TODO "Figure out how to do verbatim sequences in ANTLR" is impossible to implement with ANTLR #39

ST92 · 2021-12-19T22:46:41Z

I spent the last few days digging into why such an innocent-looking feature would be left as a TODO. I was looking for something to get my hands on, and it looked promising enough.

Turns out ANTLR doesn't support back-references or forward-references at all, so there is no good way to do this using the grammar notation alone.

A known workaround is to embed actions that verify that the two tokens match (analogous to checking that an XML closing tag has the same name as its opening tag), but those actions are pieces of code placed inside the lexer grammar, written in the same programming language as the generated lexer.

That would mean putting Java code in concise-encoding grammar definitions, and thus tying it tightly to Java.

I want to write a 100% Rust implementation. AFAIK, at the moment that means writing a custom lexer and parser to make verbatim escape sequences work.

TL;DR: ANTLR alone is insufficient, because the verbatim escape sequence (VES) grammar is context-sensitive.


ST92 · 2021-12-19T22:49:25Z

On a positive note, the spec is detailed enough that incorrect grammar files don't really impact anything. Honestly, I'm a bit disappointed that ANTLR seems to be the best tool for the job yet is still lacking here.

kstenerud · 2021-12-20T07:25:32Z

Yeah, I was hoping to rig something up with a templating engine (in python or whatever) to generate a finalized grammar file with stub code for whatever language is being built. In theory the actual verbatim sequence code itself is simple since you're just reading termination token data until the next whitespace, then reading content data until you encounter the termination token again.
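For illustration, the two-step read described above could look roughly like the following in Rust. This is only a sketch, not code from the reference implementation: the function name `read_verbatim` is made up, it assumes the input starts right after whatever introduces the verbatim sequence, and it assumes exactly one whitespace character separates the termination token from the content. The spec may impose further rules that are not handled here. It also shows why a fixed grammar rule cannot express this: the terminator is only known once the input is being read.

```rust
/// Hypothetical helper: reads `<terminator><whitespace><content><terminator>`.
/// Returns the content and whatever input remains after the closing terminator,
/// or `None` if the sequence is malformed or never terminated.
pub fn read_verbatim(input: &str) -> Option<(&str, &str)> {
    // Step 1: the termination token is everything up to the next whitespace.
    let ws_pos = input.find(char::is_whitespace)?;
    let terminator = &input[..ws_pos];
    if terminator.is_empty() {
        // An empty terminator would match immediately, so reject it.
        return None;
    }

    // Assumption: exactly one whitespace character separates the terminator
    // from the content. Skip it, respecting its UTF-8 width.
    let ws_len = input[ws_pos..].chars().next()?.len_utf8();
    let body = &input[ws_pos + ws_len..];

    // Step 2: the content is everything up to the first recurrence of the
    // terminator. Nothing inside the content is interpreted or unescaped.
    let end = body.find(terminator)?;
    Some((&body[..end], &body[end + terminator.len()..]))
}
```

Under these assumptions, calling `read_verbatim("END raw text END rest")` yields the content `"raw text "` and the remaining input `" rest"`, while an unterminated input such as `"END never closed"` yields `None`.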

kstenerud · 2021-12-20T07:30:52Z

BTW please do write up anything that you find confusing or weird in the spec. If it's confusing, it's badly written!
