Change lexical alignment default to 1-grams #26

fusaroli · 2018-05-16T20:02:13Z

No article in the current literature uses n-grams above 1 for lexical alignment. It'd make sense to change the default to 1.

fusaroli · 2018-05-18T11:32:14Z

The fix seems pretty straightforward: in the calculate alignment.py, function LexicalPOSAlignment, the line
for ngram in range(2,maxngram+1):

should become

for ngram in range(1,maxngram+1):

This however has the potential drawback of providing the user with an additional meaningless syntactic alignment of 1-grams.
No biggie for me, but if we want to avoid that, it could be solved by not passing along the penn_tok1 and penn_lem1 (including the stan stuff).

a-paxton added the enhancement label Jan 8, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change lexical alignment default to 1-grams #26

Change lexical alignment default to 1-grams #26

fusaroli commented May 16, 2018

fusaroli commented May 18, 2018

Change lexical alignment default to 1-grams #26

Change lexical alignment default to 1-grams #26

Comments

fusaroli commented May 16, 2018

fusaroli commented May 18, 2018