TODO for spellchecked column, remove double spaces before running for future
## Test passed 🎊
should get resolved into a copy of the pre_abstract file
TODO how to do substring verification where it’ll show the lines that error!
TODO did we lose the NAs somewhere??
or actually maybe the gpt tagging is pretty iffy and we should just doing it closed class?
what do we do about the tagging sometimes being bad?
https://github.com/mjpost/sacrebleu
run
sacrebleu --i=pre_charf_model.txt --metric=chrf --chrf-word-order 2 --chrf-whitespace pre_charf_correct.txt
not sure what settings actually make sense –>