OpenGrm NGram Library Known Bugs
- Giving ngramread a text file where lines are terminiated with space causes models built with this to have a missing symbol. (BER: fixed -- deals with leading and trailing spaces)
- (empty) -- causes wiki cross referencing to be wonky if omitted
- (empty)
- (empty)
- Giving ngramread a blank line causes models built with missing symbol (at least in the version I'm using).
- Giving ngramprint --ARPA an FST with no symbol tables segfaults
- (empty)
- Checking for normalized model is perhaps too exact (allow small delta) (MDR: made --norm_eps a flag for relevant binaries)
- ngramread doesn't complain about missing labels if --symbols is passed (e.g. <s> in ARPA format but <S> in symbols file)
- ngramread fails if first line isn't blank. Can't read Google ARPA files. OK if first line is \data\? (fixed.)
- Editing Google file to fix 10. results in OK read and empty FST. (seems to be fixed.)
- Perplexity measure on war of worlds corpus gives warning about bad fST (but o.w. seems to work)
- ngrammake fails with zero arguments (BER: fixed)
- random error message when symbol tables are missing
- empty or non-coaccess machines give odd errors / segfaults
- ngramcount using -epsilon_as_backoff has problem finding state if counting order different from model order
--
MichaelRiley - 04 Nov 2010
This topic: GRM
> WebHome >
NGramLibrary > NGramBugs
Topic revision: r14 - 2011-11-07 - BrianRoark