OpenGrm NGram Library Known Bugs

  1. Giving ngramread a text file whose lines are terminated with a space causes models built from it to have a missing symbol. (BER: fixed -- deals with leading and trailing spaces)
  2. Giving ngramread a blank line causes models built from it to have a missing symbol (at least in the version I'm using).
  3. Giving ngramprint --ARPA an FST with no symbol tables causes a segfault.
  4. The check for a normalized model is perhaps too exact; it should allow a small delta. (MDR: made --norm_eps a flag for relevant binaries)
  5. ngramread doesn't complain about missing labels if --symbols is passed (e.g. <s> in the ARPA file but <S> in the symbols file).
  6. ngramread fails if the first line isn't blank, so it can't read Google ARPA files. Is it OK if the first line is \data\?
  7. Editing the Google file to work around item 6 results in an OK read but an empty FST.
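
Until a fixed build is available, the whitespace and blank-line problems in items 1 and 2 can probably be sidestepped by cleaning the text corpus before it reaches ngramread. The following is only a minimal sketch of such a pre-processing step in Python; it is not part of OpenGrm, the function name clean_corpus_lines is hypothetical, and it assumes one sentence per line with space-separated tokens.

```python
def clean_corpus_lines(lines):
    """Yield corpus lines with surrounding whitespace removed.

    Hypothetical helper (not part of OpenGrm): drops leading and
    trailing whitespace and skips lines that are empty afterwards.
    """
    for line in lines:
        stripped = line.strip()
        if stripped:  # skip blank or whitespace-only lines
            yield stripped


# Small in-memory demonstration; a real corpus would be read from a file.
sample = ["the cat sat \n", "\n", "  on the mat\n"]
cleaned = list(clean_corpus_lines(sample))
print(cleaned)  # ['the cat sat', 'on the mat']
```

The cleaned lines would then be written to a new file and that file passed to ngramread in place of the original.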

-- MichaelRiley - 04 Nov 2010

Topic revision: r9 - 2011-03-08 - MichaelRiley
 