Difference: NGramSymbols (1 vs. 5)

Revision 5 2012-03-04 - BrianRoark

Line: 1 to 1
 
META TOPICPARENT name="NGramQuickTour"

NGramSymbols

Line: 21 to 21
 

Caveats

Deleted:
<
<

-- MichaelRiley - 09 Dec 2011

Revision 4 2011-12-14 - BrianRoark

Line: 1 to 1
 
META TOPICPARENT name="NGramQuickTour"

NGramSymbols

Description

Added:
>
>
Command-line utility that produces a symbol table from an input text corpus. It creates a symbol entry for every word type (distinct token) in the corpus, plus <epsilon> at index 0 and an out-of-vocabulary (OOV) symbol as the last entry in the table. The command-line options --epsilon_symbol and --OOV_symbol set the labels used for these two special symbols.
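To make the table layout concrete, here is a minimal awk sketch that emulates the behaviour described above. It is not the tool's implementation. It assumes whitespace-separated tokens, indices assigned in order of first occurrence, and a "symbol TAB index" text format, none of which are stated on this page. The function name make_symbols is hypothetical.

```shell
# Sketch only: emulate the described symbol table with awk.
# Assumptions (not from the page): whitespace tokenization, indices in
# order of first occurrence, one "symbol<TAB>index" pair per line.
make_symbols() {
  awk -v eps="${1:-<epsilon>}" -v oov="${2:-<unk>}" '
    BEGIN { n = 0; printf "%s\t%d\n", eps, n++ }   # epsilon label gets index 0
    {
      for (i = 1; i <= NF; i++)
        if (!($i in seen)) {                       # first occurrence of this type
          seen[$i] = 1
          printf "%s\t%d\n", $i, n++
        }
    }
    END { printf "%s\t%d\n", oov, n }              # OOV label is the last entry
  '
}

# Two-line toy corpus with four distinct word types.
printf 'the cat sat\nthe dog sat\n' | make_symbols
```

On the toy corpus this prints six lines: the epsilon label with index 0, the four word types with indices 1 to 4, and the OOV label with index 5.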
 

Usage

Added:
>
>
ngramsymbols [--options] [in.txt [out.txt]]
  --epsilon_symbol: type = string, default = <epsilon>
  --OOV_symbol: type = string, default = <unk>
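As an illustration of the options listed above, the following sketch overrides both special labels and uses the positional input and output arguments. The file names corpus.txt and corpus.syms and the labels <eps> and <UNK> are hypothetical choices made for this example. The guard simply skips the call when ngramsymbols is not installed.

```shell
# Hypothetical toy corpus; any plain-text file would do.
printf 'the cat sat\nthe dog sat\n' > corpus.txt

# Flags and argument order follow the usage text above; the label values
# and file names are illustrative assumptions.
if command -v ngramsymbols >/dev/null 2>&1; then
  ngramsymbols --epsilon_symbol="<eps>" --OOV_symbol="<UNK>" corpus.txt corpus.syms
else
  echo "ngramsymbols not found on PATH; skipping" >&2
fi
```

Quoting the label values keeps the shell from treating the angle brackets as redirections.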
 
 

Examples

Added:
>
>
$ ngramsymbols <earnest.txt >earnest.syms
 

Caveats

Revision 3 2011-12-13 - MichaelRiley

Line: 1 to 1
 
META TOPICPARENT name="NGramQuickTour"

NGramSymbols

Line: 6 to 6
 

Usage

Added:
>
>

Examples

 

Caveats

Revision 2 2011-12-10 - MichaelRiley

Line: 1 to 1
 
META TOPICPARENT name="NGramQuickTour"

NGramSymbols

Line: 6 to 6
 

Usage

Deleted:
<
<

Complexity

 

Caveats

Deleted:
<
<

References

-- MichaelRiley - 09 Dec 2011

Revision 1 2011-12-09 - MichaelRiley

Line: 1 to 1
Added:
>
>
META TOPICPARENT name="NGramQuickTour"

NGramSymbols

Description

Usage

Complexity

Caveats

References

-- MichaelRiley - 09 Dec 2011

 