|
|
1.1 root 1:
2:
3: prep Command prep
4:
5:
6:
7:
8: Produce a word list
9:
10: pprreepp [ -ddffpp ] [ -ii _i_f_i_l_e ] [ -oo _o_f_i_l_e ] [ _f_i_l_e ... ]
11:
12: The command prep prepares a word list that is useful for statis-
13: tical processing from the textual data found in each input file.
14: If no file is given, prep reads the standard input for text.
15:
16: For the purposes of prep, a word consists of a string of al-
17: phabetic letters and apostrophes. Words are written, one per
18: line, to the standard output. Hyphenated words are treated as
19: two words. However, any word hyphenated between two lines is
20: rejoined as one word.
21:
22: prep recognizes the following options:
23:
24: -dd Print a sequence number (of words in the input text) before
25: each output word.
26:
27: -ff Fold upper-case letters into lower case. This is sometimes
28: useful for producing unique lists of words.
29:
30: -ii _i_f_i_l_e
31: Ignore words found in ifile. ifile has words one per line
32: that are matched again each input word independent of case.
33:
34: -oo _o_f_i_l_e
35: Print only words found in ofile. Only one of -i or -o may
36: be specified.
37:
38: -pp In addition to printing words, also print each punctuation
39: character (printable, non-numeric characters that separate
40: words), one per line. These lines are not counted for -d.
41:
42: ***** See Also *****
43:
44: commands, deroff, sort, spell, typo, wc
45:
46: ***** Notes *****
47:
48: What constitutes a word is different in deroff, prep, and wc.
49:
50:
51:
52:
53:
54:
55:
56:
57:
58:
59:
60:
61:
62:
63:
64: COHERENT Lexicon Page 1
65:
66:
This archive runs on limited infrastructure. Preserving old code on modern bandwidth. Automated agents are requested to crawl responsibly.