typo Command typo
Detect possible typographical and spelling errors
typo [-nrs] [file ...]
typo proofreads an English-language document for typographical
errors. It conducts a statistical test of letter digrams and
trigrams in each input word against digram and trigram
frequencies throughout the entire document. From this test, typo
computes an index of peculiarity for each word in the document. A
word with a high index is less like the other words in the
document than a word with a low index. Built-in frequency tables
ensure reasonable results even for relatively short documents.
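
The idea of scoring a word by how unusual its digrams and trigrams are can be sketched in Python. This is only an illustration: this page does not give the formula typo uses, so the scoring below (mean surprisal of a word's digrams and trigrams, measured against the document's own counts) is an assumption, and the names ngrams and peculiarity are hypothetical. The built-in frequency tables are not modeled.

```python
import math
from collections import Counter

def ngrams(word, n):
    """Return the letter n-grams of a word padded with one space at each end."""
    padded = " " + word.lower() + " "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]

def peculiarity(words):
    """Map each distinct word to an illustrative peculiarity score.

    Assumed scoring, not typo's own: the mean of -log(relative frequency)
    over the word's digrams and trigrams, using counts from the document.
    """
    words = [w.lower() for w in words if w]
    tables = {}
    for n in (2, 3):
        counts = Counter(g for w in words for g in ngrams(w, n))
        tables[n] = (counts, sum(counts.values()))
    scores = {}
    for word in set(words):
        surprisals = []
        for n, (counts, total) in tables.items():
            for g in ngrams(word, n):
                surprisals.append(-math.log(counts[g] / total))
        scores[word] = sum(surprisals) / len(surprisals)
    return scores

if __name__ == "__main__":
    sample = ["the"] * 20 + ["then"] * 5 + ["thxq"]
    for word, score in sorted(peculiarity(sample).items(), key=lambda p: -p[1]):
        print(word, round(score, 2))
```

With this scoring, a word built from letter sequences that are rare in the document ranks above words built from common ones, which is the behavior described above.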
typo reads each input file (or the standard input if none is
given), and removes punctuation and non-alphabetic characters to
produce a list of the words in the document. To reduce the volume
of the output, typo compares each word against a small dictionary
of technical words and discards it if found. The output consists
of a list of unique non-dictionary words, each with its index of
peculiarity, most peculiar first. An index higher than ten
indicates that the word almost certainly occurs only once in the
document.
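
The processing steps just described (extract words, screen them against a dictionary, list the unique survivors with the most peculiar first) might look roughly like the following Python sketch. The function names, the lowercasing of words, and the choice to score words before screening are assumptions made for illustration; the scoring function is passed in as a parameter and is not typo's own.

```python
import re

def extract_words(text):
    """Split text into lowercase alphabetic words, dropping everything else."""
    return re.findall(r"[a-z]+", text.lower())

def report(text, dictionary, score):
    """Return (index, word) pairs for non-dictionary words, highest index first.

    `score` is any callable that maps a list of words to a dict of
    word -> index; it stands in for the statistical test.
    """
    words = extract_words(text)
    scores = score(words)
    candidates = set(words) - set(dictionary)
    return sorted(((scores[w], w) for w in candidates), reverse=True)

if __name__ == "__main__":
    # Toy scoring for the demonstration: rarer words get a higher index.
    def rarity(ws):
        return {w: 1.0 / ws.count(w) for w in set(ws)}

    text = "The cat sat. The cqt sat! The cat ran."
    for index, word in report(text, {"the", "sat"}, rarity):
        print(f"{index:.2f} {word}")
```

Note that the simple pattern here splits a word such as "don't" into two words; how typo treats apostrophes is not stated on this page.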
typo recognizes the following arguments:
-n Inhibit use of the built-in English digram and trigram
statistics, and inhibit dictionary screening of words. More
words will be output and the indices of peculiarity will be
less useful for short documents.
-r Inhibit the default stripping of nroff escape sequences.
Normally, typo strips lines beginning with `.' and removes
nroff escape sequences introduced by `\'.
-s Produce output files digrams and trigrams that contain,
respectively, the digram and trigram frequency statistics
for the given document. No indices of peculiarity are
calculated or printed. If desired, these files may be
installed in directory /usr/dict.
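
The default nroff stripping that -r turns off can be approximated as follows. This is a rough Python sketch, not typo's implementation: the page does not say which escape sequences are recognized, so the pattern below assumes a few common forms (\fX font changes, \(xx special characters, \*x strings, and a backslash followed by any single character). The name strip_nroff is hypothetical.

```python
import re

# Assumed escape forms, tried in order: \fX, \(xx, \*x, then backslash
# plus any one character. Real nroff has more forms than these.
ESCAPE = re.compile(r"\\(?:f.|\(..|\*.|.)")

def strip_nroff(lines):
    """Drop request lines starting with '.' and remove backslash escapes."""
    kept = []
    for line in lines:
        if line.startswith("."):
            continue  # nroff request line such as .SH or .PP
        kept.append(ESCAPE.sub("", line))
    return kept

if __name__ == "__main__":
    source = [".SH NAME", "a \\fBbold\\fR word", ".PP", "dash\\(emdash"]
    for line in strip_nroff(source):
        print(line)
```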
***** Files *****
/tmp/typo* -- Intermediate files
/usr/dict/dict -- Limited dictionary
/usr/dict/digrams -- Digram frequency statistics
/usr/dict/trigrams -- Trigram frequency statistics
***** See Also *****
commands, nroff, sort, spell
COHERENT Lexicon Page 1