global menu:
WORD COUNT and ANALYSIS
(also see: the UXUTL or the Txtutils sets for UNIXish WC)
1. WCNT- Count and analyze word frequency in text and HTML documents.
2. WC- Simple word count program.
3. wc24- Word counter also counts sentences, calculates readability index.
4. TI- ("Text Information") Comprehensive text file statistics generator.
1. WCNT:* * * * One of the more comprehensive "word count" programs I've encountered. It includes a host of options: Can analyze HTML documents (ignores tags in word counts). Count of lines, characters, non-whitespace characters, words, distinct words and unique words. Average length of words, distinct words and unique words. Sorted word lists with frequencies. Word length distribution histograms. Configurable word sets. DOS code page awareness. Multiple filespecs with wildcards: Outputs combined statistics of all files when passed a filespec with wildcards. Donationware. Author: Branko Radovanovic, Croatia (1997)
2. WC: * * * A DOS clone of the Unix WC utility with some added features. Unlike WCNT, WC 1) lists individual file stats when passed a filespec with wildcards. 2) Can read from standard input as well as from files. WC also generates error level values for use in batch files. Author: Roman Nurilov.
3. wc24: * * * Another word count program that can optionally count sentences and generate a rough and ready "readability" index based on a combination of word length and sentence length. Can read from standard input as well as from files. Author: J. R. Ferguson, Netherlands (1996)
4. TI: unrated [added 08-19-99]. This program generates a wealth of statistics about a text file including size of file, whitespace, lines, blank lines, shortest line, longest line, average line length, average with blanks, number of pages, lines per page. Expects a single filename.
Options:
-A# Display how many times each letter is used.
the '#' represents an optional character to be counted specifically.
-C# Same as '-A' switch, except # is the numerical ASCII value of the
character to be counted specifically (A = 65 or 97, @ = 64).
-F# Treat words > # characters as long when calculating fog index *
# must be included in -F switch (default 9 if no -F switch)
-L# Display length of each line in the file. # is an optional number
specifying how long a 'long' line is, in characters
-O:name Prints output to both the screen and an output file, the name of the
output file is optional - the input name with .ti extension will be
used if no name is given. 'ti input.txt > output.txt' will not work.
-P# The number of lines per page for calculating number of pages.
-T Assume non-text file, changes handling of ascii values 128-255.
-W Display a list of word lengths used in the file.
-? This help screen.
Author: Quentin J. Christensen, Australia (1999), Home Page
ASCII TEXT SPELLERS
1. JSPELL- Excellent interactive spell checker (English dictionary).
* * * * * [added 9-17-98 updated 10-25-98]
When considering both ease-of-use and versatility, you won't find a better choice than JSPELL. Note: JSPELL may not run on some faster Pentiums (divide overflow error)- use SLOWDOWN to avoid the error.
Author: Joohee Jeong (1998); Suggested by Robert Bull, Scott Nesbitt.
New in v2.1x (10-98): "Added a feature that can omit lines starting with > or any other string specified by the user in the configuration file jspell.cfg. This feature is useful in spell-checking a reply to an email message. This version is freeware. (No need for the registration code file.) "
download jspel211.zip (209K)
2. SpellTest- Spell checker for plain or html text; interactive mode or file report. (English dictionary)
unrated [added 04-18-99]
This speller could be particularly useful to web authors because it ignores HTML codes in documents during a spell check. SpellTest can run in two modes: 1. A simple interactive mode allows manual replace of unknown terms - but has no features like "ignore all" or "add to custom dic"; 2. SpellTest probably functions best as a report-to-file speller. Reported terms are referenced by original document line numbers. No limit on text file sizes. Includes a large 2MB dictionary and user dictionaries are supported. Requires a fast 386+ PC and about 2MB RAM. Author: Stepanyuk Oleg (1999).
Usage : spelltst.exe <file> <options>
Options:
-r:<report name> , by default report.txt
-n Dont load addishional dictionaries.
-o Online error fixing. (Ascii text files only).
-nr Dont create a report file.
download spelltst.zip (about 1 MB)
3. GDSpell- Interactive spell checker handles big files. (English dictionary)
* * *
GDSpell (SPELL) is an easy to use stand-alone spell checker from the developers of the freeware NE editor. Both programs use the same dictionary- so you don't need to clutter your hard disk with different dictionaries. Unlike NE, SPELL can spell check big files. Can create and use a custom dictionary. Spell checking dialog is similar to those found in popular word processors.
Limitations:
EXE size: 55K; Dictionary size: 370K
(Thanks to Yves Bellefeuille's freeware list for pointing me to this one.)
download gdsp300b.zip (420K)
4. ISPELL- Interactive spell checker.
unrated
ISPELL is an interactive spell checker developed for UNIX platforms. ISPELL can be used as a stand-alone program or as an external checker for many power editors. I'm aware of at least three DOS manifestations of ISPELL representing the older v4.0 and the newer 3.1x versions. There are some capabilities which are absent in ispell 4.0 vs 3.1x. V4.0 is not case sensitive, suffix handling is more primitive and it won't allow non-alphabetical characters into the dictionary.
1. GNU ISPELL 4.0. [added 4-6-98] This old but widely distributed ISPELL includes only an English dictionary. Run ISPELL without parameters to check a single word, or pass ISPELL a filespec and it will enter a line-by-line interactive check / correction mode. ISPELL can check multiple files in sequence if you pass it a wildcarded filespec. Uses a 38,000 word dictionary (156K on disk). The package lacks usage documentation and unless you're familiar with ISPELL, you could end up frustrated. Just hit the "?" key when inside the program to get the list of navigation commands. Easy to use. I'm sure there are additional hidden features, but I haven't used it much. 16-bit. Runs briskly enough on a 386/20. (1993)
download ispel40x.zip (260K)
2. International ISPELL 3.1.20b. [added 7-3-98 updated 06-09-00] Untested. This is a 32-bit DJGPP compiled port which includes an English dictionary (# of words?). Requires a 386+ and a DPMI-provider (cwsdpmi.exe). (2000)
06-09-00: May 2000 new build.
download isp3120b.zip (690K)
3. International ISPELL 3.1.20. [added 4-6-98] This 8-bit EMX/gcc compiled DOS port minimally requires a 386 PC, but I'd recommend a fast 486 or Pentium with at least 8MB RAM and a disk cache. The package is a very large download (2.5MB zip), containing executables, source, and multiple language dictionaries (Dutch, English, French and German). The compiled English dictionary requires about 4.7MB disk space (contains at least 210,000 unique words including many technical and scientific terms). Supports maintenance of a user ("private") dictionary, which by default is stored in the root directory with the filename _english. All in all, I like the comprehensiveness and "intelligence" of this ISPELL. The program itself loads slowly on a Pentium 60 (w/ 8MB RAM), and is much too slow on a 386/20 (8MB). Requires use of ANSI.SYS or equivalent. I wouldn't waste time downloading this package unless you're willing to invest a _little_ time with setup.
download ispellw32.zip (2.5Mb)
5. Tschek- Spell checker outputs list of all misspelled words to screen or file.
* * *
Most of us use word processors or stand-alone dialog spell checkers to perform "on-the-fly" spell checking and correction (e.g., GDSpell). But sometimes these spell checkers can be cumbersome and time consuming because they prompt word by word. If you are spell checking an HTML or technical document with a "dumb" spell checker, this can be tedious. Of course, you could add all the those strange words or tags to a user dictionary, but that's no fun either. Or, you could use a spell checker that simply outputs a list of unrecognized words to a file without any prompting or correction. You can browse the output file, quickly locate words that are obvious typos, and manually correct the original document (e.g., using a search /replace tool).
Features:
Limitations:
BIGSPELL.BAT
rem USAGE: bigspell any.txt @echo off spell 1.dic %1 1.tmp /b spell 2.dic 1.tmp 2.tmp /b spell 3.dic 2.tmp 3.tmp /b spell 4.dic 3.tmp 4.tmp /b spell 5.dic 4.tmp 5.tmp /b spell 6.dic 5.tmp misspell.txt /b del *.tmp echo Spell check complete. See MISSPELL.TXT
download tschek15.zip (68K)
DICTIONARIES, ENCYCLOPEDIAS
Words- ASCII English word list also functions as dictionary for some spellers.
unrated
Words consists of four text files containing approximately 110,000 English words total (including plural and other word forms). I use it as a large dictionary for spellers that can use ASCII-only dictionaries. See Tscheck for an application of Words.
download words1.zip (A - D, 95K)
download words2.zip (E - K, 75K)
download words3.zip (L - R, 99K)
download words4.zip (S - Z, 85K)
JORJ- English dictionary for DOS.
* * *
JORJ is a stand-alone dictionary program for DOS. Several executable versions are packaged together (compiled for different memory usage). Jorj can be run in memory resident (pop-up) or non-memory resident modes. One of the provided executables ("omega") will use XMS memory when the program is run as a tsr.
One unique feature of Jorj is it's ability to search for entries even when your spelling is incorrect. Jorj also has a "word scan" feature that will list all entries containing a given search string. The lexicon has some significant drawbacks. The word list is small but adequate (larger in registered version) and definitions are brief- and not authoritative. Words are syllabified, but parts of speech are lacking. Even with these shortcomings, JORJ still serves as a handy reference.
EXE size: 35K (alpha) or 64K (omega). Dictionary size: 1.2MB
download jorj97.zip (650K)
Penc- Text-only encyclopedia.
* * [updated 11-10-98]
Penc is a text-only encyclopedia (not a program) comprehensive enough to be useful. An encyclopedia this small can't add much detail to entries- and Penc doesn't. Penc should not be viewed as "authoritative" since there is no reference to author's source information. The text version is obviously difficult to navigate (hint: use LIST viewer and its find function). There are online encyclopedias on the Net (including the updated, but commercial/ copyrighted online version of Penc) that are easier to use and won't gobble up your disk space. Beyond these limitations, Penc may be a useful reference for you. Penc is divided into major topical sections, within which entries are sorted alphabetically. There are some areas covered which some "popular" CD-ROM dictionaries often neglect. Author: Servile Software.
This encyclopedia now appears to be a shareware/commercial product- the final freeware versions are still available below, but are no longer being updated.
download plain text version ...
PENC1G.ZIP (1.7 MB; 8-98)
or download HTML version ...
penc1ih.zip (3.6 MB, 12-98)
[ Go to Top | Front Page ]
© 1994-2001. Rich Green