------------------------------------
CPT Word Lists 1.1.4

Win32 version, MS JVM (jview) or Sun's
Java JDK/JRE 1.1, 1.2 or 1.3 required.

Shareware, runs in demo mode until registered.
Freely distributable.

Updated: 31-July-2001
------------------------------------

DESCRIPTION
-----------
CPT Word Lists is a collection of tools for processing word
lists and text files, supporting Unicode and an unlimited
number of encodings via the Java converters. Its main goal
is to create dictionaries for the other CPT programs,
but it can also be used as a completely independent program.

Features:

The set of operations over textual (plain text or HTML)
files includes:
- browsing/searching in any standard encoding including
  decomposition and bidi support;
- extracting words, calculating letter and word frequencies;
- flexible 'word' definition and filtering;
- changing encoding and letter case;
- standard Unicode and custom normalizations;
- visual/logical order conversion for RTL scripts;
- simple spell checking and tagging.
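
As a rough illustration of what word extraction and word
frequency counting involve, here is a minimal Java sketch.
It is not the program's own code: the class name, the
method name and the 'word' definition (a run of letters,
lowercased) are assumptions made for this example, and it
is written for a current JDK rather than Java 1.1.

```java
import java.util.Map;
import java.util.TreeMap;

class WordFrequencySketch {

    // Counts how often each word occurs in the text.
    // Assumption: a "word" is a maximal run of letters, compared in lower case.
    static Map<String, Integer> countWords(String text) {
        Map<String, Integer> counts = new TreeMap<String, Integer>();
        StringBuilder word = new StringBuilder();
        // Loop one position past the end so the last word is flushed too.
        for (int i = 0; i <= text.length(); i++) {
            char c = i < text.length() ? text.charAt(i) : ' ';
            if (Character.isLetter(c)) {
                word.append(Character.toLowerCase(c));
            } else if (word.length() > 0) {
                String w = word.toString();
                Integer old = counts.get(w);
                counts.put(w, old == null ? 1 : old + 1);
                word.setLength(0);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        Map<String, Integer> counts = countWords("The cat saw the other cat.");
        // Print one "word<TAB>count" line per word, in sorted order.
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            System.out.println(e.getKey() + "\t" + e.getValue());
        }
    }
}
```

The program itself lets you configure the 'word' definition
and filtering; the fixed rule above is only the simplest case.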

The word lists (text or dictionary format) operations
include the above plus:
- creating highly compressed dictionaries optionally
  including tags and definitions (one million words
  can be stored in a browsable file of less than 1 MB);
- several types of sorting including user-defined
  order and alphabets (80 alphabets supplied);
- compare/add/delete functions over dictionaries;
- global assignment of tags and extracting
  subsets via selected tags;
- automatic or user-defined suffix packing;
- via user definitions: creating and expanding
  munched lists, creating and filtering tagged lists,
  translating tags in tagged lists, tagging;
- searching and extracting word patterns;
- creating inverted indexes;
- 3 levels of protecting the dictionaries.

More details can be found in the HTML documentation.
Differences from earlier versions are listed in Changes.txt.

There is a supporting program CPT Dictionary
for browsing the files created by CPT Word Lists.
CPT Dictionary is free for non-commercial use and
you can distribute it together with your dictionaries.

SYSTEM REQUIREMENTS
-------------------
- Supported OS: Windows 95/98/ME/NT/2K
  (for Linux there is a separate distribution);
- Requires 1 MB of disk space and 32 MB RAM;
- This is a Java program and requires Sun's JDK/JRE 1.1.3
  or later, or a compatible VM. Sun's Java Runtime
  Environment is available for download at:
    http://java.sun.com/products/
  (version 1.1.8 is 5 MB, 1.2.2 is 12 MB, 1.3.1 is 7.8 MB)
  For Java 1.1, please see our JavaFonts.txt
  for tuning your JDK/JRE installation.
NOTE: Java 1.2 on Win 95/98 has problems with non-ANSI
characters. If you want to display international
characters, use 1.1 or 1.3.

  If you prefer the MS JVM (JView is included in every
  Windows version before Windows XP) and your version does
  not support JNI (it gives "...UnsatisfiedLinkError"),
  get a newer one from:
    http://www.microsoft.com/java/vm/
  (the recent versions are about 6 MB)
NOTE: The MS JVM has a faster GUI but supports a limited
number of character encodings, so CPT programs may report
an error such as "...Converter not found!".
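
To see in advance whether your Java VM has a converter for
a particular encoding, a small check along the following
lines may help. This is only a sketch: the class name and
the sample encoding names are assumptions, and it is not
how CPT Word Lists itself looks up its converters. It relies
only on the String(byte[], String) constructor, which throws
UnsupportedEncodingException when no converter is present.

```java
import java.io.UnsupportedEncodingException;

class EncodingCheck {

    // Returns true if this Java VM can decode bytes in the named encoding.
    static boolean isSupported(String encoding) {
        try {
            // Decoding an empty array is enough to trigger the converter lookup.
            new String(new byte[0], encoding);
            return true;
        } catch (UnsupportedEncodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Sample names only; pass the encodings you need as arguments.
        String[] names = args.length > 0
                ? args
                : new String[] {"UTF8", "Cp1252", "ISO8859_1"};
        System.out.println("Java " + System.getProperty("java.version")
                + " (" + System.getProperty("java.vendor") + ")");
        for (int i = 0; i < names.length; i++) {
            System.out.println(names[i] + ": "
                    + (isSupported(names[i]) ? "available" : "not available"));
        }
    }
}
```

Run it with the same VM you configured for CPT Word Lists,
since the set of available converters differs between VMs.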

INSTALL
-------
1. Extract this zip file into a temporary directory.

2. Edit "install.bat" to reflect your Java VM.
   Run it from the temporary directory.
   This will start the wizard and according to your
   choices, the CPT Word Lists program will be installed.

3. The installation program (install.class) is a
   self-extracting class file whose contents get
   extracted during the installation and two
   directories will be created:
   - the target one chosen for the installation;
   - <user-home>\ITJ directory for the uninstall program
   (see UNINSTALL below).
   Note that CPT Word Lists is a single-user application
   and the user running the program should have
   full access to the installation directory.

4. During the installation you will be asked for a
   License Key. If you don't have one, leave the field
   empty and the program will run in Demo mode. In this
   mode the size of word lists/dictionaries is
   limited to 500 words and the optional Thai
   composition is not available.

5. If you have a problem running CPT Word Lists,
   check/modify the generated cpt_wl11.bat file to
   reflect your JDK/JRE environment, especially
   if you installed a new version of the JRE after
   installing this program.

UNINSTALL
---------
To uninstall, do one of the following:

1. Click on the uninstall icon added to the
   start menu or desktop folder.

2. Go to the Add/Remove Programs dialog in the
   Windows Control Panel and remove the program.

3. Run from the command line:
   <user-home>\ITJ\juninst <CPT-home>\UnInst
   where <CPT-home> is the installation directory,
   and <user-home> for Win 2K is:
     c:\Documents and Settings\<user-name>
   for NT it is:
     c:\winnt\Profiles\<user-name>
   and for ME or single user Win 95 it is:
     c:\windows

If you have installed a new version of the JRE after
the installation of this program, check/modify
juninst.bat in your <user-home>\ITJ subdirectory.
After the uninstallation the ITJ directory
will not be removed because it serves all CPT
packages. If you don't have any other CPT
program, you can delete it.
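
If you are unsure where <user-home> is on your system, the
following sketch prints the home directory as seen by your
Java VM, together with the ITJ directory beneath it. It
assumes that <user-home> corresponds to Java's standard
"user.home" system property; this README does not state
that explicitly, and the class and method names are made up
for the example.

```java
import java.io.File;

class UserHomeSketch {

    // Builds the path of the ITJ directory under the given home directory.
    static File itjDir(String userHome) {
        return new File(userHome, "ITJ");
    }

    public static void main(String[] args) {
        // Assumption: <user-home> is the value of the "user.home" property.
        String home = System.getProperty("user.home");
        File itj = itjDir(home);
        System.out.println("user.home = " + home);
        System.out.println("ITJ dir   = " + itj.getPath()
                + (itj.isDirectory() ? " (exists)" : " (not found)"));
    }
}
```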

LICENSE AND REGISTER
--------------------
See License.txt and Register.txt for
license and registration information.

CONTACT
-------
We are very interested in receiving your comments,
suggestions, and bug reports at our email:
cpt.software@usa.net
