

SIMLIS v1.1 -- Similarity sort of text file lines.

Usage:
  SIMLIS [-dnp] [-sxx] [-cx] [-o...;...] < infile [{>|>>} outfile]

Options: 
 -d   displays similarity factor (0..1000) as output line header.
 -sxx skips assumed (fixed length xx) header data in every line.
 -n   includes digits and special characters  :;<=>?@ in evaluation.
 -p   makes process item position sensitive:
      SIMLIS takes into account the relative item position within the lines
      when comparing. Assume 2 lines:
      "dog bites man"   and
      "man bites dog"
      only the -p option would see a difference (lower grade similarity).
 -cx  everything before first character x (if present) within a line
      will be ignored.
 -o....;...;...  strings given here are ignored in evaluation,
      max. 10 strings, max 14 chars. each.
            

Notes:
   SIMLIS v1.1 is a MSDOS application with filter functionality
   that sorts the lines of a text file  depending of the similarity of
   the items contained (words, character sequences, etc). 
   It takes every line from standard input, evalutes the similarity in
   relation to all other lines und inserts it in the appropriate line 
   position of standard output. The default process works case insensitive,
   alpha characters only. It sacrifices speed to precision, don't
   expect fastest execution. In any case output goes to stderr (usually 
   display) too.
   Restrictions: max.15000 lines, max.line length 500 bytes.

Examples:
SIMLIS -d <readme.txt
     readme.txt ist sorted and as header of each output line the
     internal calculated 'similarity factor' (range 0...1000) is given.
     This factor means degree of similarity to the respective previous line.

SIMLIS  -pn <readme.txt >readme.srt
     readme.txt ist sorted, including the contained numerics and some
     special characters and output goes to readme.srt.
     -p produces a more logical result (if input is structured accordingly).
       
SIMLIS -s33 -c: -ofree;freeware;win;windows <00_index.txt       
     If 00_index.txt is a SIMTEL formatted index file, the lines will be kind
     of keyword sorted, ignoring the (here irrelevant) leading 33 bytes 
     headers. The first colon ':' (if present) within each line  starts
     operation, everything before is ignored. 
     The words "free", "freeware", "win", "windows" are ignored too,
     assuming these word should not have special relevance for distinction.
     
     
     
Status of the program and distribution:
SIMLIS v1.1 is FREEWARE.
It can be freely distributed in its unmodified form and 
be included in any software collection such as CD-ROM's but may NOT be sold.  

Installation:
As you can read this you are ready to run SIMLIS.EXE right here,  
no need for further installation procedures.

History:
vers. 1.1:
New options -c and -o,
faster operation and other minor improvements.
vers. 1.0:
First published version.


Compatibility:
The same SIMLIN version should be available for WIN32, LINUX, HP-UX.

Comments, suggestions, requests for information to:
joda@SDF.LONESTAR.ORG
File access:
ftp://SDF.LONESTAR.ORG/pub/users/joda/simlis

Legal Stuff:
Copyright 1999, Joachim Dathe ("The author")

THE SOFTWARE IS PROVIDED "AS-IS" AND WITHOUT WARRANTY OF ANY KIND, EXPRESSED,
IMPLIED OR OTHERWISE, INCLUDING AND WITHOUT LIMITATION, ANY WARRANTY OF
MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.  IN NO EVENT SHALL THE
AUTHOR BE LIABLE FOR ANY SPECIAL, INCIDENTAL, INDIRECT OR CONSEQUENTIAL DAMAGES
WHATSOEVER (INCLUDING, WITHOUT LIMITATION, DAMAGE FOR LOSS OF PROFITS,
BUSINESS INTERRUPTION, LOSS OF INFORMATION, OR ANY OTHER LOSS) , WHETHER OR NOT
ADVISED OF THE POSSIBILITY OF DAMAGES, AND ON ANY THEORY OF LIABILITY, ARISING
OUT OF OR IN CONNECTION WITH THE USE OR INABILITY TO USE THIS SOFTWARE.

