


GENDAT(UCL)       MISC. REFERENCE MANUAL PAGES        GENDAT(UCL)



NAME
     gendat -- generate binary pattern vector files from text

SYNOPSIS
     gendat (-I) pvname.txt
     or
     gendat (-I) pvname <pvname.txt

DESCRIPTION
     gendat Produces standard format  PRW  pattern  vector  files
     from   text  descriptions  of  pattern  vectors.   The  text
     descriptions can be supplied in file on the command line  or
     on  the standard input.  For command line files, the pattern
     vector name is taken to  be  the  first  part  of  the  text
     filename;  for  standard  input,  the pattern vector name is
     supplied   directly.    gendat   produces    files    called
     <pvname>.dat  -  the main data file, <pvname>.lab - an index
     into the .dat file, and <pvname>.inf - a text file  contain-
     ing a description of the contents of <pvname>.dat.

TEXT FILE FORMAT
     For the most common case of one label per pattern, no  addi-
     tional   formatting  is  required,  although  identification
     labels for the classes are recommended.  In this  case,  the
     first  number  on  each line is taken to be the class label,
     and the rest of the line is taken to be  the  vector.   Note
     that  in  PRW,  labels are numbers only, usually assigned to
     classes in sequence as: 0,1,2,...  You assign names  to  the
     numeric  values  using  the  'label' text description.  Note
     also that by default, the patterns are assumed  to  be  two-
     class  continuous,  that  is only labels 0 and 1 are allowed
     and the classifier is simply a detector for  class  1.   Use
     the  "discrete"  text description to change to a multi-class
     discrimination task.

     Header lines in the text control the pattern  vector  confi-
     guration,  each line consists of a key word followed by 1 or
     2 arguments as listed below:

     input <len>
                Specifies the number of  class  labels  for  each
                pattern vector.  The class labels are taken to be
                the first <len> values  of  each  pattern  vector
                supplied.  Default 1.

     discrete <nclass>
                Specifies that the vectors are to be  used  in  a
                multi-class  discrimination  task, with labels in
                the region 0..<nclass>-1.  Default: continuous.

     label <class> <name>
                Allows the specification of descriptive names for



Sun Release 4.1         Last change: PRW1                       1






GENDAT(UCL)       MISC. REFERENCE MANUAL PAGES        GENDAT(UCL)



                each  pattern class.  For example 'label 0 VOI-',
                or 'label 1 VOI+'.  Default is  to  assign  names
                such as 'LABEL0', etc.

     The body of the file should consist of  pattern  vectors  in
     one  of  two  formats: (i) raw data, with one pattern vector
     per line (with class label(s) at beginning of each line), or
     (ii)  array  data, with a vector header specifying the class
     and numbers of rows and columns in the vector.   The  vector
     header is as follows:

     class <class> row <numrows> col <numcols>
                Where <class> is a single numeric value  indicat-
                ing the pattern class, <numrows> is the number of
                lines of the file over which the vector  extends,
                and  <numcols>  is  the  number of values on each
                line.  The overall length of the vector is  taken
                to be numrows*numcols.

     In addition, lines beginning with "#" are  treated  as  com-
     ments and are redirected into the .inf file for information.

OPTIONS
     -I         Identify program name and version number.

EXAMPLE
             discrete 3
             label 0 pot
             label 1 jam
             label 2 sid
             class 0 row 5 col 6
             0 1 1 1 1 0
             0 0 1 1 0 0
             0 1 1 1 1 0
             0 1 1 1 1 0
             0 0 0 0 0 0
             class 1 row 5 col 6
             ...etc

VERSION/AUTHOR
     3.0  Mark Huckvale

SEE ALSO
     prdat(PRW1)

BUGS
SOURCE
     /users/mark/sfs/man/../prw/util.src/gendat.c







Sun Release 4.1         Last change: PRW1                       2



