By downloading this code you agree to the
Source Code Use License (PDF). |
{ version = 1.17; (* of winrs.p 2016 Jan 25} (* begin module describe.winrs *) (* name winrs: Compute Rs/window vs window in a single program synopsis winrs(book: in, cmp: in, winrsp: in, xyin: out, output: out) files book: A book from Delila. Only the first DNA piece in the book is used. cmp: A composition from the comp program. If cmp is empty, then equal frequencies are assumed. winrsp: : Parameters to control the program: one per line StartCoordinate (integer): The base of the sequence to be used for the zero coordinate of the first window. For simplicity, internal coordinates are used: the sequence begins at base 1. MinimumWindowSize (integer): The shortest window to use for the computations. MaximumWindowSize (integer): The longest window to use for the computations. SpaceBetweenWindows (integer): Windows are spaced so as to not collide. You can add extra distance with this parameter. If it is negative, the windows will overlap. NumberofWindows (integer): The number of windows to use for each computation. * The program will object if any of these parameters cause a window to go outside the sequence. xyin: output of this program, input to the xyplo graphics program. columns: 1: window size 2: Rs, Rsequence, bits 3: SD, Standard Deviation of Rs, bits 4: Rs/(window size), bits/base 5: SD/(window size), bits/base output: messages to the user description A window size is chosen. The first sequence in the book is broken into fragments of this size. These are aligned and the information content, Rsequence (Rs) is computed. This process is repeated for a range of window sizes defined in the parameters. The output is a data file (xyin) containing columns for window size (w), Rs and the standard deviation of Rs (SD). There are also two normalized columns for Rs/w and SD/w, which are reports in bits per base. Repeating structures will appear as peaks in a graph of Rs/w versus w. examples documentation [1996 April 8] see also lidel.p, winscan.p, delila.p, encode.p, rseq.p, xyplo.p author Thomas Dana Schneider bugs The program should produce a xyplop so that it can put the genomic information on the curve automatically. technical notes Paremeter dnamax must be set as large as possible so that the entire sequence can fit into a single array and therefore be accessed as rapidly as possible. Since Delila stores sequences with 2 bits/base (assuming a good compiler) so one can compute the size of the array in bytes. *) (* end module describe.winrs *) {This manual page was created by makman 1.45}{created by htmlink 1.62}