Target Sequence File
The sequence file of the target contains the one-letter codes of the residues
in the target protein. Any space, return, or tab in the file is
ignored. Any character outside the standard codes for the 20 amino
acids, e.g., X, Z, or 3, is treated as an unknown residue. The
file can be either in standard FASTA format (see example 1) or in a flexible
format (see example 2).
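The reading rules above can be sketched in a few lines of Python. This is only an illustration, not PROSPECT's own parser: the function name, the use of "X" as the unknown-residue placeholder, the upper-casing of input, and the reading of "flexible format" as plain residues without a header line are all assumptions.

```python
# The 20 standard one-letter amino acid codes.
STANDARD_AA = set("ACDEFGHIKLMNPQRSTVWY")
UNKNOWN = "X"  # assumed placeholder for an unknown residue


def read_target_sequence(text):
    """Return the residue string from FASTA or header-less sequence text."""
    residues = []
    for line in text.splitlines():
        if line.startswith(">"):
            continue  # FASTA description line, not part of the sequence
        for ch in line:
            if ch.isspace():
                continue  # spaces, returns, and tabs are ignored
            ch = ch.upper()  # assumption: lower-case letters are accepted
            residues.append(ch if ch in STANDARD_AA else UNKNOWN)
    return "".join(residues)
```

For example, `read_target_sequence(">t\nACD EF\nXZ3\tG\n")` yields `"ACDEFXXXG"`: the header and whitespace are dropped, and X, Z, and 3 all become unknown residues.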
PHD Secondary Structure Prediction
PROSPECT allows users to include a secondary structure prediction, either to identify the most probable loop positions on the target sequence or to find a fold whose secondary structures are most compatible with the prediction, with or without other energy terms. The secondary structure prediction can be obtained from the on-line server PHD developed by Burkhard Rost, or from the tool prospect_ssp. PROSPECT can read both the old PHD output format (see example 1) and the new PHD output format (see example 2). From the PHD output, PROSPECT uses "Rel", the reliability index (0-9) of the prediction, and "prH", "prE", and "prL", the "probabilities" (0-9) of assigning helix, strand, and loop, respectively.
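To make the four per-residue values concrete, here is a minimal Python sketch of how they might be held and used to flag likely loop positions. It does not parse PHD output and is not PROSPECT's algorithm: the class, the function, the tie-breaking order, and the reliability threshold `min_rel` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class PhdResidue:
    rel: int   # "Rel": reliability index of the prediction, 0-9
    pr_h: int  # "prH": helix "probability", 0-9
    pr_e: int  # "prE": strand "probability", 0-9
    pr_l: int  # "prL": loop "probability", 0-9

    def state(self):
        """Return the state with the highest value (ties go to H, then E)."""
        scores = {"H": self.pr_h, "E": self.pr_e, "L": self.pr_l}
        return max(scores, key=scores.get)


def confident_loops(residues, min_rel=5):
    """Return 0-based positions predicted as loop with Rel >= min_rel."""
    return [i for i, r in enumerate(residues)
            if r.state() == "L" and r.rel >= min_rel]
```

With this sketch, a residue whose loop value is highest but whose reliability is low is not reported, which mirrors the idea of keeping only the most probable loop positions.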
These files are passed to the program with the flag '-phdfile'.
Profile
To enhance threading performance, one may employ a profile
(frequency matrix) derived from a multiple sequence alignment of the
protein family of the target. PROSPECT understands two formats, the '.chk'
file and the '.freq' file, which hold essentially the same data in different
forms. PsiBlast produces a '.chk' file, a binary file consisting
of the sequence description followed by the profile. We have also
included a 'read_chk' program, which takes this binary file and
outputs the same data in the same layout, but in ASCII. The
advantage is that the result is human-readable and, more importantly, platform
independent. Binary files are subject to the byte order of the machine
they were created on, which can cause incompatibility between machines with
different byte orders, say Mac and PC. We call the ASCII
conversion of a checkpoint file a 'freq file'. Checkpoint files can be used via the command line argument '-chkfile'.
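The byte-order problem can be shown with Python's standard `struct` module. This sketch makes no assumption about the actual layout of a '.chk' file; it only shows how one integer written on a little-endian machine is misread on a big-endian one, and why a text form avoids the issue. The function names are illustrative.

```python
import struct


def write_int(value, byte_order):
    """Pack a 32-bit integer; byte_order is '<' (little) or '>' (big)."""
    return struct.pack(byte_order + "i", value)


def read_int(data, byte_order):
    """Unpack a 32-bit integer using the given byte order."""
    return struct.unpack(byte_order + "i", data)[0]


# Written on a little-endian machine, read back with the same byte order.
raw = write_int(1, "<")
same_machine = read_int(raw, "<")

# The same four bytes interpreted on a big-endian machine.
other_machine = read_int(raw, ">")

# An ASCII representation is identical regardless of byte order.
as_text = str(1)
```

Here `same_machine` is 1, while `other_machine` is 16777216, because the bytes `01 00 00 00` are read in the opposite order. The text form `"1"` reads the same everywhere, which is the benefit of converting the checkpoint file to ASCII.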
Template Lists
Sometimes you do not want to search against the whole database,
but rather just a subset. PROSPECT takes a list of templates to run
via the -tfile <file> argument. By default, the template list
is $PROSPECT_PATH/data/parameters/fssp.list.
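A subset list could be derived from the full list with a small script such as the Python sketch below. The list format is not described here, so the sketch assumes one template identifier per line; the function name and the identifiers in the usage note are hypothetical.

```python
def select_templates(full_list_text, wanted):
    """Keep only the wanted template names, preserving the original order.

    Assumes one template identifier per line (an assumption, not a
    documented property of the list format).
    """
    wanted = set(wanted)
    kept = []
    for line in full_list_text.splitlines():
        name = line.strip()
        if name and name in wanted:
            kept.append(name)
    return "".join(name + "\n" for name in kept)
```

For instance, `select_templates("1abc\n2xyz\n3def\n", {"1abc", "3def"})` returns `"1abc\n3def\n"`. The result can be saved to a file and passed with the -tfile <file> argument.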