Intro and Install
| FAQ | Command Line
Tools
prospect |
|
Primary tool for protein
threading. Includes pthread support to speed up threading against
a library on a multi-core system |
prospect_mpi |
An MPI enabled version of
Prospect, threads multiple templates on different computers simlatiously |
|
prospect_convert |
Sort and convert alignments
output by prospect |
|
prospect2modeller |
Convert a set of prospect files
into a modeller run |
|
prospect_psipred.pl |
Perl Script to run PsiPred
secondary structure prediction and convert the results to the SSP format |
|
template_gen.pl |
A Perl Script Generate your own
threading
template based on a PDB file |
|
template_gen.py | A Python Script to Generate your
own threading templates based on a PDB file (this one is considered to
be more 'current') |
|
get_chk_file.pl |
Generate a checkpoint file with
PsiBlast |
|
read_chk | Parse the checkpoint file output by PsiBlast |
Tool Examples
prospect_psipred.pl d1ash__.seq
Run BlastPGP on a FASTA file, then run
PsiPred to predict the secondary structure, and then convert the data
to a format usable by prospect
prospect -fssp d1ash_.ssp -o out.d1ash__.xml
Run prospect on the SSP file
d1ash__.ssp, and save the results in out.d1ash_.xml
prospect_convert out.d1ash__.xml -list -raw
List contents of out.d1ash__.xml,
sorted by RawScore
template_gen.py d1ash__.ent > d1ash__.xml
Generate a template file for the PDB
file d1ash__.ent
prospect -fssp d1ash__.ssp -ftemplate d1ash__.xml -o out.d1ash__.xml
Run prospect on the SSP file
d1ash__.ssp against the specific template d1ash__.xml and save the
results in out.d1ash__.xml
prospect -fxml d1ash__.xml -ftlist test.list -o out.d1ash__.xml -ncpus 1:2 -zscore -zscore_full -zcycles 100 -zscore_energy -wname vardel -oweights
Run prospect on the xml file
d1ash__.xml against the template list in file test.list and output to
out.d1ash__.xml. The multiprocessing option should do one
threading at a time, but use 2 processors when possible (usually when
calculating the zscore). A request to calculate the lite zscore
and the full zscore, and do a sampling of 100 decoys. Also
include the zscore energy in the file output as well as the weight
parameters that were used. Also use the weight parameter set that
include the variable deletion model.
prospect2modeller -t test out.d1ash__.xml d1jboa_ my_modeller_dirConvert the alignment to d1jboa_ in
out.d1ash__.xml to the files needed to do a modeller run inside the
directory my_modeller_dir. Name the modeller target 'test'.
Prospect Command Line Arguments
-fssp <file path> : The SSP file of the target to be threaded
-ftemplate <file path> : The xml file of the template
the target is to be threaded against
-frestart <file path> : The output xml file to resume
a set of threadings from
-zscore : Calculate and output the Zscore
-zscore_full : Calculate and output the Zscore Full
-ainteger : Use the integer programming methods
to solve threading problems
-fphd : PHD formated secondary structure prediction input
-ffreq : Freq (text version of a chk file) formatted input
-fssp : SSP secondary structure prediction formatted input
-frestart : XML file to resume threading list
-fdist : Distance definitions (test code)
-fconf : Configuration file (expired)
-o : Output file path
-qname : Define target name
-ftlist : File listing the templates to be threaded against
-ftemplate : Specific template file to be threaded against
-tname : Find template in path by name
-help : Print instructions
-zscore : Calculate the zscore
-zscore_full : Calculate the zscore using the 'full' calculation (rethread shuffled decoys using dynamic programming)
-zscore_energy : Calculate the zscores of the individual energies
-zcycles : Number of samples to try with zscore calculations
-filter : Attempt to shrink the alignment problem by filtering out bad places to put cores (TEST CODE)
-noloop : Don't include loop regions in calculating the energy against the template (TEST CODE)
-adynamic : Use the dynamic programming algorithm to calculate the alignment (DEFAULT)
-ainteger : Use the integer programming algorithm to calculate the alignment
-ainteger_div : (TEST CODE)
-ainteger_gtmip : (TEST CODE)
-ainteger_sift : (TEST CODE)
-ainteger_coredel : Use the integer programming algorithm to calculate the alignment, with an expanded model that allows for core deletions
-atreedecomp : (TEST CODE)
-fork : Fork before doing the threading, thus if there is a crash, the main app won't die
-ncpus : Expand out to multiple CPUs, using pThreads
-max_mem : (TEST CODE)
-wmutation : Adjust the weight parameter of the Mutation energy
-wmutationlog : Adjust the weight parameter of the Mutation Log energy
-wsingleton : Adjust the weight parameter of the Singleton energy
-wsec_struct : Adjust the weight parameter of the Secondary Structure energy
-wgap : Adjust the weight parameter of the Gap energy
-wcoredel : Adjust the weight parameter of the Core Deletion energy
-wtwobody : Adjust the weight parameter of the Twobody cuttoff energy
-wthreebody : Adjust the weight parameter of the threebody energy (TEST CODE)
-wfourbody : Adjust the weight parameter of the fourbody energy (TEST CODE)
-wgonnet : Adjust the weight parameter of the Gonnet energy
-wtwobody_frozen : Adjust the weight parameter of the Frozen twobody energy
-wdfire : Adjust the weight parameter of the Dfire distance dependent twobody energy
-wdistconst : Adjust the weight parameter of the Distance constraint energy (TEST CODE)
-simple_twobody : Use a simple assignment comparison, rather then the matrix mulitple for calculating twobody score (TEST CODE)
-oweights : Include the Energy Weight parameters used to produce the alignment in the output file
-otimeinfo : (BROKEN)
-otemplate_loc : (EXPIRED)
-ochk :
-opdb : (EXPIRED)
-obackbone : (TEST CODE)
-v : Verbosity level 1
-vv : Verbosity level 2
-vvv : Verbosity level 3
Prospect Convert
Prospect Convert has the ability to parse equations based on values of the different features for protein alignments. This feature is primarily for integrating with machine learning based fold recognition techniques.
prospect_convert out.d1ash__.xml -lformat "%sn %sr %sz"
List all of the templates with name raw
score and zscore.
List of Feature names
sa - score average
sr - score raw
srz - score raw zscore
sz - score z
sf - score zfull
sn - score neural network
sel - score z mutation log
se1 - score z singleton
ses - score z secondary structure
se2 - score z twobody
sed - score z dfire
su0 - score user defined
su1 - score user defined
su2 - score user defined
su3 - score user defined
szm - score z mean
szs - score z sd
sfm - score zfull mean
sfs - score zfull sd
selm - score z mutation log mean
sels - score z mutation log sd
se1m - score z singleton mean
se1s - score z singleton sd
sesm - score z secondary structure mean
sess - score z secondary structure sd
se2m - score z twobody mean
se2s - score z twobody sd
sedm - score z dfire mean
seds - score z dfire sd
//features
ft - template length
fq - target length
fc - core count
fm - core residue count
fr - length ratio (fq/ft)
fR - length ratio (ft/fq)
//alignment info
an - num align res
aN - % align res
ac - num align core
aC - % align core
ai - num align ident res
aI - % align ident res
am - num align core res
aM - % align core res
a2c - num contact pairs
a2d - num half contact pairs
af - num target alignment fragments
//weights
wm - mutation weight
wl - mutation log weight
w1 - singleton weight
ws - secondary struct weight
w2 - twobody weight
wd - dfire weight
wo - gap open weight
we - gap const weight
wt - coredel weight
wa - template singledel weight
wi - template singleins weight
wqa - target singledel weight
wqi - target singleins weight
wc - contactdel weight
//energy
em - mutation energy
el - mutation log energy
e1 - singleton energy
es - secondary struct energy
e2 - twobody energy
ed - dfire energy
et - coredel energy
eo - gap open energy
ee - gap const energy
ea - template singledel energy
ei - template singleins energy
eqa - target singledel energy
eqi - target singleins energy
ec - contactdel energy
emw - mutation energy * weight
elw - mutation log energy * weight
e1w - singleton energy * weight
esw - secondary struct energy * weight
e2w - twobody energy * weight
edw - dfire energy * weight
etw - coredel energy * weight
eow - gap open energy * weight
ecw - gap const energy * weight
eaw - template singledel energy * weight
eiw - template singleins energy * weight
eqaw - query singledel energy * weight
eqiw - query singleins energy * weight
ecw - contactdel energy * weight