./dripTrain.py [options] --psm-library <PSM library>
--spectra <spectra file>
DripTrain learns the model parameters for DRIP via the expectation-maximization algorithm utilizing a library of high-confidence PSMs (such as this PSM library and the corresponding set of spectra). The learned parameters may then be used in dripSearch. If you use DRIP trained parameters in your research, please cite:
John T. Halloran, Jeff A. Bilmes, and William S. Noble. "Learning Peptide-Spectrum Alignment Models for Tandem Mass Spectrometry". Thirtieth Conference on Uncertainty in Artificial Intelligence (UAI 2014). AUAI, Quebic City, Quebec Canada, July 2014.
--psm-library <PSM library>
– Collection of high-confidence
peptide-spectrum matches (PSMs). File must be in
tab-delimited format with
fields Peptide
, Scan
,
and Charge
.
--spectra <spectra file>
– Corresponding ms2 spectra for
the PSM library.The program writes the learned parameters
to dripLearned.params
by default. The name of the
output file can be set by the user using
the --dripTrain-file
option.
--dripTrain-file <string>
– Name of GMTK output file. Default = dripLearned.params
.
--output-mean-file <string>
– Name of output file for learned Gaussian means. Default = dripLearned.means
.
--output-covar-file <string>
– Name of output file for learned Gaussian covariances. Default = dripLearned.covars
.
--high-res-ms2 <T|F>
–
boolean, whether the search is over high-res ms2 (high-high)
spectra. When this parameter is true, DRIP uses the real valued
masses of candidate peptides as its Gaussian means; for low-res
ms2 (low-low or high-low), the observed m/z measurements are much
less accurate so these Gaussian means are learned using training
data. Default = False
.
--mods-spec <string>
–
The general form of a modification specification has three
components, as exemplified by 1STY+79.966331.C+57.02146
.
--cterm-peptide-mods-spec <string>
–
Specify peptide c-terminal modifications. See
nterm-peptide-mods-spec for syntax. Default
= <empty>
.
--nterm-peptide-mods-spec <string>
–
Specify peptide n-terminal modifications. Like --mods-spec, this
specification has three components, but with a slightly different
syntax. The max_per_peptide can
be either "1", in which case it defines a variable terminal
modification, or missing, in which case the modification is
static. The residues field
indicates which amino acids are subject to the modification, with
the reside X corresponding to
any amino acid. Finally, added_mass is defined as before. Default
= <empty>
.
output.params
, run in the download directory: