GPMDB Video Tutorials

The Global Proteome Machine Organization

The GPMDB contains thousands of data sets contributed by researchers around the world. The large volume of data and biological complexity means that it can be a little intimidating for first time users to figure out how to extract information form the database. This page contains our video tutorials for using the system, presented as a set of case studies and discussions.

Adding a proteome to GPM Personal Editions

This video describes the steps required to add a new proteome/FASTA sequence file to GPM-PE so that it can be used from the GPM Manager application. The same steps apply to any install of GPM.

Sanibel conference, 2011.01.23 12:00 EST

This video is a dramatization of a talk given by Ron Beavis at the ASMS Sanibel conference in 2011, held in St. Petersberg, FL. The talk explains some of the objective measures that we use at GPM to try to understand the quality of data sets for use in annotated spectrum library creation. These same measures can be used to check search engine results, particular when changes are made to the underlying algorithms. A PDF version of the slides for this talk can be downloaded here.

Discussion: Knowledgebases in PTM Analysis

This video discusses the mechanism that X! Tandem and P3 use to change the potential modifications tested on a protein-by-protein basis. The video refers to the following references:

Biemann, K. and Martin, S. A. (1987) Mass spectrometric determination of the amino acid sequence of peptides and proteins. Mass Spectrom. Rev., 6, 1-76.
Boeckmann, B., et al. (2005) Protein variety and functional diversity: Swiss-Prot annotation in its biological context. Compt. Rend. Bio., 328, 882-899.
Craig R., et al. (2006) Using annotated peptide mass spectrum libraries for protein identification. J Proteome Res, 5, 1843-1849.
Craig, R. and Beavis, R.C. (2004) TANDEM: matching proteins with mass spectra, Bioinformatics, 20, 1466-7.
Eng, J., et al. (1994) An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom., 5, 976-989.
Fenyö, D. (1999) The Biopolymer Markup Language, Bioinformatics, 15, 339-40.
Lederberg, J., et al. (1969) Applications of artificial intelligence for chemical inference. I. The number of possible organic compounds. Acyclic structures contain-ing C, H, O, and N. J. Am. Chem. Soc., 91, 2973-2976.
Slebos, R.J.C. et al. (2008) Evaluation of Strong Cation Exchange versus Isoelectric Focusing of Peptides for Multidimensional Liquid Chromatography-Tandem Mass Spectrometry. J. Proteome Res., 7:5286-5294.
Smedley, D. et al. (2009) BioMart - biological queries made easy. BMC Genomics, 10:22.
Tabb, D.L., et al. (2007) MyriMatch: Highly Accurate Tandem Mass Spectral Peptide Identification by Multivariate Hypergeometric Analysis. J. Proteome Res., 6:654-661.

Case: Finding phosphorylation sites using GPMDB and pSYT

This video describes the steps necessary to find the observed phosphorylation sites for a particular protein. The description is in the form of a dialogue between a biomedical researcher (as played by HRM Queen Elizabeth II) and a GPMDB power user (Beavo the clown). Together, they get the researcher started obtaining the phosphorylation sites for her protein of interest C21orf66 and its corresponding pSYT entry.

Discussion: Interpreting details found in high-throughput data

Two characters discuss the issues associated with discovering post-translational modifications and alternately spliced protein sequences from high throughput data and give some advice. The script was adapted from our list of suggested interpretation guidelines.