Published in

National Academy of Sciences, Proceedings of the National Academy of Sciences, 37(106), p. 15544-15548, 2009

DOI: 10.1073/pnas.0904100106

Links

Tools

Export citation

Search in Google Scholar

Protein quantification across hundreds of experimental conditions

Journal article published in 2009 by Zia Khan, Joshua S. Bloom ORCID, Benjamin A. Garcia, Mona Singh, Leonid Kruglyak
This paper was not found in any repository, but could be made available legally by the author.
This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Orange circle
Published version: archiving restricted
Data provided by SHERPA/RoMEO

Abstract

Quantitative studies of protein abundance rarely span more than a small number of experimental conditions and replicates. In contrast, quantitative studies of transcript abundance often span hundreds of experimental conditions and replicates. This situation exists, in part, because extracting quantitative data from large proteomics datasets is significantly more difficult than reading quantitative data from a gene expression microarray. To address this problem, we introduce two algorithmic advances in the processing of quantitative proteomics data. First, we use space-partitioning data structures to handle the large size of these datasets. Second, we introduce techniques that combine graph-theoretic algorithms with space-partitioning data structures to collect relative protein abundance data across hundreds of experimental conditions and replicates. We validate these algorithmic techniques by analyzing several datasets and computing both internal and external measures of quantification accuracy. We demonstrate the scalability of these techniques by applying them to a large dataset that comprises a total of 472 experimental conditions and replicates.

Beta version