Protein misprediction uncovered by new technique
A new bioinformatics tool can identify abnormal, incomplete and mispredicted protein annotations in public databases, and may help to correct them. The MisPred tool, described today in the open access journal BMC Bioinformatics, currently uses five principles to flag suspect proteins that are likely to be abnormal or mispredicted. László Patthy led the team from the Institute of Enzymology of the Hungarian Academy of Sciences, Budapest, that developed the new approach. He explained why it is needed: "Recent studies have shown that a significant proportion of eukaryotic genes are mispredicted at the transcript level. As the MisPred routines are able to detect many of these errors, and may aid in their correction, we suggest that it may significantly improve the quality of protein sequence data based on gene predictions". The MisPred approach promises to save considerable time and effort that would otherwise be spent investigating erroneously predicted genes.
The MisPred approach rates annotations according to five dogmas:
- Extracellular or transmembrane proteins must have appropriate secretory signals.
- A protein with intra- and extra-cellular parts must have a transmembrane segment.
- Extracellular and nuclear domains must not occur in a single protein.
- The number of amino acid residues in closely related members of a globular domain family must fall into a relatively narrow range.
- A protein must be encoded by exons located on a single chromosome.
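The five rules above lend themselves to simple rule-based checks. The following is a minimal Python sketch of that idea, not the MisPred implementation: the `Annotation` record, its field names and the `violated_rules` function are hypothetical, and the sketch assumes that signal peptides, transmembrane segments, domain locations and family length ranges have already been predicted by other tools.

```python
from dataclasses import dataclass
from typing import List, Set, Tuple


@dataclass
class Annotation:
    """Hypothetical summary of one predicted protein (not MisPred's data model)."""
    has_secretory_signal: bool
    has_transmembrane_segment: bool
    # Assumed location labels: "extracellular", "intracellular", "nuclear"
    domain_locations: Set[str]
    # One (observed_length, family_min, family_max) tuple per globular domain
    domain_lengths: List[Tuple[int, int, int]]
    # Chromosomes on which the encoding exons are located
    exon_chromosomes: Set[str]


def violated_rules(a: Annotation) -> List[int]:
    """Return the numbers (1-5) of the rules that this annotation breaks."""
    violations = []
    extracellular = "extracellular" in a.domain_locations
    intracellular = "intracellular" in a.domain_locations
    nuclear = "nuclear" in a.domain_locations

    # Rule 1: extracellular or transmembrane proteins need a secretory signal.
    if (extracellular or a.has_transmembrane_segment) and not a.has_secretory_signal:
        violations.append(1)
    # Rule 2: intra- plus extracellular parts require a transmembrane segment.
    if extracellular and intracellular and not a.has_transmembrane_segment:
        violations.append(2)
    # Rule 3: extracellular and nuclear domains must not co-occur.
    if extracellular and nuclear:
        violations.append(3)
    # Rule 4: each domain length must fall within its family's range.
    if any(not (lo <= n <= hi) for n, lo, hi in a.domain_lengths):
        violations.append(4)
    # Rule 5: all exons must lie on a single chromosome.
    if len(a.exon_chromosomes) > 1:
        violations.append(5)
    return violations


# Made-up example: an extracellular protein with no signal peptide,
# a truncated domain, and exons assigned to two chromosomes.
suspect = Annotation(
    has_secretory_signal=False,
    has_transmembrane_segment=False,
    domain_locations={"extracellular"},
    domain_lengths=[(60, 100, 120)],
    exon_chromosomes={"chr1", "chr7"},
)
print(violated_rules(suspect))  # prints [1, 4, 5]
```

An annotation that breaks any rule would be flagged as suspect rather than rejected outright, since the rules have known exceptions.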
There are some exceptions to these rules, as Patthy points out: "Some secreted proteins may truly lack secretory signal peptides since they are subject to leaderless protein secretion. Similarly, it cannot be excluded at present that transchromosomal chimeras can be formed and may have normal physiological functions. Nevertheless, the fact that MisPred analyses of protein sequences of the Swiss-Prot database identified very few such exceptions indicates that the rules of MisPred are generally valid".
The authors found that the absence of expected signal peptides and violation of domain integrity account for the majority of mispredictions. They also note that, "Interestingly, even the manually curated UniProtKB/Swiss-Prot dataset is contaminated with mispredicted or abnormal proteins, although to a much lesser extent than UniProtKB/TrEMBL or the EnsEMBL or GNOMON predicted entries".
Source: BioMed Central