Protein Pept Lett


Title: Nglyc: A Random Forest Method for Prediction of N-Glycosylation Sites in Eukaryotic Protein Sequence
Author(s): Pugalenthi G; Nithya V; Chou KC; Archunan G
Address: "Pheromone Technology Laboratory, Department of Animal Science, Bharathidasan University, Tiruchirappalli-620024, India. Department of Animal Health Management, Alagappa University, Karaikudi-630003, India. Gordon Life Science Institute, San Diego, CA 92130, United States"
Journal Title: Protein Pept Lett
Year: 2020
Volume: 27
Issue: 3
Page Number: 178-186
DOI: 10.2174/0929866526666191002111404
ISSN/ISBN: 1875-5305 (Electronic) 0929-8665 (Linking)
Abstract: "BACKGROUND: N-Glycosylation is one of the most important post-translational mechanisms in eukaryotes. N-glycosylation predominantly occurs in the N-X-[S/T] sequon, where X is any amino acid other than proline. However, not all N-X-[S/T] sequons in proteins are glycosylated. Therefore, accurate prediction of N-glycosylation sites is essential to understand the N-glycosylation mechanism. OBJECTIVE: In this article, our motivation is to develop a computational method to predict N-glycosylation sites in eukaryotic protein sequences. METHODS: In this article, we report a random forest method, Nglyc, to predict N-glycosylation sites from protein sequence, using 315 sequence features. The method was trained using a dataset of 600 N-glycosylation sites and 600 non-glycosylation sites and tested on a dataset containing 295 N-glycosylation sites and 253 non-glycosylation sites. Nglyc prediction was compared with the NetNGlyc, EnsembleGly and GPP methods. Further, the performance of Nglyc was evaluated using human and mouse N-glycosylation sites. RESULT: The Nglyc method achieved an overall training accuracy of 0.8033 with all 315 features. Performance comparison with the NetNGlyc, EnsembleGly and GPP methods shows that Nglyc performs better than the other methods, with high sensitivity and specificity. CONCLUSION: Our method achieved an overall accuracy of 0.8248 with 0.8305 sensitivity and 0.8182 specificity. The comparison study shows that our method performs better than the other methods. The applicability and success of our method were further evaluated using human and mouse N-glycosylation sites. The Nglyc method is freely available at https://github.com/bioinformaticsML/ Ngly"
Keywords: "Animals; Computational Biology/*methods; Databases, Protein; Glycosylation; Humans; Mice; Proteins/*chemistry; Sequence Analysis, Protein/*methods; Software; N-glycosylation; glycoproteins; glycosites; machine learning method; protein function; protein sequence"
Notes: "Medline. Pugalenthi, Ganesan; Nithya, Varadharaju; Chou, Kuo-Chen; Archunan, Govindaraju. eng. Netherlands. 2019/10/03. Protein Pept Lett. 2020; 27(3):178-186. doi: 10.2174/0929866526666191002111404"
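
The abstract describes the N-X-[S/T] sequon, where X is any amino acid other than proline, as the motif in which N-glycosylation predominantly occurs. The minimal sketch below shows how candidate sequons could be located in a protein sequence. It is not the Nglyc method and does not predict whether a site is glycosylated. The function name `find_sequons` and the demo sequence are hypothetical, chosen only for illustration.

```python
import re

# Lookahead pattern so that overlapping sequons (e.g. in "NNTS") are all found.
# N, then any residue except P (proline), then S or T.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(sequence):
    """Return 1-based positions of the Asn (N) in each N-X-[S/T] sequon.

    Assumption: `sequence` is a one-letter amino acid string; it is
    upper-cased, but residues are not otherwise validated.
    """
    seq = sequence.upper()
    return [match.start() + 1 for match in SEQUON.finditer(seq)]


if __name__ == "__main__":
    demo = "MKNGTLNPSANNTSQ"  # made-up sequence, not taken from the paper
    # N-G-T at 3 is a sequon; N-P-S at 7 is excluded because X is proline;
    # N-N-T at 11 and N-T-S at 12 overlap and are both reported.
    print(find_sequons(demo))  # [3, 11, 12]
```

As the abstract notes, not every sequon is glycosylated, so positions returned by a scan like this are only candidates that a classifier such as a random forest would then score.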

 
 
Citation: El-Sayed AM 2024. The Pherobase: Database of Pheromones and Semiochemicals. <http://www.pherobase.com>.
© 2003-2024 The Pherobase - Extensive Database of Pheromones and Semiochemicals. Ashraf M. El-Sayed.
Page created on 16-11-2024