Distribution of short oligopeptides in a dataset of selected polypeptides

: MOJ Proteomics & Bioinformatics
: Varun Ravishankar,¹ Natasha Kelkar,^1,2 Nachiket Pathak,¹ Rutuj Kolhe,¹ Onkar Ghuge,¹ Shantanu Madiwale,¹ Dhanashree Deore,¹Anupam Saraph,³Milner Kumar,⁴ Anil Gore,⁵SP Modak^6,7

PDF Full Text

Abstract

DNAbases act as alphabets and nucleotide triplets, each representing an amino acid, or a punctuation mark, dictate the order and frequency of occurrence for different amino acids in the newly synthesized polypeptide. The presence of the triplet code in DNA raises the possibility that there may be another code or linguistic formulation composed of 20 amino acids as different alphabets dictating the frequency and the serial order in which 20 amino acids are arranged on different polypeptide strings. With this in mind, we have created a database of di-, tri-, tetra- and pentapeptides and examined the distribution and frequency of occurrence of different types of short oligopeptides in a set of 51,865 polypeptide sequences selected from the Swiss Prot database.

Keywords

di-, tri-, tetra- and pentapeptides, oligopeptide matrices, forbidden oligopeptides, clustering algorithm

Distribution of short oligopeptides in a dataset of selected polypeptides

Abstract

Keywords

Quick Links

Related Journals