The availability of complete genome sequences makes it important to develop new approaches for classifying large-scale data sets and for making biological insights easier to extract. Here, we propose an approach for classifying complete proteomes and protein sets based on how their proteins are distributed over a few basic attributes. We demonstrate the usefulness of this approach by determining protein distributions in terms of two attributes: protein length (L) and protein intrinsic disorder content (ID). The protein distributions based on L and ID are surveyed for the proteomes of representative organisms and for protein sets from the three domains of life. Two-dimensional maps (designated here as fingerprints) are then constructed from the protein distribution densities in the LD space defined by ln(L) and ID. The fingerprints of different organisms and protein sets are found to be distinct from one another, and they can therefore be used for comparative studies. As a test case, phylogenetic trees have been constructed from the protein distribution densities in the proteome fingerprints of organisms, without performing any protein sequence comparison or alignment. The resulting phylogenetic trees are biologically meaningful, demonstrating that protein distributions in the LD space may serve as unique phylogenetic signals of organisms at the proteome level.
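The fingerprint idea described above can be illustrated with a minimal sketch. It assumes each protein is given as a (length, disorder fraction) pair with the disorder fraction in [0, 1]. The function names, the bin counts, the ln(L) range, and the Euclidean distance are all illustrative assumptions, not the authors' actual parameters or distance measure, which the abstract does not specify.

```python
import math


def fingerprint(proteins, n_len_bins=20, n_id_bins=20, ln_len_range=(3.0, 9.0)):
    """Return a normalized 2D density grid over (ln(L), ID).

    proteins: iterable of (length, disorder_fraction) pairs.
    Bin counts and the ln(L) range are assumed values for illustration.
    """
    proteins = list(proteins)
    if not proteins:
        raise ValueError("at least one protein is required")
    lo, hi = ln_len_range
    grid = [[0.0] * n_id_bins for _ in range(n_len_bins)]
    for length, disorder in proteins:
        # Map ln(L) and ID to bin indices, clamping values outside the range.
        x = (math.log(length) - lo) / (hi - lo)
        i = min(max(int(x * n_len_bins), 0), n_len_bins - 1)
        j = min(max(int(disorder * n_id_bins), 0), n_id_bins - 1)
        grid[i][j] += 1.0
    total = float(len(proteins))
    # Normalize counts so the grid sums to 1 (a distribution density).
    return [[count / total for count in row] for row in grid]


def fingerprint_distance(a, b):
    """Euclidean distance between two fingerprints of equal shape (an assumed metric)."""
    return math.sqrt(
        sum((x - y) ** 2 for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))
    )
```

Pairwise distances computed this way for a set of proteomes could then be passed to any standard distance-based tree-building method. Which method the authors used is not stated in the abstract.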
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2tgnCbp