Telomere Science Library

Publications, Presentations, and Videos
about the Nobel-Prize Winning Science of Telomere Biology

Towards a piRNA prediction using multiple kernel fusion and support vector machine.

Authors: Jocelyn Brayet, Farida Zehraoui, Laurence Jeanson-Leh, David Israeli, Fariza Tahi
Published: 08/27/2014, Bioinformatics (Oxford, England)

Motivation

Piwi-interacting RNA (piRNA) is the most recently discovered and least investigated class of Argonaute/Piwi protein-interacting small non-coding RNAs. piRNAs are best known for protecting the genome from invasive transposable elements, but recent discoveries suggest that they are also involved in the pathophysiology of diseases such as cancer. Identifying them is therefore an important task that calls for computational methods. However, the lack of conserved piRNA sequences and structural elements makes this identification challenging.

Results

In the present study, we propose a new modular and extensible machine learning method for piRNA identification, based on multiple kernels and a support vector machine (SVM) classifier. Very few piRNA features are known to date. The multiple-kernel approach allows piRNA features, which can be heterogeneous, to be edited, added or removed in a modular manner according to their relevance in a given species. Our algorithm combines previously identified features [sequence features (k-mer motifs and a uridine at the first position) and a piRNA cluster feature] with a new telomere/centromere vicinity feature. Because these features are heterogeneous, kernels are used to unify their representation. The proposed algorithm, named piRPred, gives promising results on Drosophila and human data and outperforms previously published piRNA identification algorithms.
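
The idea of fusing heterogeneous features through kernels can be illustrated with a minimal sketch. This is not the piRPred implementation: the function names, the cosine-normalized k-mer kernel, the binary first-uridine kernel, and the fixed weights are all assumptions chosen for illustration, and the piRNA cluster and telomere/centromere vicinity features are omitted because they require genomic coordinates. Each feature gets its own kernel, and a non-negative weighted sum of the kernels yields a single Gram matrix that a kernel classifier such as an SVM could consume.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq, k=3):
    """Count overlapping k-mers in an RNA sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(a, b, k=3):
    """Cosine-normalized k-mer spectrum kernel (illustrative choice)."""
    ca, cb = kmer_counts(a, k), kmer_counts(b, k)
    dot = sum(ca[m] * cb[m] for m in ca)
    norm = sqrt(sum(v * v for v in ca.values()) *
                sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


def first_u_kernel(a, b):
    """1.0 when both sequences agree on having (or lacking) a uridine
    at the first position, else 0.0."""
    return 1.0 if (a[:1] == "U") == (b[:1] == "U") else 0.0


def fused_gram(seqs, weights=(0.7, 0.3), k=3):
    """Weighted sum of the per-feature kernels over all sequence pairs.

    The weights are hypothetical; a non-negative weighted sum of valid
    kernels is itself a valid kernel, which is what makes adding or
    removing a feature a local change.
    """
    w_kmer, w_u = weights
    return [[w_kmer * spectrum_kernel(a, b, k) + w_u * first_u_kernel(a, b)
             for b in seqs]
            for a in seqs]


if __name__ == "__main__":
    # Toy sequences, not real piRNA data.
    toy = ["UGAGGUAGUAGG", "UGAGGUAGUUGG", "ACGCGCGCACGC"]
    for row in fused_gram(toy):
        print([round(v, 3) for v in row])
```

Adding another feature in this sketch would mean writing one more kernel function and one more weight, which mirrors the modularity described above. The resulting matrix could be handed to any SVM implementation that accepts a precomputed kernel.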

Availability and Implementation

piRPred is freely available to non-commercial users on our Web server EvryRNA http://EvryRNA.ibisc.univ-evry.fr.

© The Author 2014. Published by Oxford University Press.