IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection - Université d'Évry Access content directly
Journal Articles Bioinformatics Year : 2018

IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection

Abstract

Motivation: Non-coding RNAs (ncRNAs) play important roles in many biological processes and are involved in many diseases. Their identification is an important task, and many tools exist in the literature for this purpose. However, almost all of them are focused on the discrimination of coding and ncRNAs without giving more biological insight. In this paper, we propose a new reliable method called IRSOM, based on a supervised Self-Organizing Map (SOM) with a rejection option, that overcomes these limitations. The rejection option in IRSOM improves the accuracy of the method and also allows identifing the ambiguous transcripts. Furthermore, with the visualization of the SOM, we analyze the rejected predictions and highlight the ambiguity of the transcripts. Results: IRSOM was tested on datasets of several species from different reigns, and shown better results compared to state-of-art. The accuracy of IRSOM is always greater than 0.95 for all the species with an average specificity of 0.98 and an average sensitivity of 0.99. Besides, IRSOM is fast (it takes around 254 s to analyze a dataset of 147 000 transcripts) and is able to handle very large datasets. Availability and implementation: IRSOM is implemented in Python and C++. It is available on our software platform EvryRNA (http://EvryRNA.ibisc.univ-evry.fr).

Dates and versions

hal-02864104 , version 1 (10-06-2020)

Identifiers

Cite

Ludovic Platon, Farida Zehraoui, Abdelhafid A. Bendahmane, Fariza Tahi. IRSOM, a reliable identifier of ncRNAs based on supervised self-organizing maps with rejection. Bioinformatics, 2018, 34 (17), pp.i620-i628. ⟨10.1093/bioinformatics/bty572⟩. ⟨hal-02864104⟩
59 View
0 Download

Altmetric

Share

Gmail Facebook X LinkedIn More