Electronic Theses and Dissertations

End-to-end learning framework for circular RNA classiﬁcation from other long non-coding RNAs using multi-modal deep learning.

Mohamed Chaabane, University of LouisvilleFollow

Date on Master's Thesis/Doctoral Dissertation

5-2018

Document Type

Master's Thesis

Degree Name

M.S.

Department

Computer Engineering and Computer Science

Degree Program

Computer Science, MS

Committee Chair

Park, Juw Won

Committee Co-Chair (if applicable)

Rouchka, Eric

Committee Member

Rouchka, Eric

Committee Member

Chung, Donghoon

Author's Keywords

bioinformatics; circular RNA; machine learning; deep learning

Abstract

Over the past two decades, a circular form of RNA (circular RNA) produced from splicing mechanism has become the focus of scientiﬁc studies due to its major role as a microRNA (miR) ac tivity modulator and its association with various diseases including cancer. Therefore, the detection of circular RNAs is a vital operation for continued comprehension of their biogenesis and purpose. Prediction of circular RNA can be achieved by ﬁrst distinguishing non-coding RNAs from protein coding gene transcripts, separating short and long non-coding RNAs (lncRNAs), and ﬁnally pre dicting circular RNAs from other lncRNAs. However, available tools to distinguish circular RNAs from other lncRNAs have only reached 80% accuracy due to the diﬃculty of classifying circular RNAs from other lncRNAs. Therefore, the availability of a faster, more accurate machine learning method for the identiﬁcation of circular RNAs, which will take into account the speciﬁc features of circular RNA, is essential in the development of systematic annotation. Here we present an End to-End multimodal deep learning framework, our tool, to classify circular RNA from other lncRNA. It fuses a RCM descriptor, an ACNN-BLSTM sequence descriptor, and a conservation descriptor into high level abstraction descriptors, where the shared representations across diﬀerent modalities are integrated. The experiments show that our tool is not only faster compared to existing tools but also eclipses other tools by an over 12% increase in accuracy. Another interesting result found from analysis of a ACNN-BLSTM sequence descriptor is that circular RNA sequences share the characteristics of the coding sequences

Recommended Citation

Chaabane, Mohamed, "End-to-end learning framework for circular RNA classiﬁcation from other long non-coding RNAs using multi-modal deep learning." (2018). Electronic Theses and Dissertations. Paper 2954.
https://doi.org/10.18297/etd/2954

Download

Included in

Computer Engineering Commons

COinS

ThinkIR: The University of Louisville's Institutional Repository

Electronic Theses and Dissertations

End-to-end learning framework for circular RNA classiﬁcation from other long non-coding RNAs using multi-modal deep learning.

Date on Master's Thesis/Doctoral Dissertation

Document Type

Degree Name

Department

Degree Program

Committee Chair

Committee Co-Chair (if applicable)

Committee Member

Committee Member

Author's Keywords

Abstract

Recommended Citation

Included in

Search

Browse

Author Corner

Related Links

Contact:

ThinkIR: The University of Louisville's Institutional Repository

Electronic Theses and Dissertations

End-to-end learning framework for circular RNA classiﬁcation from other long non-coding RNAs using multi-modal deep learning.

Author

Date on Master's Thesis/Doctoral Dissertation

Document Type

Degree Name

Department

Degree Program

Committee Chair

Committee Co-Chair (if applicable)

Committee Member

Committee Member

Author's Keywords

Abstract

Recommended Citation

Included in

Share

Search

Browse

Author Corner

Related Links

Contact: