Presentation: 2024 ND EPSCoR Annual conference
November 21, 2024, Alerus Center, Grand Forks, North Dakota
Machine learning approaches for classifying temporal patterns and tracing causality in RNA-Seq datasets.
Nimasha
Samarawickrama
Doctoral Student
University of North Dakota
Co-authors: Yen Lee Loh, Associate Professor, Department of Physics and Astrophysics, UND; Manu Manu, Associate Professor, Department of Biology, UND; Andrea Repele
Session
Concurrent Presentation Session 3
A complete understanding of cell-fate choice requires inferring the underlying gene regulatory network from detailed time-series gene expression data. Machine learning algorithms are valuable for identifying features or patterns in large datasets. We have analyzed a genome-wide RNA-Seq time series dataset from an in-vitro macrophage-neutrophil differentiation process. We performed feature extraction using non-negative matrix factorization (NMF), which is an unsupervised machine learning technique. We have implemented and demonstrated an NMF algorithm on expression levels of 36255 genes at 29 timepoints under two experimental conditions that favor differentiation into macrophages and neutrophils respectively. The algorithm produces a lower-dimensional representation of the dataset in terms of ten groups of genes (behavior) classified according to their gene expression time dependence. We performed Gene Ontology analysis using a functional annotation clustering tool to determine the biological functions of genes within each behavior (metagene) identified by the NMF algorithm.
The ND-ACES NSF Track-1 cooperative agreement is a federal-state partnership to manage a comprehensive research development plan. ND EPSCoR manages the Track-1 award. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation. Current funding is provided by the State of North Dakota and NSF EPSCoR Research Infrastructure Improvement Program Track-1 (RII Track-1) Cooperative Agreement Award OIA #1946202.