Mid-Michigan Symposium for Undergraduate Research Experiences 2023
Symposium by ForagerOne
    Skip navigation
  • arrow_back_ios
    Exit Event
  • Welcome Page
  • Presentations
  • Live Sessions
  • Login
  • Sign Up

Deep Learning-based Pipeline to Benchmark the Preprocessing of Single Cell Data



Voiceover

Presenter(s)

Divyalakshmi Varadha Rajan Prem Sudha

Presentation Number

2011

Abstract or Description

Several advances in the domain of single-cell transcriptomics have paved the pathway for the discovery of new cell types and a more comprehensive understanding of human diseases. However, one of the central challenges to be confronted with when handling scRNA-seq data is its significant noisy nature stemming from several technical factors, such as amplification bias, cell cycle effects, library size differences, and notably, a low RNA capture rate. Therefore, preprocessing of the training and testing datasets is a crucial first step in the analysis of scRNA-seq data. Not to mention that different downstream tasks require specific types of preprocessing methods, namely, Quality Control of datasets, Normalization of count matrices, Selection of Highly Variable genes, and Dimensionality Reduction. To ensure a seamless and resource-efficient experience for researchers while utilizing computational models to perform experiments on scRNA-seq data, we propose an extension to our current Python toolkit - DANCE: A Deep Learning Library and Benchmark Platform for Single-Cell Analysis - aimed at supporting deep learning models for analyzing single-cell gene expression at scale. This pipeline, currently in its developmental stage, serves as a culmination of the most widely utilized and crucial preprocessing functions/methods that would aid researchers in selecting the optimum combination of the aforementioned preprocessing functions to suit their computational models. This would be achieved by numerous iterations of the collected benchmark datasets and different available preprocessing methods that would analyze the performance of the user's single-cell analysis model.

Mentor

Yuying Xie

of 1
Current View
Current View

Enter the password to open this PDF file.

File name:

-

File size:

-

Title:

-

Author:

-

Subject:

-

Keywords:

-

Creation Date:

-

Modification Date:

-

Creator:

-

PDF Producer:

-

PDF Version:

-

Page Count:

-

Page Size:

-

Fast Web View:

-

Preparing document for printing…
0%

Comments

Daniyal Kabir Dar2 years ago
Great Presentation Divyalakshmi! I see that you have done some great work in developing a DANCE pipeline to streamline single cell data, but it would be great if you could talk about what deep learning models would achieve as in you mention that they would help to analyze single cell gene expression at scale, can you elaborate on how this would be achieved?
•
Symposium™ by ForagerOne © 2026
AboutContact UsTerms of ServicePrivacy Policy