|国家预印本平台
首页|A Bayesian framework for inter-cellular information sharing improves dscRNA-seq quantification

A Bayesian framework for inter-cellular information sharing improves dscRNA-seq quantification

A Bayesian framework for inter-cellular information sharing improves dscRNA-seq quantification

来源:bioRxiv_logobioRxiv
英文摘要

Abstract MotivationDroplet based single cell RNA-seq (dscRNA-seq) data is being generated at an unprecedented pace, and the accurate estimation of gene level abundances for each cell is a crucial first step in most dscRNA-seq analyses. When preprocessing the raw dscRNA-seq data to generate a count matrix, care must be taken to account for the potentially large number of multi-mapping locations per read. The sparsity of dscRNA-seq data, and the strong 3’ sampling bias, makes it difficult to disambiguate cases where there is no uniquely mapping read to any of the candidate target genes. ResultsWe introduce a Bayesian framework for information sharing across cells within a sample, or across multiple modalities of data using the same sample, to improve gene quantification estimates for dscRNA-seq data. We use an anchor-based approach to connect cells with similar gene expression patterns, and learn informative, empirical priors which we provide to alevin’s gene multi-mapping resolution algorithm. This improves the quantification estimates for genes with no uniquely mapping reads (i.e. when there is no unique intra-cellular information). We show our new model improves the per cell gene level estimates and provides a principled framework for information sharing across multiple modalities. We test our method on a combination of simulated and real datasets under various setups. AvailabilityThe information sharing model is included in alevin and is implemented in C++14. It is available as open-source software, under GPL v3, at https://github.com/COMBINE-lab/salmon as of version 1.1.0. Contactasrivastava@cs.stonybrook.edu, rob@cs.umd.edu

Patro Rob、Srivastava Avi、Sarkar Hirak、Malik Laraib

Computer Science Department, University of MarylandDepartment of Computer Science, Stony Brook University, Stony BrookComputer Science Department, University of MarylandDepartment of Computer Science, Stony Brook University, Stony Brook

10.1101/2020.04.10.035899

生物科学研究方法、生物科学研究技术生物化学生物物理学

Patro Rob,Srivastava Avi,Sarkar Hirak,Malik Laraib.A Bayesian framework for inter-cellular information sharing improves dscRNA-seq quantification[EB/OL].(2025-03-28)[2025-05-26].https://www.biorxiv.org/content/10.1101/2020.04.10.035899.点此复制

评论