|国家预印本平台
首页|Multi-sample Full-length Transcriptome Analysis of 22 Breast Cancer Clinical Specimens with Long-Read Sequencing

Multi-sample Full-length Transcriptome Analysis of 22 Breast Cancer Clinical Specimens with Long-Read Sequencing

Multi-sample Full-length Transcriptome Analysis of 22 Breast Cancer Clinical Specimens with Long-Read Sequencing

来源:bioRxiv_logobioRxiv
英文摘要

Abstract Although transcriptome alteration is considered as one of the essential drivers of carcinogenesis, conventional short-read RNAseq technology has limited researchers from directly exploring full-length transcripts, only focusing on individual splice sites. We developed a pipeline for Multi-Sample long-read Transcriptome Assembly, MuSTA, and showed through simulations that it enables construction of transcriptome from the transcripts expressed in target samples and more accurate evaluation of transcript usage. We applied it to 22 breast cancer clinical specimens to successfully acquire cohort-wide full-length transcriptome from long-read RNAseq data. By comparing isoform existence and expression between estrogen receptor positive and triple-negative subtypes, we obtained a comprehensive set of subtype-specific isoforms and differentially used isoforms which consisted of both known and unannotated isoforms. We have also found that exon-intron structure of fusion transcripts tends to depend on their genomic regions, and have found three-piece fusion transcripts that were transcribed from complex structural rearrangements. For example, a three-piece fusion transcript resulted in aberrant expression of an endogenous retroviral gene, ERVFRD-1, which is normally expressed exclusively in placenta and supposed to protect fetus from maternal rejection, and expression of which were increased in several TCGA samples with ERVFRD-1 fusions. Our analyses of real clinical specimens and simulated data provide direct evidence that full-length transcript sequencing in multiple samples can add to our understanding of cancer biology and genomics in general.

Inoue Satoshi、Maeda Noriko、Namba Shinichi、Ueno Toshihide、Shiraishi Yuichi、Tanaka Yosuke、Kawazu Masahito、Kojima Shinya、Mano Hiroyuki、Ogawa Tomoko、Kishigami Fumishi、Hazama Shoichi

Division of Cellular Signaling, Research Institute, National Cancer CenterDepartment of Gastroenterological, Breast and Endocrine Surgery, Yamaguchi University Graduate School of MedicineDivision of Cellular Signaling, Research Institute, National Cancer CenterDivision of Cellular Signaling, Research Institute, National Cancer CenterDivision of Genome Analysis Platform Development, Research Institute, National Cancer CenterDivision of Cellular Signaling, Research Institute, National Cancer CenterDivision of Cellular Signaling, Research Institute, National Cancer CenterDivision of Cellular Signaling, Research Institute, National Cancer CenterDivision of Cellular Signaling, Research Institute, National Cancer CenterDepartment of Breast Surgery, Mie University HospitalDivision of Cellular Signaling, Research Institute, National Cancer CenterDepartment of Translational Research and Developmental Therapeutics against Cancer, Yamaguchi University Graduate School of Medicine

10.1101/2020.07.15.199851

医学研究方法肿瘤学分子生物学

Inoue Satoshi,Maeda Noriko,Namba Shinichi,Ueno Toshihide,Shiraishi Yuichi,Tanaka Yosuke,Kawazu Masahito,Kojima Shinya,Mano Hiroyuki,Ogawa Tomoko,Kishigami Fumishi,Hazama Shoichi.Multi-sample Full-length Transcriptome Analysis of 22 Breast Cancer Clinical Specimens with Long-Read Sequencing[EB/OL].(2025-03-28)[2025-05-02].https://www.biorxiv.org/content/10.1101/2020.07.15.199851.点此复制

评论