TY - JOUR
T1 - Misexpression of inactive genes in whole blood is associated with nearby rare structural variants
AU - Vanderstichele, Thomas
AU - Burnham, Katie L.
AU - de Klein, Niek
AU - Tardaguila, Manuel
AU - Howell, Brittany
AU - Walter, Klaudia
AU - Kundu, Kousik
AU - Koeppel, Jonas
AU - Lee, Wanseon
AU - Tokolyi, Alex
AU - Persyn, Elodie
AU - Nath, Artika P.
AU - Marten, Jonathan
AU - Petrovski, Slavé
AU - Roberts, David J.
AU - Di Angelantonio, Emanuele
AU - Danesh, John
AU - Berton, Alix
AU - Platt, Adam
AU - Butterworth, Adam S.
AU - Soranzo, Nicole
AU - Parts, Leopold
AU - Inouye, Michael
AU - Paul, Dirk S.
AU - Davenport, Emma E.
N1 - Publisher Copyright:
© 2024 The Authors
PY - 2024/8/8
Y1 - 2024/8/8
N2 - Gene misexpression is the aberrant transcription of a gene in a context where it is usually inactive. Despite its known pathological consequences in specific rare diseases, we have a limited understanding of its wider prevalence and mechanisms in humans. To address this, we analyzed gene misexpression in 4,568 whole-blood bulk RNA sequencing samples from INTERVAL study blood donors. We found that while individual misexpression events occur rarely, in aggregate they were found in almost all samples and a third of inactive protein-coding genes. Using 2,821 paired whole-genome and RNA sequencing samples, we identified that misexpression events are enriched in cis for rare structural variants. We established putative mechanisms through which a subset of SVs lead to gene misexpression, including transcriptional readthrough, transcript fusions, and gene inversion. Overall, we develop misexpression as a type of transcriptomic outlier analysis and extend our understanding of the variety of mechanisms by which genetic variants can influence gene expression.
AB - Gene misexpression is the aberrant transcription of a gene in a context where it is usually inactive. Despite its known pathological consequences in specific rare diseases, we have a limited understanding of its wider prevalence and mechanisms in humans. To address this, we analyzed gene misexpression in 4,568 whole-blood bulk RNA sequencing samples from INTERVAL study blood donors. We found that while individual misexpression events occur rarely, in aggregate they were found in almost all samples and a third of inactive protein-coding genes. Using 2,821 paired whole-genome and RNA sequencing samples, we identified that misexpression events are enriched in cis for rare structural variants. We established putative mechanisms through which a subset of SVs lead to gene misexpression, including transcriptional readthrough, transcript fusions, and gene inversion. Overall, we develop misexpression as a type of transcriptomic outlier analysis and extend our understanding of the variety of mechanisms by which genetic variants can influence gene expression.
KW - aberrant expression
KW - ectopic expression
KW - misexpression
KW - structural variants
KW - transcript fusion
KW - transcriptional readthrough
KW - transcriptomic outlier
UR - http://www.scopus.com/inward/record.url?scp=85200219230&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2024.06.017
DO - 10.1016/j.ajhg.2024.06.017
M3 - Article
C2 - 39053458
AN - SCOPUS:85200219230
SN - 0002-9297
VL - 111
SP - 1524
EP - 1543
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 8
ER -