TY - JOUR
T1 - A systematic variant annotation approach for ranking genes associated with autism spectrum disorders
AU - Larsen, Eric
AU - Menashe, Idan
AU - Ziats, Mark N.
AU - Pereanu, Wayne
AU - Packer, Alan
AU - Banerjee-Basu, Sharmila
N1 - Funding Information:
This research was funded by the Simons Foundation.
Publisher Copyright:
© 2016 The Author(s).
PY - 2016/10/21
Y1 - 2016/10/21
N2 - Background: The search for genetic factors underlying autism spectrum disorders (ASD) has led to the identification of hundreds of genes containing thousands of variants that differ in mode of inheritance, effect size, frequency, and function. A major challenge involves assessing the collective evidence in an unbiased, systematic manner for their functional relevance. Methods: Here, we describe a scoring algorithm for prioritization of candidate genes based on the cumulative strength of evidence for each ASD-associated variant cataloged in AutDB (also known as SFARI Gene). We retrieved data from 889 publications to generate a dataset of 2187 rare and 711 common variants distributed across 461 genes implicated in ASD. Each individual variant was manually annotated with multiple attributes extracted from the original report, followed by score assignment using a set of standardized parameters yielding a single score for each gene. Results: There was a wide variation in scores; SHANK3, CHD8, and ADNP had distinctly higher scores than all other genes in the dataset. Our gene scores were significantly correlated with other recently published rankings of ASD genes (R Spearman = 0.40-0.63; p< 0.0001), providing support for our scoring algorithm. Conclusions: This new resource, which is freely available, for the first time aggregates on one-platform variants identified from various study types (simplex, multiplex, multigenerational, and consanguineous families), from both common and rare variants, and also incorporates their putative functional consequences to arrive at a genetically and biologically driven ranking scheme. This work represents a major step in moving from simply cataloging autism variants to using data-driven approaches to gain insight into their significance.
AB - Background: The search for genetic factors underlying autism spectrum disorders (ASD) has led to the identification of hundreds of genes containing thousands of variants that differ in mode of inheritance, effect size, frequency, and function. A major challenge involves assessing the collective evidence in an unbiased, systematic manner for their functional relevance. Methods: Here, we describe a scoring algorithm for prioritization of candidate genes based on the cumulative strength of evidence for each ASD-associated variant cataloged in AutDB (also known as SFARI Gene). We retrieved data from 889 publications to generate a dataset of 2187 rare and 711 common variants distributed across 461 genes implicated in ASD. Each individual variant was manually annotated with multiple attributes extracted from the original report, followed by score assignment using a set of standardized parameters yielding a single score for each gene. Results: There was a wide variation in scores; SHANK3, CHD8, and ADNP had distinctly higher scores than all other genes in the dataset. Our gene scores were significantly correlated with other recently published rankings of ASD genes (R Spearman = 0.40-0.63; p< 0.0001), providing support for our scoring algorithm. Conclusions: This new resource, which is freely available, for the first time aggregates on one-platform variants identified from various study types (simplex, multiplex, multigenerational, and consanguineous families), from both common and rare variants, and also incorporates their putative functional consequences to arrive at a genetically and biologically driven ranking scheme. This work represents a major step in moving from simply cataloging autism variants to using data-driven approaches to gain insight into their significance.
KW - Autistic disorder
KW - Autosomal recessive
KW - Common variants
KW - Genetic variation
KW - Rare variants
UR - http://www.scopus.com/inward/record.url?scp=84992147507&partnerID=8YFLogxK
U2 - 10.1186/s13229-016-0103-y
DO - 10.1186/s13229-016-0103-y
M3 - Article
C2 - 27790361
AN - SCOPUS:84992147507
SN - 2040-2392
VL - 7
JO - Molecular Autism
JF - Molecular Autism
IS - 1
M1 - 44
ER -