TY - GEN
T1 - A Dataset of Syntactic-Ngrams over Time from a Very Large Corpus of English Books
AU - Goldberg, Yoav
AU - Orwant, Jon
N1 - Publisher Copyright:
c 2013 Association for Computational Linguistics
PY - 2013/1/1
Y1 - 2013/1/1
N2 - We created a dataset of syntactic-ngrams (counted dependency-tree fragments) based on a corpus of 3.5 million English books. The dataset includes over 10 billion distinct items covering a wide range of syntactic configurations. It also includes temporal information, facilitating new kinds of research into lexical semantics over time. This paper describes the dataset, the syntactic representation, and the kinds of information provided.
AB - We created a dataset of syntactic-ngrams (counted dependency-tree fragments) based on a corpus of 3.5 million English books. The dataset includes over 10 billion distinct items covering a wide range of syntactic configurations. It also includes temporal information, facilitating new kinds of research into lexical semantics over time. This paper describes the dataset, the syntactic representation, and the kinds of information provided.
UR - https://www.scopus.com/pages/publications/84943742382
M3 - Conference contribution
AN - SCOPUS:84943742382
T3 - *SEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics
SP - 241
EP - 247
BT - *SEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics
PB - Association for Computational Linguistics (ACL)
T2 - 2nd Joint Conference on Lexical and Computational Semantics, *SEM 2013
Y2 - 13 June 2013 through 14 June 2013
ER -