TY - GEN
T1 - Capturing the Content of a Document through Complex Event Identification
AU - Qi, Zheng
AU - Sulem, Elior
AU - Wang, Haoyu
AU - Yu, Xiaodong
AU - Roth, Dan
N1 - Publisher Copyright:
© 2022 Association for Computational Linguistics.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - Granular events, instantiated in a document by predicates, can usually be grouped into more general events, called complex events. Together, they capture the major content of the document. Recent work grouped granular events by defining event regions, filtering out sentences that are irrelevant to the main content. However, this approach assumes that a given complex event is always described in consecutive sentences, which does not always hold in practice. In this paper, we introduce the task of complex event identification. We address this task as a pipeline, first predicting whether two granular events mentioned in the text belong to the same complex event, independently of their position in the text, and then using this to cluster them into complex events. Due to the difficulty of predicting whether two granular events belong to the same complex event in isolation, we propose a context-augmented representation learning approach CONTEXTRL that adds additional context to better model the pairwise relation between granular events. We show that our approach outperforms strong baselines on the complex event identification task and further present a promising case study exploring the effectiveness of using complex events as input for document-level argument extraction.
AB - Granular events, instantiated in a document by predicates, can usually be grouped into more general events, called complex events. Together, they capture the major content of the document. Recent work grouped granular events by defining event regions, filtering out sentences that are irrelevant to the main content. However, this approach assumes that a given complex event is always described in consecutive sentences, which does not always hold in practice. In this paper, we introduce the task of complex event identification. We address this task as a pipeline, first predicting whether two granular events mentioned in the text belong to the same complex event, independently of their position in the text, and then using this to cluster them into complex events. Due to the difficulty of predicting whether two granular events belong to the same complex event in isolation, we propose a context-augmented representation learning approach CONTEXTRL that adds additional context to better model the pairwise relation between granular events. We show that our approach outperforms strong baselines on the complex event identification task and further present a promising case study exploring the effectiveness of using complex events as input for document-level argument extraction.
UR - http://www.scopus.com/inward/record.url?scp=85139121007&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85139121007
T3 - *SEM 2022 - 11th Joint Conference on Lexical and Computational Semantics, Proceedings of the Conference
SP - 331
EP - 340
BT - *SEM 2022 - 11th Joint Conference on Lexical and Computational Semantics, Proceedings of the Conference
A2 - Nastase, Vivi
A2 - Pavlick, Ellie
A2 - Pilehvar, Mohammad Taher
A2 - Camacho-Collados, Jose
A2 - Raganato, Alessandro
PB - Association for Computational Linguistics (ACL)
T2 - 11th Joint Conference on Lexical and Computational Semantics, *SEM 2022
Y2 - 14 July 2022 through 15 July 2022
ER -