Definition extraction from generic and mathematical domains with deep ensemble learning

Natalia Vanetik, Marina Litvak

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Definitions are extremely important for efficient learning of new materials. In particular, mathematical definitions are necessary for understanding mathematics-related areas. Automated extraction of definitions could be very useful for automated indexing educational materials, building taxonomies of relevant concepts, and more. For definitions that are contained within a single sentence, this problem can be viewed as a binary classification of sentences into definitions and non-definitions. In this paper, we focus on automatic detection of one-sentence definitions in mathematical and general texts. We experiment with different classification models arranged in an ensemble and applied to a sentence representation containing syntactic and semantic information, to classify sentences. Our ensemble model is applied to the data adjusted with oversampling. Our experiments demonstrate the superiority of our approach over state-of-the-art methods in both general and mathematical domains.

Original languageEnglish
Article number2502
JournalMathematics
Volume9
Issue number19
DOIs
StatePublished - 1 Oct 2021
Externally publishedYes

Keywords

  • Deep learning
  • Definition extraction
  • Ensemble
  • Mathematical domain

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • General Mathematics
  • Engineering (miscellaneous)

Fingerprint

Dive into the research topics of 'Definition extraction from generic and mathematical domains with deep ensemble learning'. Together they form a unique fingerprint.

Cite this