TY - GEN
T1 - Methodology for connecting nouns to their modifying adjectives
AU - Ofek, Nir
AU - Rokach, Lior
AU - Mitra, Prasenjit
PY - 2014/1/1
Y1 - 2014/1/1
N2 - Adjectives are words that describe or modify other elements in a sentence. As such, they are frequently used to convey facts and opinions about the nouns they modify. Connecting nouns to the corresponding adjectives becomes vital for intelligent tasks such as aspect-level sentiment analysis or interpretation of complex queries (e.g., "small hotel with large rooms") for fine-grained information retrieval. To respond to the need, we propose a methodology that identifies dependencies of nouns and adjectives by looking at syntactic clues related to part-of-speech sequences that help recognize such relationships. These sequences are generalized into patterns that are used to train a binary classifier using machine learning methods. The capabilities of the new method are demonstrated in two, syntactically different languages: English, the leading language of international discourse, and Hebrew, whose rich morphology poses additional challenges for parsing. In each language we compare our method with a designated, state-of-the-art parser and show that it performs similarly in terms of accuracy while: (a) our method uses a simple and relatively small training set; (b) it does not require a language specific adaptation, and (c) it is robust across a variety of writing styles.
AB - Adjectives are words that describe or modify other elements in a sentence. As such, they are frequently used to convey facts and opinions about the nouns they modify. Connecting nouns to the corresponding adjectives becomes vital for intelligent tasks such as aspect-level sentiment analysis or interpretation of complex queries (e.g., "small hotel with large rooms") for fine-grained information retrieval. To respond to the need, we propose a methodology that identifies dependencies of nouns and adjectives by looking at syntactic clues related to part-of-speech sequences that help recognize such relationships. These sequences are generalized into patterns that are used to train a binary classifier using machine learning methods. The capabilities of the new method are demonstrated in two, syntactically different languages: English, the leading language of international discourse, and Hebrew, whose rich morphology poses additional challenges for parsing. In each language we compare our method with a designated, state-of-the-art parser and show that it performs similarly in terms of accuracy while: (a) our method uses a simple and relatively small training set; (b) it does not require a language specific adaptation, and (c) it is robust across a variety of writing styles.
KW - Information Retrieval
KW - Parsing
KW - Relation Extraction
UR - http://www.scopus.com/inward/record.url?scp=84958531562&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-54906-9_22
DO - 10.1007/978-3-642-54906-9_22
M3 - Conference contribution
AN - SCOPUS:84958531562
SN - 9783642549052
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 271
EP - 284
BT - Computational Linguistics and Intelligent Text Processing - 15th International Conference, CICLing 2014, Proceedings
PB - Springer Verlag
T2 - 15th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2014
Y2 - 6 April 2014 through 12 April 2014
ER -