Abstract
We investigate one technique to produce a summary of an original text without requiring its full semantic interpretation, but instead relying on a model of the topic progression in the text derived from lexical chains. We present a new algorithm to compute lexical chains in a text, merging several robust knowledge sources: the WordNet thesaurus, a part-of-speech tagger, shallow parser for the identification of nominal groups, and a segmentation algorithm. Summarization proceeds in four steps: the original text is segmented, lexical chains are constructed strong chains are identified and significant sentences are extracted.
| Original language | English |
|---|---|
| Title of host publication | Advances in automatic text summarization |
| Editors | Inderjeet Mani, Mark T Maybury |
| Place of Publication | Cambridge, Massachusetts |
| Publisher | The MIT Press |
| Chapter | 10 |
| Pages | 111-121 |
| Number of pages | 11 |
| ISBN (Print) | 0262133598, 9780262133593 |
| State | Published - 1999 |