Identification of Terrorist Web Sites with Cross-Lingual Classification Tools

Alex Markov, Mark Last

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

Abstract

The following sections are included: • Introduction • Document Categorization and Classification • Selected Applications of Web Document Classification • Automatic Web News Extraction • Personalization and E-Commerce • Organization of Web Document Collections • Multi-Lingual Applications • Document Representation • Traditional Text Models • Web Document Models • Graph Based Representations of Web Documents • Graph Structure • Term Extraction Methods • Naïve Extraction • Smart Extraction • Frequent Sub-Graph Extraction Problem • Cross-Lingual Web Document Classification with Graphs • Representation and Classification Process • Web Document Representation Example • Case Study: Identification of Terrorist Web Sites in Arabic • About Document Collection • Preprocessing of Documents in Arabic • Experiment and Evaluation of Results • Conclusions • Acknowledgment.

Original language: English
Title of host publication: Fighting Terror In Cyberspace
Editors: Mark Last, Abraham Kandel
Publisher: World Scientific Publishing Co.
Chapter: 8
Pages: 117-141
Number of pages: 25
ISBN (Electronic): 9789812703255
ISBN (Print): 978-9-81256-493-1
State: Published - 2005

Publication series

Name: Series in Machine Perception and Artificial Intelligence

ASJC Scopus subject areas

  • General Computer Science
