TY - GEN
T1 - Computing temporal trends in web documents
AU - Last, Mark
PY - 2005/12/1
Y1 - 2005/12/1
AB - Most existing methods of web content mining assume that web documents are static. This assumption is inadequate for long-term monitoring and analysis of web content, since both users' interests and the content of most web sites change continuously over time. In this research, we are interested in developing computationally intelligent and efficient text mining techniques that will enable continuous comparison between documents provided by the same source (website, institute, organization, cult, author, etc.) or viewed by the same group of users (e.g., university students) and timely detection of temporal trends in those documents. Our approach builds upon the recently developed methodology for fuzzy comparison of frequency distributions. The proposed techniques are evaluated on a real-world stream of web traffic.
KW - Automated Perceptions
KW - Text Mining
KW - Trend Detection
KW - Trend Discovery
KW - Web Content Mining
UR - http://www.scopus.com/inward/record.url?scp=84871973307&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84871973307
SN - 8476538723
SN - 9788476538722
T3 - Proceedings - 4th Conference of the European Society for Fuzzy Logic and Technology and 11th French Days on Fuzzy Logic and Applications, EUSFLAT-LFA 2005 Joint Conference
SP - 615
EP - 620
BT - Proceedings - 4th Conference of the European Society for Fuzzy Logic and Technology and 11th French Days on Fuzzy Logic and Applications, EUSFLAT-LFA 2005 Joint Conference
T2 - Joint 4th Conference of the European Society for Fuzzy Logic and Technology, EUSFLAT 2005 and 11th French Days on Fuzzy Logic and Applications, LFA 2005
Y2 - 7 September 2005 through 9 September 2005
ER -