Links to Papers


Data Mining

 

Introductory Material

Data Cleaning

Classification

Gini Index
Web Mining *

(Links from Prof. Zaki's page at RPI)
  • Web Mining: Information and Pattern Discovery on the World Wide Web (A Survey Paper) (1997) (with R. Cooley and J. Srivastava), in Proceedings of the 9th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'97), November 1997. Available from http://maya.cs.depaul.edu/~mobasher/pubs.html
  • Enhanced hypertext categorization using hyperlinks. With Byron Dom and Piotr Indyk. In SIGMOD 1998. Available at http://http.cs.berkeley.edu/~soumen/
  • Document Categorization and Query Generation on the World Wide Web Using WebACE (1999). Daniel Boley, Maria Gini, Robert Gross, Eui-Hong (Sam) Han, Kyle Hastings, George Karypis, Vipin Kumar, Bamshad Mobasher, and Jerome Moore. To appear in AI Review. Available from ftp://ftp.cs.umn.edu/dept/users/kumar/WEB/papers.html#bbbb
  • Knowledge Discovery from User's Web-Page Navigation, Cyrus Shahabi, Amir Zarkesh, Jafar Abidi, and Vishal Shah, Seventh International Workshop on Research Issues in Data Engineering, April 7-8, 1997. Available from http://imsc.usc.edu/Tools/profiler.html
  • From User Access Patterns to Dynamic Hypertext Linking, Tak Woon Yan, Matthew Jacobsen, Hector Garcia-Molina, Umeshwar Dayal, Fifth International World Wide Web Conference, May 1996. Available online at http://www5conf.inria.fr/fich_html/papers/P8/Overview.html
  • O. R. Zaiane, M. Xin, J. Han, "Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs", Proc. Advances in Digital Libraries Conf. (ADL'98), Santa Barbara, CA, April 1998, pp. 19-29. Available at http://www.cs.sfu.ca/research/groups/DB/sections/publication/smmdb/smmdb.html
  • D.W. Cheung, B. Kao, and J.W. Lee, Discovering User Access Patterns on the World-Wide-Web. Proc. First Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD-97), Singapore, February, 1997. Available from http://www.csis.hku.hk/~dcheung/publication.html
  • Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan: Scalable Feature Selection, Classification and Signature Generation for Organizing Large Text Databases into Hierarchical Topic Taxonomies, VLDB Journal 1998. Available at http://www.cs.berkeley.edu/~soumen/VLDB54_3.PDF
Text Mining

Clustering