CSE 5800 Advanced Topics in CS:
Learning/Mining and the Internet
MW 6:30-7:45pm, EC 132

Philip Chan
242 Engineering Complex, 674-7280

Office Hours: MW 1-3pm (or by appointment)

Syllabus

Schedule

Week Dates Monday Wednesday
1Aug 17 & 19 Introduction
A Mining/Learning Algorithm [Classification]
Syskill & Webert: Identifying interesting web sites. M. Pazzani, J. Muramatsu & D. Billsus, Proc. AAAI, p54-61, 1996
2Aug 24 & 26 Classification
[More details in Induction of Decision Trees. J. Quinlan. Machine Learning journal, 1:81-106, 1986.]
Data Mining Methods for Detection of New Malicious Executables. M. Schultz, E. Eskin, E. Zadok & S. Stolfo. Proc. IEEE Security & Privacy Symp., p38-49, 2001.
3Aug 31 & Sep 2 Fast Effective Rule Induction W. Cohen. Proc. ICML, p115-123, 1995. Anomaly Detection
Learning Rules for Anomaly Detection of Hostile Network Traffic. M. Mahoney & P. Chan. Proc. ICDM, pp. 601-604, 2003.
[More details in FIT Tech Report CS-2003-16.]
4Sep 7 & 9 Labor Day holiday HW 1 demo
5Sep 14 & 16 A comparative study of anomaly detection schemes in network intrusion detection. A. Lazarevic, L. Ertoz, A. Ozgur, J. Srivastava & V. Kumar. Proc. SDM, p25-36, 2003. An Experimental Comparison of Naive Bayesian and Keyword-Based Anti-Spam Filtering with Personal E-mail Messages. I. Androutsopoulos, J. Koutsias, K. Chandrinos, C. Spyropoulos. Proc SIGIR, pp 160-167, 2000.
6Sep 21 & 23 No class (at NSF) No class (at NSF)
7Sep 28 & 30 Clustering
A Comparison of Document Clustering Techniques. M Steinbach, G Karypis, V Kumar. U. Minnesota Tech Report 00-034, 2000.
[shorter version: A Comparison of Document Clustering Techniques. M Steinbach, G Karypis, V Kumar. KDD Workshop on Text Mining, 2000.]
HW 2 demo
8Oct 5 & 7 A Personalized Search Engine Based on Web-Snippet Hierarchical Clustering. P Ferragina, A Gulli. Proc. WWW, p801-810, 2005. Adaptive web sites: Conceptual cluster mining. M. Perkowitz & O. Etzioni. Proc. IJCAI, p264-269, 1999.
9Oct 12 & 14 Columbus Day Fall break A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. M. Ester, H. Kriegel, J. Sander & X. Xu. Proc. KDD, p. 226-231, 1996.
10Oct 19 & 21 Graphs
Bridging Centrality: Graph Mining from Element Level to Group Level. W. Hwang, T. Kim, M. Ramanathan and A. Zhang. Proc. KDD, p. 336-344, 2008.
HW 3 demo
11Oct 26 & 28 The Link Prediction Problem for Social Networks. D. Liben-Nowell & J. Kleinberg. Proc. CIKM, p. 556-559, 2003. Social Capital in Friendship-Event Networks. L. Licamele and L. Getoor. Proc. ICDM, p. 959-964, 2006.
12Nov 2 & 4 Association Rules from handout--6.1-6.3
[More details in Fast Algorithms for Mining Association Rules. R. Agrawal & R. Srikant. Proc. VLDB, p. 487-499, 1994.]
Mining Web Logs for Prediction Models in WWW Caching and Prefetching Q. Yang, H. Zhang & T Li. Proc. KDD, p. 473-478, 2001.
13Nov 9 & 11 HW 4 demo Veteran's Day holiday
14Nov 16 & 18 HW 4 demo
[canceled: Mining frequent spatio-temporal sequential patterns. H. Cao, N. Mamoulis, D. Cheung. Proc. ICDM, p. 82 - 89, 2005.]
ROAM: Rule- and Motif-Based Anomaly Detection in Massive Moving Object Data Sets. X. Li, J. Han, S. Kim & H. Gonzalez. Proc. SDM, p. 273-284 , 2007.
15Nov 23 & 25 Google News Personalization: Scalable Online Collaborative Filtering. A. Das, M. Datar, A. Garg, S. Rajaram. Proc. WWW, p. 271-280, 2007 School holiday (Thanksgiving)
16Nov 30 & Dec 2 Term Project presentation and demo Term Project presentation and demo

Abbreviation/acronym of research conferences

Assignments (Submit Server)

Resources