Institute for Computer Science

 
Home
Events
People
Research
Publications

Teaching
Job and Students

   Opportunities
Tools and Data
Miscellanous
Contact

Seminar

Link Mining

Prof. Dr. Luc De Raedt

Co-Organisators: Björn Bringmann
  • Times:  Tuesday 14-16 o'clock (Room: SR 01-019, Building: 079)
  • Credit points (Kreditpunkte):  3
  • Language:  english, german

  • Report: 7-10 pages (content!) till the 21 of March 2005.
    Electronic version preferred (PS, PDF -- exceptionally Word is accepted)
    [It is expected that you don't copy from any publications]

Index

[ Final Reports ]
[Basic Talks]
[Advanced Talks]

Reports

Author
Title
Download
Alcazar, Karla
PAGERANK
-
Altmeyer, Micha
Collaborative Filtering
-
Eyerich, Patrick
Object Identification
-
Fischer, Jochen Focussed Web Crawlers with Reinforcement Learning
-
Gütlein, Martin
gSpan und CloseGraph
-
Holschuh, Joerg
Latent Semantic Analysis und Probabilistic Latent Semantic Analysis
-
Ihmsen, Markus
Automatisches Klassifizieren von Webseiten
-
Lempp, Benjamin
Communities in statischen und dynamischen Graphen
-
Metzger, Manuel
Probabilistic Relational Models
-
Qiu, Haiyin
From Hidden Markov Models to Conditional Random Fields
-
Rendle, Steffen
Hubs & Authorities
-
Schultze, Hans-Martin
Dependency Networks fuer relationale Daten
-
Stritt, Manuel
Labeled and Unlabeled Data
-
Zimmermann, Nico Random Graphs and Social Networks
-


Basic Talks

Date Authors Title PPT
PDF / PS
19.10.2004
L. De Raedt,
B. Bringmann
Link Mining PPT


  • L. Getoor. Link Mining, SIGKDD Explorations,Volume 5, Issue 1, July 2003.

02.11.2004
Karla Alcazar
PageRank
PPT

  • Larry age, Sergey Brin, R. Motwani, T. Winograd. The PageRank Citation Ranking: Bringing order to the Web (1998). [citeseer]

  • Monika Henzinger. Link Analysis in Web Information Retrieval. [pdf]

Steffen Rendle
HITS

PDF
  • J. Kleinberg. Authoritative Sources in a Hyperlinked Environment (1999). [ps, citeseer]

09.11.2004
Jochen Fischer
Citeseer
PPT

  • Steve Lawrence, C. Lee Giles, Kurt Bollacker. Digital Libraries and Autonomous Citation Indexing (1999). [citeseer]

Nico Zimmermann
Random Graphs

PS
  • Mark E. J. Newman. Random graphs as models of networks (2002). [ps] [pdf]

Martin Gütlein
Graph Mining PPT

16.11.2004
Joerg Holschuh
Latent Semantic Analysis
PPT
PDF
  • Pierre Baldi, Paolo Frasconi, Padhraic Smyth. Modeling the Internet and the Web (2003).

  • Scott Deerwester, Susan T. Dumais, Richard Harshman. Indexing by Latent Semantic Analysis (1990). [citeseer]

Manuel Stritt
Labeled and Unlabeled Data
PDF
  • Pierre Baldi, Paolo Frasconi, Padhraic Smyth. Modeling the Internet and the Web (2003) pg 114ff.

  • The Expectation Maximization Algorithm / Bayesian Networks

Manuel Metzger
Probabilistic Relational Models

  • Nir Friedman, Lise Getoor, Daphne Koller, Avi Pfeffer. Learning Probabilistic Relational Models (1999). [pdf, citeseer]

23.11.2004

Patrick Eyerich
Object Identification
PPT

  • Hanna Pasula, Bhaskara Marthi, Brian Milch, Stuart Russell, Ilya Shpitser. Identity Uncertainty and Citation Matching. [pdf, citeseer]

Benjamin Lempp
Web Communities
PPT

  • Gary W. Flake, Steve Lawrence, C. Lee Giles, Frans M. Coetzee. Self-Organisation and Identification of Web Communities (2002) [citeseer]

Markus Ihmsen
Collective Classification
PPT

  • Rayid Ghani, Sean Slattery, Yiming Yang. Hypertext Categorization using Hyperlink Patterns and Meta Data (2001). [citeseer]

  • Rayid Ghani. Combining Labeled and Unlabeled Data for MultiClass Text Categorization (2002) [citeseer]

30.11.2004
Micha Altmeyer
Collaborative Filtering
PDF
  • David Heckerman, David Maxwell Chickering, Christopher Meek, Robert Rounthwaite, Carl Kadie. Dependency Networks for Inference, Collaborative Filtering, and Data Visualization (2000). [citeseer]

Haiyin Qiu
Random Fields







Advanced Talks

Date Authors Title PPT PDF/PS
07.12.2004
Karla Alcazar
Deeper Inside PageRank
PPT

  • Larry Page, Sergey Brin, R. Motwani, T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web (1998). [citeseer]

  • Monika Henzinger. Link Analysis in Web Information Retrieval. [pdf]

  • Amy N. Langville and Carl D. Meyer. Deeper Inside PageRank. Internet Mathematics Vol 1, No 3. 355-400. [pdf]

Steffen Rendle
HITS

  • J. Kleinberg. Authoritative Sources in a Hyperlinked Environment (1999). [ps, citeseer]

  • R. Lempel, S. Moran. The Stochastic Approach for Link-Structure Analysis (SALSA) and the TKC Effect . [pdf]

14.12.2004
Martin Gütlein
Graph Mining

  • X. Yan, J. Han. gSpan: Graph-Based Substructure Pattern Mining (2003). [gspan-short, gspan, citeseer]

  • X. Yan, J. Han. CloseGraph: Mining Closed Frequent Graph Patterns (KDD, 2003).  [acm]

Nico Zimmermann
Random Graphs and Social Networks

  • Jon M. Kleinberg, Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, Andrew S. Tomkins. The Web as a graph: measurements, models, and methods (1999) [citeseer]

  • Soumen Chakrabarti. Mining the web: Discovering Knowledge from Hypertext Data, chapter on Social network analysis.

21.12.2004
Joerg Holschuh
Latent Semantic Analysis
PPT
PDF
  • Pierre Baldi, Paolo Frasconi, Padhraic Smyth. Modeling the Internet and the Web (2003) pg 88ff.

  • Christos H. Papadimitriou, Prabhakar Raghavan, Hisao Tamaki. Latent Semantic Indexing: A Probabilistic Analysis (1997). [citeseer]
  • Thomas Hofmann. Probabilistic Latent Semantic Indexing (1999).  [citeseer]

Manuel Stritt
Labeled and Unlabeled Data

PDF
  • Pierre Baldi, Paolo Frasconi, Padhraic Smyth. Modeling the Internet and the Web (2003) pg 114ff.

  • Avrim Blum, Tom Mitchell. Combining Labeled and Unlabeled Data with Co-Training (1998). [citeseer]

11.01.2005
Manuel Metzger
Learing and applying PRMs
PPT

  • Nir Friedman, Lise Getoor, Daphne Koller, Avi Pfeffer. Learning Probabilistic Relational Models (1999). [citeseer]

  • E. Segal, D. Koller, D.Ormoneit. Rich Probabilistic Models for Gene Expression (Bioinformatics 2003) [pdf]

Patrick Eyerich
Object Identification

  • Parag and Pedro Domingos. Multi-Relational Record Linkage (MRDM, 2004). [pdf]

  • Sheila Tejada, Craig A. Knoblock, Steven Minton. Learning Object Identification Rules for Information Integration (IS, 2001). [citeseer]

18.01.2005
Jochen Fischer
Focused Crawling
PPT

  • Andrew Kachites McCallum et al. Automating the Construction of Internet Portals with Machine Learning (Information Retrieval Journal, volume 3, 2000). [citeseer]

  • M Diligenti et al. Focused Crawling Using Context Graphs (VLDB 2000). [citeseer]

Benjamin Lempp Web Communities PPT

  • Corinna Cortes, Daryl Pregibon, and Chris T. Volinsky. Communities of Interest (IDA, 2001). [ps]

25.01.2005
Markus Ihmsen
Collective Classification
PPT

  • Rayid Ghani, Sean Slattery, Yiming Yang. Hypertext Categorization using Hyperlink Patterns and Meta Data (2001). [citeseer]

  • Rayid Ghani. Combining Labeled and Unlabeled Data for MultiClass Text Categorization (ICML, 2002) [citeseer]

  • Jensen, D., J. Neville and B. Gallagher. Why Collective Inference Improves Relational Classification (KDD 2004). [pdf]

Micha Altmeyer
Collaborative Filtering

PDF
  • David Heckerman, David Maxwell Chickering, Christopher Meek, Robert Rounthwaite, Carl Kadie. Dependency Networks for Inference, Collaborative Filtering, and Data Visualization (2000). [citeseer]

  • Prem Melville, Raymond J. Mooney, Ramadass Nagarajan. Content-Boosted Collaborative Filtering (2001) [citeseer]

01.02.2005

There will be no seminar session

(do something useful on your own :-)

08.02.2005
15.02.2005
Haiyin Qiu Random Fields



Hans-Martin Schulze