|
|
Seminar
Link Mining
Prof. Dr. Luc De Raedt
Co-Organisators: Björn Bringmann
- Times: Tuesday 14-16 o'clock (Room: SR
01-019, Building: 079)
- Credit points (Kreditpunkte): 3
- Language: english, german
- Report: 7-10 pages
(content!) till the 21 of March 2005.
Electronic version preferred (PS, PDF -- exceptionally Word is accepted)
[It
is expected that you don't copy from any publications]
Author
|
Title
|
Download
|
Alcazar, Karla
|
PAGERANK
|
-
|
Altmeyer, Micha
|
Collaborative Filtering
|
-
|
Eyerich, Patrick
|
Object Identification
|
-
|
| Fischer, Jochen |
Focussed Web Crawlers with Reinforcement Learning
|
-
|
Gütlein, Martin
|
gSpan und CloseGraph
|
-
|
Holschuh, Joerg
|
Latent Semantic Analysis und Probabilistic Latent Semantic
Analysis
|
-
|
Ihmsen, Markus
|
Automatisches Klassifizieren von Webseiten
|
-
|
Lempp, Benjamin
|
Communities in statischen und dynamischen Graphen
|
-
|
Metzger, Manuel
|
Probabilistic Relational Models
|
-
|
Qiu, Haiyin
|
From Hidden Markov Models to Conditional Random Fields
|
-
|
Rendle, Steffen
|
Hubs & Authorities
|
-
|
Schultze, Hans-Martin
|
Dependency Networks fuer relationale Daten
|
-
|
Stritt, Manuel
|
Labeled and Unlabeled Data
|
-
|
| Zimmermann, Nico |
Random Graphs and Social Networks
|
-
|
| Date |
Authors |
Title |
PPT
|
PDF
/ PS
|
19.10.2004
|
L.
De
Raedt,
B. Bringmann
|
Link Mining |
PPT
|
|
- L. Getoor. Link Mining, SIGKDD
Explorations,Volume 5, Issue 1, July 2003.
|
02.11.2004
|
Karla
Alcazar
|
PageRank
|
PPT
|
|
- Larry
age, Sergey Brin, R. Motwani, T. Winograd. The PageRank Citation Ranking: Bringing
order to the Web (1998). [citeseer]
- Monika Henzinger. Link Analysis in Web Information
Retrieval. [pdf]
|
Steffen
Rendle
|
HITS
|
|
PDF
|
- J. Kleinberg. Authoritative Sources in a Hyperlinked
Environment (1999). [ps, citeseer]
|
09.11.2004
|
Jochen
Fischer
|
Citeseer
|
PPT
|
|
- Steve Lawrence, C. Lee Giles, Kurt
Bollacker. Digital Libraries and Autonomous Citation Indexing
(1999). [citeseer]
|
Nico
Zimmermann
|
Random
Graphs
|
|
PS
|
- Mark E. J. Newman. Random
graphs as models of networks (2002). [ps] [pdf]
|
Martin
Gütlein
|
Graph Mining
|
PPT
|
|
|
|
16.11.2004
|
Joerg
Holschuh
|
Latent
Semantic
Analysis
|
PPT
|
PDF
|
- Pierre Baldi, Paolo Frasconi, Padhraic Smyth. Modeling the Internet and the Web (2003).
- Scott Deerwester, Susan T. Dumais, Richard Harshman. Indexing by Latent Semantic Analysis
(1990). [citeseer]
|
Manuel
Stritt
|
Labeled and Unlabeled Data |
|
PDF
|
- Pierre Baldi, Paolo Frasconi, Padhraic Smyth. Modeling the Internet and the Web (2003) pg
114ff.
- The Expectation Maximization Algorithm / Bayesian
Networks
|
Manuel
Metzger
|
Probabilistic
Relational Models
|
|
|
- Nir Friedman, Lise Getoor, Daphne
Koller, Avi Pfeffer. Learning
Probabilistic Relational Models (1999). [pdf, citeseer]
|
23.11.2004
|
Patrick
Eyerich
|
Object
Identification
|
PPT
|
|
- Hanna Pasula, Bhaskara Marthi, Brian Milch, Stuart
Russell,
Ilya Shpitser. Identity Uncertainty
and Citation Matching. [pdf, citeseer]
|
Benjamin
Lempp
|
Web
Communities
|
PPT
|
|
- Gary W. Flake, Steve Lawrence, C. Lee Giles, Frans M.
Coetzee. Self-Organisation and
Identification of Web Communities (2002) [citeseer]
|
Markus
Ihmsen
|
Collective
Classification
|
PPT
|
|
- Rayid Ghani, Sean Slattery, Yiming Yang. Hypertext Categorization using Hyperlink
Patterns and Meta Data (2001). [citeseer]
- Rayid Ghani. Combining
Labeled and Unlabeled Data for MultiClass Text Categorization (2002) [citeseer]
|
30.11.2004
|
Micha Altmeyer
|
Collaborative
Filtering |
|
PDF
|
- David Heckerman, David Maxwell
Chickering,
Christopher Meek, Robert Rounthwaite, Carl Kadie. Dependency Networks for
Inference, Collaborative Filtering, and Data Visualization (2000). [citeseer]
|
Haiyin Qiu
|
Random Fields
|
|
|
|
| Date |
Authors |
Title |
PPT |
PDF/PS |
07.12.2004
|
Karla Alcazar
|
Deeper
Inside
PageRank
|
PPT
|
|
- Larry
Page, Sergey Brin, R. Motwani, T. Winograd. The PageRank Citation Ranking: Bringing
Order to the Web (1998). [citeseer]
- Monika Henzinger. Link Analysis in Web Information
Retrieval. [pdf]
- Amy N. Langville and
Carl
D. Meyer. Deeper Inside PageRank. Internet Mathematics Vol 1,
No 3. 355-400. [pdf]
|
Steffen Rendle
|
HITS
|
|
|
- J. Kleinberg. Authoritative Sources in a Hyperlinked
Environment (1999). [ps, citeseer]
- R. Lempel, S. Moran. The Stochastic Approach for
Link-Structure Analysis (SALSA) and the TKC Effect . [pdf]
|
14.12.2004
|
Martin
Gütlein
|
Graph Mining
|
|
|
- X. Yan, J. Han. gSpan:
Graph-Based Substructure Pattern Mining (2003). [gspan-short,
gspan, citeseer]
- X. Yan, J.
Han. CloseGraph: Mining Closed
Frequent Graph
Patterns (KDD, 2003). [acm]
|
Nico
Zimmermann
|
Random
Graphs and
Social Networks
|
|
|
- Jon
M.
Kleinberg, Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, Andrew
S. Tomkins. The Web
as a graph: measurements, models, and methods (1999) [citeseer]
- Soumen Chakrabarti. Mining the web: Discovering Knowledge from
Hypertext Data, chapter on Social
network analysis.
|
21.12.2004
|
Joerg
Holschuh
|
Latent
Semantic
Analysis
|
PPT
|
PDF
|
- Pierre Baldi, Paolo Frasconi, Padhraic Smyth. Modeling the Internet and the Web (2003) pg
88ff.
- Christos H. Papadimitriou, Prabhakar Raghavan,
Hisao Tamaki. Latent Semantic
Indexing: A Probabilistic Analysis (1997). [citeseer]
- Thomas Hofmann. Probabilistic
Latent Semantic Indexing (1999). [citeseer]
|
Manuel
Stritt
|
Labeled and Unlabeled Data
|
|
PDF
|
- Pierre Baldi, Paolo Frasconi, Padhraic Smyth. Modeling the Internet and the Web (2003) pg
114ff.
- Avrim Blum, Tom Mitchell. Combining Labeled and
Unlabeled Data with Co-Training (1998). [citeseer]
|
11.01.2005
|
Manuel
Metzger
|
Learing and
applying PRMs
|
PPT
|
|
- Nir Friedman, Lise Getoor, Daphne
Koller, Avi Pfeffer. Learning
Probabilistic Relational Models (1999). [citeseer]
- E.
Segal, D. Koller, D.Ormoneit. Rich
Probabilistic Models for Gene Expression (Bioinformatics 2003) [pdf]
|
Patrick
Eyerich
|
Object
Identification
|
|
|
- Parag and Pedro Domingos. Multi-Relational
Record Linkage (MRDM, 2004). [pdf]
- Sheila
Tejada, Craig A. Knoblock, Steven Minton. Learning Object Identification Rules for
Information Integration (IS, 2001). [citeseer]
|
18.01.2005
|
Jochen
Fischer
|
Focused
Crawling
|
PPT
|
|
- Andrew Kachites McCallum et al. Automating the Construction of Internet
Portals with Machine Learning (Information
Retrieval Journal, volume 3, 2000).
[citeseer]
- M Diligenti et al. Focused
Crawling Using Context Graphs (VLDB 2000). [citeseer]
|
| Benjamin
Lempp |
Web
Communities |
PPT
|
|
- Corinna Cortes, Daryl Pregibon, and Chris T.
Volinsky. Communities of Interest (IDA, 2001). [ps]
|
25.01.2005
|
Markus
Ihmsen
|
Collective
Classification
|
PPT
|
|
- Rayid Ghani, Sean Slattery, Yiming Yang. Hypertext Categorization using Hyperlink
Patterns and Meta Data (2001). [citeseer]
- Rayid Ghani. Combining
Labeled and Unlabeled Data for MultiClass Text Categorization (ICML,
2002) [citeseer]
- Jensen, D.,
J.
Neville and B. Gallagher. Why
Collective Inference Improves Relational Classification (KDD 2004).
[pdf]
|
Micha Altmeyer
|
Collaborative
Filtering
|
|
PDF
|
- David Heckerman, David Maxwell
Chickering,
Christopher Meek, Robert Rounthwaite, Carl Kadie. Dependency Networks for
Inference, Collaborative Filtering, and Data Visualization (2000). [citeseer]
- Prem Melville,
Raymond J. Mooney, Ramadass Nagarajan. Content-Boosted Collaborative Filtering (2001) [citeseer]
|
01.02.2005
|
There will be no seminar session
(do something useful on your own :-)
|
08.02.2005
|
15.02.2005
|
Haiyin Qiu |
Random Fields
|
|
|
|
Hans-Martin Schulze
|
|
|
|
|
|