Title

Constraint-based mining of bi-sets from gene expression data

Authors

J. Besson, J-F. Boulicaut, R. Pensa and C. Robardet
INSA Lyon, LIRIS CNRS FRE 2672
Batiment Blaise Pascal
69621 Villeurbanne cedex, France

Abstract

We are designing new data mining techniques on gene expression data, more precisely inductive querying techniques that extract a priori interesting bi-sets, i.e., sets of objects (or biological situations) and associated sets of attributes (or genes). The so-called (formal) concepts are important special cases of a priori interesting bi-sets in derived boolean expression matrices, e.g., matrices that encode over-expression of genes. In order to provide putative transcription modules, i.e., one of the main goals for molecular biologists, several post-processing tasks can be performed on the extracted bi-sets.

In this talk, we will survey our recent work on constraint-based mining for bi-sets. It includes efficient techniques for closed set computation and thus concept mining in typical gene expression databases. A new algorithm that pushes monotonic constraints during concept extraction will be sketched. Finally, we will consider several post-processing techniques that are currently studied in cooperation with molecular biologists (S. Blachon, Dr. O. Gandrillon, Dr. S. Rome). It includes basic but efficient vizualization techniques and the use of strong association rules.

Slides

PDF (478952 bytes)

Last modified: $Date: 2004/04/19 15:55:29 $ (UTC)