Type: Doctoral Thesis
Title: Mining Structured Data
Author: Nijssen, Siegfried Gerardus Remius
Publisher: Leiden Institute of Advanced Computer Science, Faculty of Mathematics and Natural Sciences, Leiden University
Issue Date: 2006-05-15
Keywords: Computer Science; Machine Learning; Artificial Intelligence; Database Applications; Data Mining
Abstract: Many databases do not consist of a single table of fixed dimensions, but of objects that are related to each other: the databases are relational, or structured. We study the discovery of patterns in such data. In our approach, a data analyst specifies constraints on patterns that she believes to be of interest, and the computer searches for patterns that satisfy these constraints. An important constraint on which we focus is that a pattern should have a significant number of occurrences in the data. Constraints like this allow the search to be performed reasonably efficiently. We develop algorithms for searching patterns that are represented in formal first-order logic, tree data structures and graph data structures. We perform experiments in which these algorithms, and algorithms proposed by other researchers, are compared with each other, and study which properties determine the efficiency of the algorithms. As a result, we are able to develop more efficient algorithms. As an application, we study the discovery of fragments in molecular datasets. The aim is to discover fragments that relate the structure of molecules to their activity.
Description: Promotor: J.N. Kok, Co-Promotor: W.A. Kosters
With Summary in Dutch
Faculty: Faculteit der Wiskunde en Natuurwetenschappen
Citation: Nijssen, S.G.R., 2006, Doctoral Thesis, Leiden University
Sponsor: Institute for Programming Research and Algorithms (IPA)

Files in this item

Format           Description                                       Size
application/pdf  Full Text, under embargo until further notice     2.248Mb
application/pdf  Curriculum Vita ... he IPA Dissertation Series    241.1Kb
application/pdf  Summary in Dutch                                  72.09Kb
application/pdf  Bibliography, Index, Acknowledgements             764.3Kb
application/pdf  Chapters 1-8, under embargo until further notice  9.912Mb
application/pdf  Title page, Table of Contents                     92.88Kb
