|
TITLE: | Mining colocation patterns from spatial datasets |
|
PRESENTER: | Shashi Shekhar : Biography , Homepage |
|
AFFILIATION: | Computer Science Department, University of Minnesota. |
| SLIDES: |
Given a collection of boolean spatial features, the co-location rule discovery process finds the subsets of features whose instances are frequently located together in geographic space. For example, symbiotic plant species and predator-prey animal species are likely co-locations in Ecology datasets. The co-location rule discovery problem is different from the association rule discovery problem. Even though the boolean spatial features may be considered as item types, there is no natural notion of transactions. Transactioning spatial datasets can lead to incorrect estimation of the interest measures for many spatial co-location patterns with instances near transaction boundaries. This makes it difficult to use traditional interest measures, e.g. support, and traditional association rule mining algorithms, which are based on ideas like support based pruning and compression of transaction data.
Proposed approach formalizes the notion of co-locations using user-specified spatial neighborhoods in place of transactions. It defines new interest measures based on the neighborhoods along with a model for interpreting the co-location rules. It provides a simple, correct, and complete algorithm for mining co-location rules. In addition, it proposes to advance the development of co-location mining by addressing three basic issues, namely, scalability, ascertaining quality of inferred patterns, and discovery of high confidence low support rules.
KEYWORDS: Data Mining, Spatial Datasets, Colocation patterns, Association rules, Apriori algorithm.
NOTE: Some of the results discussed in this talk appeared in the following publications: