Ulster University Logo

Ulster Institutional Repository

Integrating semantically heterogeneous aggregate views of distributed databases

Biomedical Sciences Research Institute Computer Science Research Institute Environmental Sciences Research Institute Nanotechnology & Advanced Materials Research Institute

McClean, SI, Scotney, BW, Morrow, PJ and Greer, KRC (2008) Integrating semantically heterogeneous aggregate views of distributed databases. Distributed and Parallel Databases, 24 (1-3). pp. 73-94. [Journal article]

Full text not available from this repository.

DOI: 10.1007/s10619-008-7031-6

Abstract

In statistical databases and data warehousing applications it is commonly the case that aggregate views are maintained as an underlying mechanism for summarising information. Where the databases or applications are distributed, or arise from independent data collections or system developments, there may be incompatibility, heterogeneity, and data inconsistency. These challenges need to be overcome if federations of aggregated databases are to be successfully incorporated into systems for database management, querying, retrieval, and knowledge discovery. In this paper we address the issue of integrating aggregate views that have semantically heterogeneous classification schemes. In previous work we have developed a methodology that is efficient but that cannot easily handle data inconsistencies. Our previous approach is therefore not particularly well suited to very large databases or federations of large numbers of databases. We now address these scalability issues by introducing a methodology for heterogeneous aggregate view integration that constructs a dynamic shared ontology to which each of the aggregate views can be explicitly related. A maximum likelihood technique, implemented using the EM (Expectation Maximisation) algorithm, is used to inherently handle data inconsistencies in the computation of integrated aggregates that are described in terms of the dynamic shared ontology.

Item Type:Journal article
Keywords:Distributed databases; Aggregate views; Heterogeneous data; Dynamic shared ontologies
Faculties and Schools:Faculty of Computing & Engineering
Faculty of Computing & Engineering > School of Computing and Information Engineering
Research Institutes and Groups:Computer Science Research Institute
Computer Science Research Institute > Information and Communication Engineering
ID Code:6859
Deposited By:Professor Bryan Scotney
Deposited On:20 Jan 2010 15:57
Last Modified:15 Jun 2011 11:08

Repository Staff Only: item control page