Semantic Information Gathering Approach for Heterogeneous Information
Sources on WWW
Ngamnij Arch-int
ngamnij@kku.ac.th
Peraphon Sophatsathit
Advanced Virtual and Intelligent Computing (AVIC) Center,
Department of Mathematics, Faculty of Science,
Chulalongkorn University, Bangkok, 10330, Thailand
Peraphon.S@chula.ac.th
Abstract
The increasing demand for accessing heterogeneous information sources to
support global applications and decision making requirements forces
organizations to solve heterogeneity problems. One of the important problems
stemming from accessing the heterogeneous data is semantic heterogeneity.
A number of research efforts have been proposed to address this problem,
ranging from mediators-based systems, description logic-based systems to
content-descriptive metadata systems. In thise paper, we propose a metadata
dictionary as an assistant mechanism forkey for resolvingsolving the semantic
heterogeneity. The proposed metadata dictionary is designed based on domain
ontology where the constituent components are defined in terms of
object-oriented and set theory. An XML-based data model is employed to
manipulate and express the metadata dictionary contents. The inherent
flexibility of XML technology permits system-wide interoperability
suitable for a Web-based environment.
Keywords: Heterogeneous Information Sources, Domain
Ontology, XML-based Metadata Dictionary.