Mahidol University Logo
Faculty of ICT, Mahidol University
 

Admissions

Printable Version

 

DOCUMENT ENGINEERING USING FORMAL CONCEPT ANALYSIS AND XML TECHNOLOGY

 

TITLE DOCUMENT ENGINEERING USING FORMAL CONCEPT ANALYSIS AND XML TECHNOLOGY
AUTHOR THIDA SRITHANTIP
DEGREE MASTER OF SCIENCE PROGRAMME IN COMPUTER SCIENCE
FACULTY FACULTY OF SCIENCE
ADVISOR THANWADEE SUNETNANTA
CO-ADVISOR CHARNYOTE PLUEMPITIWIRIYAWEJ
 
ABSTRACT
Document engineering is evolving as a new discipline of specifying, designing and implementing the electronic documents that request or provide interfaces to business processes via Web-based services. At the basic level, document engineering analyzes and designs methods yielding formal models that describe the information that business processes or services require. Due to the fact that there are enormous amounts of documents injected into a business enterprise everyday, there is a demand to understand the relationships or links between information in those documents, often existing in a distributed manner, in order to support better document management systems and information retrieval processes. Existing technologies for linking documents cannot cope with scenarios in which there are large data volumes and have various limitations in processing them. This research proposes an alternative model, namely DEFCA, to generate information links for relevant documents automatically. We define DEFCA input structure by XML. Structuring by XML, DEFCA is more open and able to work with various domains. In DEFCA, we applied Formal Concept Analysis (FCA), which is a data analysis technique, in order to analyze and extract relationships from a set of documents. These relationships will be used as the rules to create document links. By applying FCA, rules and links will be generated automatically without requiring an expert to predefine a set of document relationships. Therefore, DEFCA is suitable for any document management system, including one with a large data volume and frequent updates of document sets. We have also implemented a document management prototype using DEFCA. We have demonstrated our experiments which aimed at verifying the correctness of rules and links generated from DEFCA. The results of our experiments, once we passed sample document sets through the DEFCA prototype, yielded a linkbase that enabled us to retrieve information links among relevant documents easily and effectively.
KEYWORD DOCUMENT ENGINEERING / FORMAL CONCEPT ANALYSIS / XML

 

Go to Top

 

ICT Building, Mahidol University, 999 Phuttamonthon 4 Road, Salaya, Nakhonpathom 73170 Tel. +66 02 441-0909 Fax. +66 02 849-6099
Mahidol University Computing Center, The Faculty of ICT, Mahidol University , Rama 6 Road, Rajathevi, Bangkok 10400 Tel. +66 02 354-4333 Fax. +66 02 354-7333