Faculty of ICT, Mahidol University
 


DOCUMENT CLASSIFICATION

Yuvanan Yuvacharaskul     5088202  ITCS/B
Pruch Preechapermsub      5088213  ITCS/B
Thana Watthanasomsiri     5088250  ITCS/B

B.Sc. (INFORMATION AND COMMUNICATION TECHNOLOGY)

Project Advisor: Assoc. Prof. Dr. Damras Wongsawang

Abstract

The number of electronic documents is growing, and their contents are increasingly varied. To make documents easy to search and organize according to users' needs, they must be categorized according to an index. Categorization makes document storage and retrieval quick and effective.

Document classification is a problem in information science. The task is to assign an electronic document to one or more categories based on its contents. Document classification tasks fall into two kinds: supervised document classification, where some external mechanism (such as human feedback) provides information on the correct classification of documents, and unsupervised document classification, where the classification must be done entirely without reference to external information. Our project uses a supervised document classification technique: we define a set of keywords, and the program compares the words in a news article with those keywords to determine the article's category. Moreover, the program can categorize documents in several formats, such as .txt and .html.
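
The keyword comparison described above can be illustrated with a minimal Python sketch. It is not the project's actual program: the category names, the keyword lists, and the function names `extract_text` and `classify` are all hypothetical, chosen only for illustration, and the scoring rule (count keyword hits, return the top-scoring categories) is an assumption about how such a comparison might work.

```python
import re
from html.parser import HTMLParser

# Hypothetical keyword lists; the abstract does not state the real ones.
CATEGORY_KEYWORDS = {
    "sports": {"football", "match", "team", "goal"},
    "politics": {"election", "government", "minister", "parliament"},
    "technology": {"software", "internet", "computer", "network"},
}


class _TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML document and ignores the tags.

    Note: text inside <script> or <style> is also collected; a real
    program would probably want to skip it.
    """

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def extract_text(content, is_html=False):
    """Return plain text from a .txt string or an .html string."""
    if not is_html:
        return content
    parser = _TextExtractor()
    parser.feed(content)
    parser.close()
    return " ".join(parser.parts)


def classify(content, is_html=False, keywords=CATEGORY_KEYWORDS):
    """Return the categories whose keywords occur most often in the text.

    An empty list means that no keyword matched at all. Ties are kept,
    so a document can be assigned to more than one category.
    """
    words = re.findall(r"[a-z]+", extract_text(content, is_html).lower())
    scores = {
        category: sum(1 for word in words if word in category_words)
        for category, category_words in keywords.items()
    }
    best = max(scores.values())
    if best == 0:
        return []
    return sorted(c for c, score in scores.items() if score == best)
```

For example, `classify("The team scored a late goal in the football match.")` returns `["sports"]` under these assumed keyword lists, and passing `is_html=True` lets the same function handle the contents of an .html file.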

 

KEYWORDS: DOCUMENT CLASSIFICATION / DOCUMENT CATEGORIZATION

 


 
