Ontology-based Interface Thai Herbs Search System (THSS): Data and User Management Module

Mr. Chiun-Yi Chen
Assoc. Prof. Kitsana Waiyamai, Ph.D.
Bachelor of Engineering (Software and Knowledge Engineering), Kasetsart University
TEL: (+66) 2-942-8555 Ext. 1430
FAX: (+66) 2-579-6245
E-mail: ([email protected] ac. th)

Abstract

Thai herbs are increasingly used in many application fields, such as medicine, health and cosmetics. However, Thai herb data are stored and collected in several types of databases and in a variety of formats, such as Excel, Access and flat files, which makes it difficult for users to search and analyze them. The objective of this work is to design and develop a Thai herbs search system. The system consists of two parts: a data and user management module and an ontology-based user interface module. This project focuses on the first module, which covers data extraction and filtering and user management. Data extraction and filtering are implemented using web-crawler and natural language processing techniques.

Keywords: Thai herbs, Ontology, web-crawler, Search Engine

1. Introduction

1.1 Background and Motivation

Recently, the use of Thai herbs has become popular in many organizations, for example for medical and educational purposes. Knowledge about Thai herbs, both expert and indigenous, is therefore distributed in many forms and formats, so it cannot easily be shared and reused. To enable knowledge sharing, an efficient method of storing Thai herb knowledge is needed. Because each organization collects its own data, Thai herb information is fragmented. The data sources differ in size and describe the data in different ways, so people who are interested in herbs need a long time to search for and find information. Some existing websites have collected data from several sources in order to present it in a standard pattern and help users search it easily.
However, these websites do not offer a quick and easy way to update Thai herb data, and users find it difficult to analyze the data and to use the systems. The Data and User Management Module manages users and extracts, filters and collects data into a single format, so that the data are easy to use and can be passed on to the next module. Thus, this work builds a website that collects herb data from several sources automatically, is easy to extend, and controls which users can log in to manage the server.

1.2 Problem statement

Because each organization collects its own data, Thai herb information is fragmented. The data sources differ in size and describe the data in different ways, so people who are interested in herbs need a long time to search for and find information. The problems of the existing systems can be summarized as follows:
• Hard to use for users who have no knowledge of herbs.
• Do not provide up-to-date information.
• Too slow at searching.
• Do not have enough categories to search by.
• Do not have enough details and pictures to show.

1.3 Related works

From our survey, we found several related systems, whose advantages and disadvantages are as follows:

• Mahidol's System
This system is maintained and updated by the Medicinal Plant Information Center (MPIC), Faculty of Pharmacy, Mahidol University.
Advantages
– Has many search categories.
– Has a specialist who answers the questions that users post on the board.
– Has a quick search function for users who have no knowledge of herbs.
Disadvantages
– Has no ontology showing the scope of the categories.
– Can find only the exact word that the user specifies.
– Does not offer related search words.

• Chitralada's System
This system is maintained and updated by the Plant Genetics Conservation Project under the Royal Initiative of Her Royal Highness Princess Maha Chakri Sirindhorn.
Advantages
– Provides picture search.
Disadvantages
– Can search for herbs only by picture.
– Does not provide keyword search.
– Difficult to search for herb information.

• Other Systems
For the other websites, we found the following advantages and disadvantages:
Advantages
– Contain a lot of agricultural knowledge.
– Provide keyword search.
Disadvantages
– Do not have many search categories.
– Can search only for herbs that the system provides.
– Difficult to find the herbs that users want to know about.
– Can find only herb names that are already shown on the webpage.

Table 1: Comparison of the several systems
| Function | Mahidol | Chitralada | Other |
| 1. Searching by word/list | Yes | Yes | Yes |
| 2. Searching by related word | No | No | No |
| 3. Storing in database | Yes | No | Yes |
| 4. Ontology method | No | No | No |
| 5. Updating information | No | No | No |
| 6. Quick search | Yes | No | No |

1.4 Purpose and Objective

The objective of this work is to design and develop a Thai herbs search system that uses web-crawler and natural language processing techniques to extract data correctly and automatically from several websites with different formats. This work also develops a user management module with which the administrator can easily control the data on the server and ensure that it is correct and in order.

1.5 Scope

This project uses herb information from several internet sources:
• http://www.medplant.mahidol.ac.th – this site is maintained and updated by the Medicinal Plant Information Center (MPIC), Faculty of Pharmacy, Mahidol University.
• http://www.rspg.org/palace/chitralada/cld6-2.htm – this site is maintained and updated by the Plant Genetics Conservation Project under the Royal Initiative of Her Royal Highness Princess Maha Chakri Sirindhorn.
• http://agkc.lib.ku.ac.th/plantwebsite/webpage/Home/SelectTypeSearch.html – this site is maintained and updated by the Faculty of Agriculture, Kasetsart University.

2. System Overviews and Features

2.1 System Overviews

The figure below shows the whole Thai herbs search system, which consists of two modules: the data and user management module and the ontology-based user interface module.

[pic]
Figure 1: The system architecture.

• Sources: The chosen websites from which data will be extracted.
• ETL Process: Extracting data and loading it into the data warehouse.
• Database: Storing Thai herb information.
• Search/Query: Retrieving data from the database to show to clients.
• Interface: Showing the information to clients.
• Clients: The users who want information about herbs.

2.2 Features

The data and user management module of the Thai herbs search system has the following features:
• Updates data automatically.
• Allows the date and time of data updates to be set.
• Provides picture-based or keyword-based search.
• Supports hyperlink searching.
• Allows search results to be merged in table form.

3. System design and Implementation

3.1 Thai herbs ontology design

Thai Herbs Knowledge Analysis: Thai herb knowledge can be divided into the following aspects:
• Family name of herb
– Family Name
• The name of herb
– Thai herb name: the general name of the Thai herb
– Scientific Name: the name used in the scientific field
– Local Name: the name by which the herb is called in a region
– Common Name: the trade name, or the name of the herb in English
– Synonym Name: the synonym of the herb in English
• Kinds of plants
– Tree
– Shrub
– Climber
– Herb
• Components of herb
The components of a herb that can be used to cure symptoms are:
– Seed
– Wood
– Bloom
– Root
– Bark
– Trunk
– Petal
– Fruit
– Leaf
– Head
– Heartwood
– Branch
– Other
• Cons of herb
– Cons: the harmful effects of a herb, which can be dangerous for humans.
• Pros of herb
– Pros: the benefits of a herb, which can be used for treating diseases and for other purposes.
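The aspects listed above map naturally onto a record structure. The following is a minimal Python sketch under stated assumptions, not the system's actual implementation: the class name HerbRecord, its field names, and the rule of matching records from different sources on the scientific name are all hypothetical and were chosen only to illustrate how data from several sources could be collected into the same format.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class PlantKind(Enum):
    """Kinds of plants listed in the knowledge analysis."""
    TREE = "Tree"
    SHRUB = "Shrub"
    CLIMBER = "Climber"
    HERB = "Herb"


# Components of a herb named in the knowledge analysis.
COMPONENTS = {
    "Seed", "Wood", "Bloom", "Root", "Bark", "Trunk", "Petal",
    "Fruit", "Leaf", "Head", "Heartwood", "Branch", "Other",
}


@dataclass
class HerbRecord:
    """One herb, one field per aspect (all field names are hypothetical)."""
    family_name: str
    thai_name: str
    scientific_name: str
    kind: PlantKind
    local_names: List[str] = field(default_factory=list)
    common_names: List[str] = field(default_factory=list)
    synonyms: List[str] = field(default_factory=list)
    components: List[str] = field(default_factory=list)
    pros: List[str] = field(default_factory=list)
    cons: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Reject component names that are not in the analysis list.
        unknown = [c for c in self.components if c not in COMPONENTS]
        if unknown:
            raise ValueError(f"unknown herb components: {unknown}")


def _union(xs: List[str], ys: List[str]) -> List[str]:
    """Union of two lists without duplicates, keeping first-seen order."""
    return list(dict.fromkeys(xs + ys))


def merge_records(a: HerbRecord, b: HerbRecord) -> HerbRecord:
    """Combine two records for the same herb taken from different sources.

    Assumption: records are matched on scientific name; single-valued
    aspects are taken from the first record, list-valued aspects are unioned.
    """
    if a.scientific_name != b.scientific_name:
        raise ValueError("records describe different herbs")
    return HerbRecord(
        family_name=a.family_name,
        thai_name=a.thai_name,
        scientific_name=a.scientific_name,
        kind=a.kind,
        local_names=_union(a.local_names, b.local_names),
        common_names=_union(a.common_names, b.common_names),
        synonyms=_union(a.synonyms, b.synonyms),
        components=_union(a.components, b.components),
        pros=_union(a.pros, b.pros),
        cons=_union(a.cons, b.cons),
    )
```

Keeping every list-valued aspect as a plain list makes it straightforward to combine what two sources say about the same herb, while the component check catches values that fall outside the analysis early.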
From the Thai herbs knowledge analysis, we can design a herb ontology that describes the relationships between the concepts of the herb domain, as shown in Figure 2.

[pic]
Figure 2: Ontology design of Thai herbs

Graphical User Interface

Figures 3-4 show the admin interface of the ontology-based Thai herb search system. Users log in through this interface by entering their user name and password after registering as a member. The page contains links to the main page, the herb search page and the About Us page. Users can also contact the administrator by leaving a message, and boards are provided for announcing news.

[pic]
Figure 3: User login interface.

[pic]
Figure 4: User register interface.

[pic]
Figure 5: Herb search interface.

[pic]
Figure 6: Result of a herb search.

[pic]
Figure 7: Herb merge result.

4. Experiment result

The testing and evaluation of the system are explained in detail below.

4.1 Testing

The testing results of this project are shown in Table 2 and Figure 8.

Table 2: Results of system testing
| No. | Measurements | Result (%) |
| 1 | Correctness of information | 100.00% |
| 2 | Performance of usability | 82.00% |
| | - support the requirements of user | 75.00% |
| | - decrease working process | |

[pic]
Figure 8: Testing results chart

4.2 Evaluation

The evaluation results of this project are summarized in Table 3 and Figure 9.

Table 3: Results of system evaluation (score of this system)
| Topics of measurement | Moderate | Good | Excellent | Average |
| Meaningfulness of user interface | 10.00% | 82.50% | 7.50% | 4.25 |
| Result of searching system | 18.00% | 81.00% | 1.00% | 4.125 |
| Result from management system | 24.0% | 70.00% | 5.50% | 4.02 |
| Overall satisfaction | 13.00% | 85.00% | 2.00% | 4.75 |
| Overall result of evaluation | 15.90% | 79.30% | 4.80% | 4.259 |

[pic]
Figure 9: Satisfaction with using the system

5. Conclusion and Future work

The ontology-based Thai herb search system has been tested and evaluated by the people concerned. The project can be concluded as follows:

5.1 Overall operation

The developed ontology-based Thai herb search system supports its end users as follows:

For organizations and herb experts
• Extract knowledge easily from the Thai herbs search system.
• Reduce the time needed to search for Thai herb information.
• Cover the Thai herb information that they need to use.

For the general public and students
• Reduce the time needed to search for Thai herb information.
• Cover the Thai herb information that they need to use.
• Use the system and analyze herbs easily.

For developers
• Reduce the time needed to edit and improve data when a data source changes.
• Use the system as a prototype for developing other similar systems.

5.2 Future work

To be complete, this project still has to:
• Provide an administrator interface for easily scheduling automatic update tasks.
• Provide more herb information sources.
• Provide a security system for automatic updating.
• Crawl and parse faster.

6. Acknowledgement

During this project I faced many problems that I had never encountered before, and I received assistance from many people. First, I would like to thank my project advisor, Assoc. Prof. Kitsana Waiyamai, Ph.D., chief of the DAKDL Laboratory, for the excellent suggestions and kindness. This project could not have been finished without the encouragement and guidance of my project advisor.
Moreover, I am very thankful to Thanapat Kangkachit (P'Chai) for giving many suggestions, Thitiwan Thimapakorn (P'pupae) for explaining the previous system, Siriwon Taewijit (P'won) for giving ideas, Klao for the beautiful logo, the other DAKDL members for supporting this project, Kwan Nut (Green) for technical support, and all my lovely friends for supporting this project and encouraging me. Many thanks also go to Ooy for being a good project partner. I would also like to thank the IUP project room and the DAKDL Laboratory for providing a great place to work during this project. Finally, none of this would have been successful without the encouragement and support of my father, my mother and other people whom I cannot thank enough in words. They always support me in everything. I would like to thank them for all their love and attention.

REFERENCES

http://www.medplant.mahidol.ac.th/index.asp, 2000
http://www.rspg.org/palace/chitralada/cld6-2.htm, 2544
http://agkc.lib.ku.ac.th/plantwebsite/webpage/Home/SelectTypeSearch.html
Boonyarat Phaderrod (2004). Local Thai's Natural Product Ontology for Potential Bioactive Compounds Search and Analysis. Master's thesis, Kasetsart University.