Issue |
MATEC Web Conf.
Volume 135, 2017
8th International Conference on Mechanical and Manufacturing Engineering 2017 (ICME’17)
|
|
---|---|---|
Article Number | 00066 | |
Number of page(s) | 9 | |
DOI | https://doi.org/10.1051/matecconf/201713500066 | |
Published online | 20 November 2017 |
Comparative Study of Machine Learning Approach on Malay Translated Hadith Text Classification based on Sanad
1
Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia
2
Fakulti Pengajian Quran dan Sunnah, Universiti Sains Islam Malaysia, Bandar Baru Nilai, Negeri Sembilan, Malaysia
3
Fakulti Pengajian Islam, Universiti Kebangsaan Malaysia, Bangi, Selangor Malaysia
* Corresponding author: syahidah@pahang.uitm.edu.my
Sanad is one of important part used to determine the authentication of hadith. However, very little research work has been found on classification of Malay translated Hadith based on sanad. There are some researches done using machine learning approach on hadith classification based on sanad but using different objective with different language. This research is to see how Machine Learning techniques are used to classify Malay translated Hadith document based on sanad. In this paper, SVM, NB and k-NN are used to identify and evaluate the performance of Malay translated hadith based on sanad. The performances are evaluated based on standard performance metrics used in text classification which is accuracy and response time. The results show that SVM has the highest accuracy and k-NN has the best response time (time taken in process for classification data) compare to other classifier. In future, we plan to extend this paper with the analysis on interclass similarity and also test on larger dataset.
© The Authors, published by EDP Sciences, 2017
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (http://creativecommons.org/licenses/by/4.0/).
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.