An Effective Way to Enhance Classifications for the Semi-Structured Research Articles
Due to the drastic increase in the research publications, numerous research articles are available electronically on different online digital libraries. Some research articles or papers are not retrieved during online searches due to their classification issues. The adequately structured research articles are relatively easily approachable as compared to semi-structured and unstructured research articles, and sometimes the reader does not get accurate results on different digital libraries as the research articles are not classified properly. Neglecting the semi-structured and unstructured published research not only causes gap deficiency but also affects the results of the proposed techniques and citations for other articles. Usually, researchers missed semi-structured and unstructured research articles during their online search. Classification techniques have been applied to structured articles and no significant work has been performed towards the classification of semi-structured and unstructured research articles. Therefore, this research focuses on the classification of semi-structured research articles using different supervised classification techniques so that the most accurate and large amount of relevant research results will be achieved. For experimentation, a labeled dataset was used for the classification of semi-structured papers. The dataset we used for experimentation is comprised of manually gathered research articles from Santos repository dataset and labeling them accordingly. The current study used four different supervised classification techniques such as Support Vector Machine (SVM) classifier, Naïve Based classifier, K Nearest Neighbor classifier, and Decision Tree classifier. The comparison was performed between these supervised classification techniques to see which classifier gives better accuracy. The unit of measures or parameters selected to compare these classifiers are: accuracy, recall, precision, and f-score. The evaluation was performed on the basis of results and comparison in the experimentation
Copyright (c) 2020 University of Sindh Journal of Information and Communication Technology
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
University of Sindh Journal of Information and Communication Technology (USJICT) follows an Open Access Policy under Attribution-NonCommercial CC-BY-NC license. Researchers can copy and redistribute the material in any medium or format, for any purpose. Authors can self-archive publisher's version of the accepted article in digital repositories and archives.
Upon acceptance, the author must transfer the copyright of this manuscript to the Journal for publication on paper, on data storage media and online with distribution rights to USJICT, University of sindh, Jamshoro, Pakistan. Kindly download the copyright for below and attach as a supplimentry file during article submission