Classification Based on Configuration Objects by Using Procrustes Analysis
Abstract
Classification is a data mining task that predicts the group to which an object belongs. The prediction can be carried out using similarity measures, classification trees, or regression. Procrustes analysis, on the other hand, is a technique for matching two configurations that has previously been applied to outlier detection. This suggests that Procrustes can address misclassification when misclassified objects are treated as outliers. This paper therefore proposes the Procrustes classification algorithm (PrCA) and the Procrustes nearest-neighbor classification algorithm (PNNCA). Their results were compared with those of classical classification algorithms, namely k-Nearest Neighbors (k-NN), Support Vector Machine (SVM), AdaBoost (AB), Random Forest (RF), Logistic Regression (LR), and Ridge Regression (RR), on the iris, cancer, liver, seeds, and wine datasets. The minimum and maximum accuracies obtained by PrCA were 0.610 and 0.925, while those of PNNCA were 0.610 and 0.963. PrCA generally outperformed k-NN, SVM, and AB, while PNNCA generally outperformed k-NN, SVM, AB, and RF. Based on these results, PrCA and PNNCA deserve consideration as a new approach to classification.
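The core building block described above, matching two configurations and measuring their residual distance, can be sketched as follows. This is a minimal illustrative implementation of the standard orthogonal Procrustes distance using NumPy, not the authors' PrCA/PNNCA code; the function name `procrustes_distance` is hypothetical.

```python
import numpy as np

def procrustes_distance(X, Y):
    """Orthogonal Procrustes distance between two n-by-p configurations.

    Centers both configurations, finds the rotation R minimizing
    ||X - Y R||_F via the SVD of Y^T X, and returns the residual
    Frobenius norm after the optimal alignment.
    """
    # Translate both configurations so their centroids sit at the origin
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    # The optimal rotation comes from the SVD of the cross-product matrix:
    # if Yc^T Xc = U S V^T, then R = U V^T maximizes trace(R^T Yc^T Xc)
    U, _, Vt = np.linalg.svd(Yc.T @ Xc)
    R = U @ Vt
    # Residual misfit after rotating Yc onto Xc
    return np.linalg.norm(Xc - Yc @ R)
```

In a classifier along the lines sketched in the abstract, a test object's configuration would be compared against each class configuration with this distance, and the object assigned to the class with the smallest residual.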
Article Details
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.