Fuzzy Integration to Standard Calculation of K-Nearest Neighbour Attributes

M Adib Al Karomi, Ivandari Ivandari

Abstract


The development of information and data in the era of the industrial revolution 4.0 is very fast. Researchers, institutions and even industry are competing to find and utilize methods in data processing that are more effective and efficient. In data mining classification, there are several best methods and are widely used by researchers. One of them is K-Nearest Neighbor (KNN). The calculation process in the KNN algorithm is carried out by comparing the testing data to all existing training data. This comparison is generally symbolized by the value of closeness or similarity between attribute records. The KNN method is proven to be good for handling large datasets and datasets with many attributes. One of the drawbacks in calculating the similarity of the KNN is that if there are attributes with a large range value, the similarity value will also be large. Conversely, if the range in an attribute is small, the similarity is also small. This condition is clearly unfair considering the types of attributes in the current data vary widely. One solution to this problem is to use standardization for all existing data attributes. Fuzzy is a model introduced by Prof. Zadeh which allows a faint value to be a value between 1 and 0. In this study the fuzzy model will be integrated in the KNN similarity calculation to obtain standardization of all data attributes. The results show that the use of the KNN algorithm in the classification of credit approval has an accuracy rate of 91.83%.


Keywords


attribute normalization, fuzzy integration, KNN

Full Text:

PDF

References


Bawono, Aditya Hari, and Ahmad Afif Supianto. 2019. “Efisiensi Klasifikasi Big Data Menggunakan Improved Neighbour” 6 (6): 1–6. https://doi.org/10.25126/jtiik.201962085.

Cover, T M, and P E Hart. 1967. “Nearest Neighbor Pattern Classification” I.

Gamadarenda, ikhsan wisnuadji, and Indra Waspada. 2018. “Implementasi Data Mining Untuk Deteksi Penyakit Ginjal Kronis (Pgk) Menggunakan K-Nearest Neighbor (Knn) Dengan Backward Elimination” 7 (2): 417–26. https://doi.org/10.25126/jtiik.202071896.

Gorunescu, Florin. 2011. Data Mining: Concepts; Models and Techniques. Springer.

Indrayanti, Indrayanti, Sugianti Devi, and M. Adib Al Karomi. 2017. “Peningkatan Akurasi Algoritma KNN Dengan Seleksi Fitur Gain Ratio Untuk Klasifikasi Penyakit Diabetes Mellitus.” IC-TECH XIII (2): 1–6.

Karomi, M. Adib Al, Much. Rifqi Maulana, Slamet Joko Prasetiyono, Ivandari, and Arochman. 2019. “Strengthening Campus Finance by Analyzing Attribute Attributes for Student Registration Classifications.” https://jurnal.polines.ac.id/index.php/jaict/article/view/1431.

Karomi, M Adib Al. 2015. “Optimasi Parameter K Pada Algoritma KNN Untuk Klasifikasi Heregistrasi Mahasiswa Program Studi Teknik Informatika STMIK Widya Pratama Jl . Patriot 25 Pekalongan Email : Adib.Comp@gmail.Com.” IC-TECH X (0285): 5.

Kusrini, and Luthfi Emha Taufiq. 2009. Algoritma Data Mining. Yogyakarta: Andi Offset.

Larose, Daniel T. 2005. Discovering Knowledge in Data: An Introduction to Data Mining. John Wiley & Sons.

Prasetyo, Eko. 2012. Data Mining Konsep Dan Aplikasi Menggunakan Matlab. Yogyakarta: Andi Offset.

Santosa, Budi. 2007. Data Mining Teknik Pemanfaatan Data Untuk Keperluan Bisnis. Edisi Pert. Yogyakarta: Graha Ilmu.

Sets, Fuzzy. 1997. “Toward a Theory of Fuzzy Information Granulation and Its Centrality in Human Reasoning and Fuzzy Logic” 90: 111–27.

Singh, Harpreet, Madan M Gupta, Thomas Meitzler, Zeng-guang Hou, Kum Kum Garg, Ashu M G Solo, and Lotfi A Zadeh. 2013. “Real-Life Applications of Fuzzy Logic” 2013.

Susanto, Sani, and Dedi Suryadi. 2010. Pengantar Data Mining: Menggali Pengetahuan Dari Bongkahan Data. Yogyakarta: Andi Offset.

Witten, I. H., E. Frank, and M. A. Hall. 2011. Data Mining: Practical Machine Learning Tools and Techniques 3rd Edition. Vol. 40. Elsevier. https://doi.org/10.1002/1521-3773(20010316)40:6<9823::AID-ANIE9823>3.3.CO;2-C.

Witten, Ian H, Eibe Frank, and Mark A. Hall. 2011. Data Mining: Practical Machine Learning Tools and Techniques 3rd Edition. Elsevier.

Wu, Xindong. 2009. The Top Ten Algorithms in Data Mining. Edited by Vipin Kumar. New York: Taylor & Francis Group, LLC.




DOI: http://dx.doi.org/10.32497/jaict.v5i2.1984

Refbacks

  • There are currently no refbacks.


ISSN: 2541-6340
Online ISSN: 2541-6359

Visitor: 

View My Stats

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.