The Application of Agglomerative Clustering in Customer Credit Receipt of Fashion and Shoe Retail
DOI:
https://doi.org/10.9744/jirae.3.1.37-44
Keywords:
Agglomerative Clustering, Single Linkage, Complete Linkage, Data Warehouse, Data Mining
Abstract
Agglomerative clustering is a data mining method that builds clusters in the form of a tree. We used two agglomerative methods, Single Linkage and Complete Linkage. Finding the nearest items to merge into one cluster requires a similarity measure, so we used Euclidean Distance and Cosine Similarity to measure the similarity distance between two points. Accuracy depends heavily on the pre-processing stage that precedes clustering, which also affects the results obtained. We therefore carried out pre-processing in several stages: ETL, normalization, and pivoting. The ETL process consisted of removing outliers using the IQR method, followed by data cleaning and data filtering. For normalization, we used the Min-Max and Altman Z-Score methods to get the best normalized values. The results show that the highest accuracy occurs when using Complete Linkage with Min-Max normalization and Euclidean Distance, with an average purity of 0.4. A significant difference is observed when using the Z-Score and Cosine Similarity methods, where the average purity is around 0.11. We also found that the system could not predict customers' preferences in buying goods for the next period. Another result of the research is that a company's transactional data are not good enough to be clustered.
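The best-performing combination named in the abstract (Min-Max normalization, Euclidean Distance, Complete Linkage, evaluated by purity) can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the two-feature customer rows, and the "low"/"high" reference labels are all hypothetical, the purity formula is the common majority-class definition (the paper's "average purity" may be computed differently), and the ETL, IQR, pivoting, Z-Score, and Cosine Similarity steps are omitted.

```python
import math
from collections import Counter


def min_max(rows):
    """Rescale each column to [0, 1]; constant columns map to 0.0."""
    cols = list(zip(*rows))
    lo = [min(c) for c in cols]
    hi = [max(c) for c in cols]
    return [
        [(v - l) / (h - l) if h > l else 0.0 for v, l, h in zip(row, lo, hi)]
        for row in rows
    ]


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def agglomerative(points, k, linkage="complete", dist=euclidean):
    """Merge the two closest clusters until k remain; return one label per point."""
    # Complete linkage scores a cluster pair by its farthest members,
    # single linkage by its nearest members.
    agg = max if linkage == "complete" else min
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = agg(
                    dist(points[a], points[b])
                    for a in clusters[i]
                    for b in clusters[j]
                )
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i].extend(clusters[j])
        del clusters[j]
    labels = [0] * len(points)
    for label, members in enumerate(clusters):
        for idx in members:
            labels[idx] = label
    return labels


def purity(labels, truth):
    """Fraction of points that fall in the majority true class of their cluster."""
    by_cluster = {}
    for label, t in zip(labels, truth):
        by_cluster.setdefault(label, []).append(t)
    majority = sum(Counter(ts).most_common(1)[0][1] for ts in by_cluster.values())
    return majority / len(labels)


# Hypothetical data: [number of receipts, total amount] per customer.
rows = [[1, 100], [2, 120], [1, 90], [9, 900], [10, 950], [8, 880]]
truth = ["low", "low", "low", "high", "high", "high"]

labels = agglomerative(min_max(rows), k=2, linkage="complete")
score = purity(labels, truth)
```

Passing `linkage="single"` switches to Single Linkage, and a different distance function can be supplied through the `dist` parameter. The pairwise search is deliberately naive and only suitable for small inputs.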
License
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and publishing right, and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) following the publication of the article, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).