A Comparative Study on Association Rule Mining Algorithms on the Hospital Infection Control Dataset

Authors

DOI:

https://doi.org/10.21123/bsj.2023.7571

Keywords:

Machine learning, Apriori Algorithm, apriori mlxtend, Hospital Readmission, Association Rule Mining, Performance of Algorithms.

Abstract

Administrative procedures in various organizations produce numerous crucial records and data. These
records and data are also used in other processes like customer relationship management and accounting
operations.It is incredibly challenging to use and extract valuable and meaningful information from these data
and records because they are frequently enormous and continuously growing in size and complexity.Data
mining is the act of sorting through large data sets to find patterns and relationships that might aid in the data
analysis process of resolving business issues. Using data mining techniques, enterprises can forecast future
trends and make better business decisions.The Apriori algorithm has been introduced to calculate the
association rules between objects; the primary goal of this algorithm is to establish an association rule between
various things. The association rule describes how two or more objects are related.We have employed the
Apriori property and Apriori Mlxtend algorithms in this study and we applied them on the hospital database;
and, by using python coding, the results showed that the performance of Apriori Mlxtend was faster, and it
was 0.38622, and the Apriori property algorithm was 0.090909. That means the Apriori Mlxtend was better
than the Apriori property algorithm.

References

Koutsojannis, C. Medical Knowledge Extraction: Particular Difficulties And Obligations. 1st edition. Chap 3, NOVA Publishers .2020; 42-43.

Feldman R, Sanger J. The text mining handbook: advanced approaches in analyzing unstructured data. Cambridge university press. 2019. 34(1): 125-127

Muhammed M, Flaih L. Online Web Page Classification. Computer Engineering and Intelligent Systems. 2014; 5)1(: 14-15. https://core.ac.uk/reader/234644747.

Silva M, Rodrigues O. Risk factors for surgical site infection: challenges to public health. J Microbiol Exp. 2019; 10(1): 1-8. https://doi.org/10.15406/jmen.2022.10.00345.

Kocakoç D, Türkölmez B. Using Data Mining Techniques for Designing Patient-Friendly Hospitals. In Advances in Econometrics, Operational Research, Data Science and Actuarial Studies. Springer, Cham. 2022; 321-343. https://doi.org/10.1007/978-3-030-85254-2_20.

Chen B, Huang Z, Liu C, Wu Z. Spatio-temporal data mining method for joint cracks in concrete dam based on association rules. Struct Control Health Monit. 2022; 29: 4-5. https://doi.org/10.1002/stc.2848.

Wardani D. Measuring Positive and Negative Association of Apriori Alg3orithm with Cosine Correlation Analysis. Baghdad Sci J. 2021; 18(3): 0554. https://doi.org/10.21123/bsj.2021.18.3.0554.

Ibrahim W, Abdullaev S, Alkattan H, Adelaja A, Subhi A. Development of a Model Using Data Mining Technique to Test, Predict and Obtain Knowledge from the Academics Results of Information Technology Students. Data. 2022; 1-2. https://doi.org/10.3390/data7050067.

Prasanthi N, Rao MVP. A Comprehensive Assessment of Privacy Preserving Data Mining Techniques. 2nd Int Conf Sustain Expert Syst.. 2022; 833- 842. https://doi.org/10.1007/978-981-16-7657-4_67.

Sastrt J, Suresh V. A Survey Paper on Frequent Itemset Mining. J Phys: Conf Sers. 2019; 1228(1): 1-7.

Abdulmajeed A, Tawfeeq T, Al-jawaherry A. Constructing a Software Tool for Detecting Face Mask-wearing by Machine Learning. Baghdad Sci J. 2022; 19(3): 0642. https://doi.org/10.21123/bsj.2022.19.3.

Dehbozorgi MR. Rastegar M, Sami A. Data mining-based cause identification of momentary outages in power distribution systems. Sustain Cities Soc. 2022; 77: 103587. https://doi.org/10.1016/j.scs.2021.103587.

Babu A, Raj, Varalatchoumy M, Gopila M, Justin F. Novel Approach for Predicting COVID-19 Symptoms using ARM based APRIORI Algorithm. International Conference on Computing Methodologies and Communication (ICCMC). IEEE. 2022; 1577-1580.https://doi.org/10.1109/ICCMC53470.2022.9753987.

Pradeepa S. Jaisaiarun P, Srinivasan P, Subbulakshmi V, Tarik A. FREEDOM Effective Surveillance and Investigation of Water-borne Diseases from Data-centric Networking Using Machine Learning Techniques. Int J Artif Intell Tools. 2022; 2250004. https://doi.org/10.1142/S021821302250004X.

Boyko, N, Komarnytska H, Kryvenchuk Y, Malynovskyy Y. Clustering Algorithms for Economic and Psychological Analysis of Human Behavior. In CMiGIN. 2019; 614-626.

Ivascu T, Cincar K, Carunta A. Extracting Association Rules from Emergency Department Data. The 7th IEEE International Conference on E-Health and Bioengineering. IEEE. 2019: 3-4.

Wang Y, Wu Y, Li Y, Yao F, Fournier-Viger P, Wu X. Self-adaptive nonoverlapping sequential pattern mining. Appl Intell. 2022; 52(6): 6646-6661. https://doi.org/10.1007/s10489-021-02763-y.

Raj S, Ramesh D, Sethi K. A Spark-based Apriori algorithm with reduced shuffle overhead. J Supercomput. 2021; 77(1): 133-151. https://doi.org/10.1007/s11227-020-03253-7.

Datta S, Mali K, Das S, Kundu S, Harh S. Rhythmus periodic frequent pattern mining without periodicity threshold. J Ambient Intell Humaniz Comput. 2022; 1-13. https://doi.org/10.1007/s12652-021-03617-8.

Dinov D. (2023). Qualitative Learning Methods-Text Mining, Natural Language Processing, and Apriori Association Rules Learning. In Data Science and Predictive Analytics: Biomedical and Health Applications using R. Cham: Springer International Publishing. 2023; 385-437. https://Doi:10.1007/978-3-031-17483-4_7.

Tharini J, Shivakumar L. Computational Intelligence and Data Sciences. 1st edition. Florida, USA: CRC Press. Chap 11, High-Utility Itemset Mining: Fundamentals, Properties, Techniques and Research Scope. 2022; 195-210.

Agrawal R, Srikant R. Fast Algorithms for Mining Association Rules. Proc. of the 20th Int Conf. on VLDB: Santiago, Chile. 1994; 487–499. https://doi/10.5555/645920.672836.

Zhao Z, Jian Z, Gaba S, Alroobaea R, Masud M, Rubaiee S. An improved association rule mining algorithm for large data. Int J Intell Syst. 2021; 30(1): 750-762. https://doi.org/10.1515/jisys-2020-0121.

Wu Y, Yuan Z, Li Y, Guo L, Fournier-Viger P, Wu X. NWP-Miner: Nonoverlapping weak-gap sequential pattern mining. Inf Sci. 2022; 588: 124-141. https://doi.org/10.1016/j.ins.2021.12.064.

Wang Y. Internet Medical Privacy Disclosure Mining and Prediction Model Construction Based on Association Rules. Teh. Vjesn. 2022; 29:1: 231-238. https://doi.org/10.17559/TV-20210903062509.

Das A, Jana S, Ganguly P, Chakraborty N. Application of Association Rule: Apriori Algorithm in E-Commerce. In 2021 Innovations in Energy Management and Renewable Resources (IEMRE). 2021; 1-7. https://doi.org/10.1109/IEMRE52042.2021.9386737.

Agrawal R, Imieliński T, Swami A. Mining association rules between sets of items in large databases. In Proc. of the 1993 ACM SIGMOD Int Conf. on Management of data. 1993; 207-216. https://doi.org/10.1145/170035.170072.

Ma H, Ding J, Liu M, Liu Y. Connections between Various Disorders: Combination Pattern Mining Using Apriori Algorithm Based on Diagnosis Information from Electronic Medical Records. Biomed Res Int. 2022; 2022: 1-16. https://doi.org/10.1155/2022/2199317.

Downloads

Published

2023-10-28

How to Cite

1.
A Comparative Study on Association Rule Mining Algorithms on the Hospital Infection Control Dataset. Baghdad Sci.J [Internet]. 2023 Oct. 28 [cited 2024 Apr. 27];20(5(Suppl.). Available from: https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/7571

Similar Articles

You may also start an advanced similarity search for this article.