 
             The Author ensures that the research has been conducted responsibly and ethically with adherence to all relevant regulations. read more..
 COVID-19
 COVID-19 
1DIMES, Engineering Department of Informatics Modelling Electronics and Systems Science University of Calabria, Italy
2CNR - National Research Council of Italy - Institute for High Performance Computing and Networking (ICAR), Italy
*Corresponding author:Libero Nigro, DIMES–Engineering Department of Informatics Modelling Electronics and Systems Science University of Calabria, 87036 Rende, Italy
Submission: August 08, 2023; Published: August 25, 2023
 
	
	ISSN 2578-0247Volume3 Issue2
This paper proposes an algorithm, named HWK-Sets, based on K-Means, suited for clustering data which are variable-sized sets of elementary items. An example of such data occurs in the analysis of medical diagnosis, where the goal is to detect human subjects who share common diseases so as to predict future illnesses from previous medical history possibly. Clustering sets is difficult because data objects do not have numerical attributes and therefore it is not possible to use the classical Euclidean distance upon which K-Means is normally based. An adaptation of the Jaccard distance between sets is used, which exploits application-sensitive information. More in particular, the Hartigan and Wong variation of K-Means is adopted, which can favor the fast attainment of a careful solution. The HWK-Sets algorithm can flexibly use various stochastic seeding techniques. Since the difficulty of calculating a mean among the sets of a cluster, the concept of a medoid is employed as a cluster representative (centroid), which always remains a data object of the application. The paper describes the HWK-Sets clustering algorithm and outlines its current implementation in Java based on parallel streams. After that, the efficiency and accuracy of the proposed algorithm are demonstrated by applying it to 15 benchmark datasets.
Keywords: Clustering sets; Hartigan and Wong K-Means; Jaccard distance; Medoids; Seeding methods; Java parallel streams
Abbreviations: SDH: Sum of Distances to Histogram; ARI: Adjusted Rand Index; SDM: Sum of Distances to Medoids; CI: Centroids Index; SI: Silhouette Index
Ph.D in Agriculture from Faculty of Agriculture, Tohoku University
 
						Research Professor, PhD, Holistic Research Institute
 
						Professor, Chief Doctor, Director of Department of Pediatric Surgery, Associate Director of Department of Surgery, Doctoral Supervisor Tongji hospital, Tongji medical college, Huazhong University of Science and Technology
Senior Research Engineer and Professor, Center for Refining and Petrochemicals, Research Institute, King Fahd University of Petroleum and Minerals (KFUPM), Dhahran, Saudi Arabia
 
						Fellow of International Agency for Standards and Ratings (IASR), Edith Cowan University, Sarich Neuroscience Research Institute
 
						Chancellor Emeritus / Professor Emeritus of Chemistry and Physics, University of Missouri–St. Louis
.jpg) 
						Ph.D in Science from the Federal University of Alagoas, UFAL, Brazil
 
						Assistant Professor in College of Architecture, Art and Design
 
						Interim Dean, College of Education and Health Sciences, Director of Biomechanics Laboratory, Sport Science Innovation Program, Bridgewater State University
 
						Professor of numerous training courses in Family Medicine
Assistant Professor, Department of Electronics and Computer Science
 
						Emeritus Professor of Physics, Kadir Has University, Turkey
 Editorial Board Registrations
								 Editorial Board Registrations
								
							 Submit your Article
							 Submit your Article Best Paper of the Volume
							 Best Paper of the Volume Reprints
							 Reprints Refer a Friend
							 Refer a Friend
							 Advertise With Us
							
						 Advertise With Us
						 
									Wenzhou Medical University, China
 
									Fooyin University, Taiwan
 
									Saglik Bilimleri University, Turkey
Vincent Pol University, Poland
.jpg) 
									National Defence University of Malaysia, Malaysia
 
									Dogus University, Turkey
 
									Hope College, USA
 
									Russian Academy of Sciences, Russia
 
									Southern Cross University, Australia
 
									Umm Al-Qura University, Saudi Arabia
 
									City University of New York, USA
 
									Khalifa University of Science & Technology, United Arab Emirates
 
									Prince of Songkla University, Thailand
Hebei Normal University, China
Alexandria University, Egypt
 
									Indian Institute of Technology Kharagpur, India
 
									Council for Agriculture Research and Analysis of Agri Economy (CREA), Italy
 
									King Fahd University of Petroleum and Minerals, Saudi Arabia
Universiti Teknologi MARA, Malaysia
 
									King Abdulaziz University, Saudi Arabia
 
									University of Oregon, USA
 
									University of Edinburgh, Scotland
.bmp) 
									University of Tennessee, USA
.jpg) 
									Central University of Venezuela, Venezuela
.png) 
									Islamic Azad University Central Tehran Branch, Iran
.jpg) 
									Tourin University, Italy
 
									Teaching & Public Speaking, Spain
Paeditric Hospital "Giovanni XXIII", Italy
 
									General Chemical State Laboratory , Greece
 
									University of Nicosia, Cyprus
 
									Universidad Miguel Hernández de Elche, Spain
 
									Oral Roberts University, USA
 
									Beijing Normal University, China
 
									Howard University, USA
Edith Cowan University, Australia
 
									Dubai Health Authority, UAE
 
									University of Minnesota, USA
Indian Institute of Technology Kharagpur, India
 
									Serhal Hospital, Lebanon
.jpg) 
									University of Missouri-St. Louis , USA
 
									University of Malta, Malta
 
									National Center for Global Health and Medicine, Japan
 
									Molloy College, USA
 
									Federal University of Piauí, Brazil
 
									Krankenhaus Nordwest Hospital, Germany
 
									Belgorod State University, Russia
.png) 
									Laval University, Canada
 
									Cinvestav-Unidad Saltillo, Mexico
.png) 
									UPMC Hamot Neuroscience Institute, USA
 
									Ramon Llull University, Spain
White Bear Associates, LLC, USA
 
									Lehigh University, USA
 
									California Southern University, USA
.png) 
									Institute of Solid State Physics of RAS, Russia
 
									University of Buenos Aires, Argentina
Mansoura University, Egypt
 
									King Saud University, Saudi Arabia
 
									University of Coimbra, Portugal
 a Creative Commons Attribution 4.0 International License. Based on a work at www.crimsonpublishers.com.
							
							
							Best viewed in
   a Creative Commons Attribution 4.0 International License. Based on a work at www.crimsonpublishers.com.
							
							
							Best viewed in  
							 | Above IE 9.0 version
| Above IE 9.0 version
							
						
