Novel Enhanced Unsupervised Quantum Clustering Algorithm

Authors

  • Diptanshu Sikdar BASIS Independent Silicon Valley
  • Joe Clapis
  • Swetha Bhattacharya BASIS Independent Silicon Valley

DOI:

https://doi.org/10.47611/jsrhs.v11i2.2905

Keywords:

Quantum Computing, Clustering, K-Means, Q-Means, Angle Embedding, Oracle, SWAP Test

Abstract

Every day, humans generate approximately 2.5 billion gigabytes of new data. While this data is crucial in applications such as drug development and cybersecurity, it is mostly unstructured and unlabeled which makes it difficult to process effectively. Businesses and governments alike solve this problem by using machine learning algorithms to characterize the data and make decisions with it. However, conventional machine learning processes consume excessive computational resources and time: one of the most prominent, K-means clustering, has an approximate time complexity of O(n^2), where n is the input data size. This quadratic time complexity causes scalability problems with large datasets. In this project, I have created an improved, more scalable clustering algorithm by leveraging quantum computation. The approach (new Q-means) involves an Angle Embedding Feature Map, an d-dimensional SWAP Test, and a dynamic, threshold distance-based termination criteria. After implementing this algorithm using the IBM Qiskit SDK, the new Q-means algorithm was run several times on different dimensional datasets. On an average, the new Q-means performed 18% better than the K-means algorithm based on the Adjusted Rand Index. In terms of the simulation execution time, the new Q-means was about six times faster than the existing Q-means due to a multi-threaded implementation and faster convergence. The time complexity of the new Q-means without a Quantum Random Access Memory (QRAM) is O(n^2). While the existing Q-means algorithm, which utilizes QRAM, has a time complexity of O(n(log n)^p), the time complexity of the new Q-means with QRAM is significantly better—O(n).

Downloads

Download data is not yet available.

References or Bibliography

Aimeur, E., Brassard, G., & Gambs, S. (2012, August 31). Quantum Speed-up for unsupervised learning - machine learning. SpringerLink. https://doi.org/10.1007/s10994-012-5316-5

Kerenidis, I., Landman, J., Luongo, A., & Prakash, A. (2018, December 12). Q-means: A quantum algorithm for unsupervised machine learning. arXiv. https://doi.org/10.48550/arXiv.1812.03584

Li, J., & Kais, S. (2021, June 13). Quantum cluster algorithm for Data Classification. arXiv. https://doi.org/10.48550/arXiv.2106.07078

MJV Team. (2021, September 23). Types of clustering: Why is it so important for business? MJV Technology & Innovation. Retrieved February 29, 2022, from https://www.mjvinnovation.com/blog/types-of-clustering/

Neukart, F., Dollen, D. V., & Seidel, C. (2018, June 14). Quantum-assisted cluster analysis on a quantum annealing device. Frontiers. https://doi.org/10.3389/fphy.2018.00055

Nielsen, M. A., & Chuang, I. L. (2018). Quantum Computation and Quantum Information: 10th anniversary ed. Cambridge University Press.

Pakhira, M. K. (2014). A linear time-complexity K-means algorithm using cluster shifting. IEEE Xplore. https://doi.org/10.1109/CICN.2014.220

Preskill, J. (2021, June 25). Quantum Computing 40 years later. arXiv. https://doi.org/10.48550/arXiv.2106.10522

Wikipedia Staff. (2022, April 26). Elbow method (clustering). Wikipedia. Retrieved March 13, 2022, from https://en.wikipedia.org/wiki/Elbow_method_(clustering)

Wikipedia Staff. (2022, February 10). SWAP Test. Wikipedia. Retrieved February 9, 2022, from https://en.wikipedia.org/wiki/Swap_test

Xie, X., Duan, L., Qiu, T. et al. (2021, July 30). Quantum algorithm for MMNG-based DBSCAN. Nature. https://doi.org/10.1038/s41598-021-95156-7

Zhang, Y., Lu, K., Gao, Y., & Wang, M. (2013, March 26). NEQR: A novel enhanced quantum representation of digital images. SpringerLink. https://doi.org/10.1007/s11128-013-0567-z

Published

05-31-2022

How to Cite

Sikdar, D., Clapis, J., & Bhattacharya, S. (2022). Novel Enhanced Unsupervised Quantum Clustering Algorithm. Journal of Student Research, 11(2). https://doi.org/10.47611/jsrhs.v11i2.2905

Issue

Section

HS Research Projects