Comparing Clustering Algorithms using Financial Time-series data Author: Duangrux Tangsirisakul

Comparing Clustering Algorithms using Financial Time-series data

Author (s): 

Duangrux Tangsirisakul

Abstract

Data clustering is one of the most popular unsupervised machine learning approaches.  Clustering data can help identify the pattern of what seems to be similar data and leads to the best solution for all commercial problems. For example, taxi booking application, customer’s data can be clustered to match supply with demand, to detect fraud pattern of an e-commerce transaction or clustering customers in dating application, etc. In order to carry out the best calculation of clustering certain requirement is needed in each method and approach such as the basic assumption of data. When analyzing data with a wrong assumption, it results in low-quality outcomes. So we would like to study and compare this type of data in an in-depth manner. Time-series analysis is used in many future prediction tasks based on previously observed values, mixing cluster analysis and time-series data to serve the initial purpose that researcher would like to share to the public for better understanding of the clustering, researcher would also like following researchers to refer to this work and develop this theory and apply in wider issues in future. In this paper, the focus is on comparing time-series clustering algorithm with financial time-series data, which is common data such as cryptocurrency, exchange rate currency, the Shanghai Stock Exchange (SSE50), and the stock exchange of Thailand 50 (SET50). The paper introduces the importance of data mining, machine learning, and time-series clustering and some related methods, which lays a theoretical foundation for the formal research of this paper. By analyzing the structure of time-series clustering, that consists of several parts, including distance measurement, time-series prototype, a clustering algorithm, and clustering evaluation. From research result, the hierarchical algorithm is the most efficient algorithm for unequal length of cryptocurrency series and SSE 50. In another hand, the partitional algorithm is the most efficient for an equal length of exchange rate currency and SET 50.

Key words: time-series clustering, machine learning, dynamic time warping, crypto-currency

Download PDF

Title:

Comparing Clustering Algorithms using Financial Time-series data

Author:

Duangrux Tangsirisakul

Journal Name: International Journal of Science and Business
Website: ijsab.com
ISSN: ISSN 2520-4750 (Online), ISSN 2521-3040 (Print)
DOI: https://doi.org/10.5281/zenodo.2617341
Media: Online
Volume: 3
Issue: 2
Acceptance Date: 27/03/2019
Date of Publication: 31/03/2019
PDF URL: http://ijsab.com/wp-content/uploads/334.pdf
Free download: Available
Page: 146-166
First Page: 146
Last Page: 166
Current Status: Published

Cite This Article:

Tangsirisakul, D.  (2019). Comparing Clustering Algorithms using Financial Time-series data. International Journal of Science and Business, 3(2), 146-166. doi: https://doi.org/10.5281/zenodo.2617341

Retrieved from http://ijsab.com/wp-content/uploads/334.pdf

 

About Author

Duangrux Tangsirisakul, (Corresponding Author) Department of Mathematics, China University of Mining and Technology, Jiangsu, China.

 

 Download PDF

DOI: https://doi.org/10.5281/zenodo.2617341

Follow us

This Post Has Been Viewed 43 Times