Parallel Clustering Algorithms: Segmenting Chinese A-Share Stocks Using Financial Indicators

  • Hai Mo, Central University of Finance and Economics, China
  • Niu Yihan, Central University of Finance and Economics, China
  • Zhang Yuejin, Shanghai Pudong Development Bank, Kunming, China
Keywords: stock market segmentation, financial indicators, K-means clustering, big data analytics, parallel algorithms

Abstract

This study applies parallel clustering algorithms to segment stocks in the Chinese A-share market based on financial indicators. Using the Hadoop platform and the Mahout software library, we implemented the K-means and fuzzy K-means algorithms and compared their performance across five distance measures: Euclidean, squared Euclidean, Manhattan, cosine, and Tanimoto. The analysis used 15 financial indicators from 2,544 listed companies, reflecting profitability, solvency, growth capability, asset management quality, and shareholder profitability. Experimental results show that, for clustering stock financial data, the K-means algorithm achieves its best execution efficiency and clustering quality with the Tanimoto distance, while the fuzzy K-means algorithm performs best with the squared Euclidean distance. Of the two, K-means proved more effective overall: it categorized 1,483 stocks into 26 meaningful segments, compared with only 511 stocks in 27 segments for fuzzy K-means. The resulting segmentation framework divides the market into eight comprehensive categories based on investment value and security, giving investors practical guidance for stock selection. The approach helps investors understand the fundamental characteristics of each stock segment, discern its distinctive features, and identify undervalued stocks with appreciation potential. This research represents the first application of parallel big data clustering algorithms to segment the entire Chinese A-share market, offering significant practical value for investment decision-making.
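To make the comparison in the abstract concrete, the following is a minimal single-machine sketch of the five distance measures and a plain K-means loop that accepts any of them. It is not the Mahout or Hadoop implementation used in the study. The function names, the toy data, and the handling of zero vectors are assumptions made for illustration, and the Tanimoto distance is written in its common form, 1 - a·b / (|a|² + |b|² - a·b).

```python
import math
import random


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def squared_euclidean(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def euclidean(a, b):
    return math.sqrt(squared_euclidean(a, b))


def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def cosine(a, b):
    # Assumption: a zero vector is treated as maximally distant (1.0).
    denom = math.sqrt(_dot(a, a)) * math.sqrt(_dot(b, b))
    return 1.0 - _dot(a, b) / denom if denom else 1.0


def tanimoto(a, b):
    # 1 - a.b / (|a|^2 + |b|^2 - a.b); two zero vectors count as identical.
    ab = _dot(a, b)
    denom = _dot(a, a) + _dot(b, b) - ab
    return 1.0 - ab / denom if denom else 0.0


def kmeans(points, k, distance, max_iterations=50, seed=0):
    """Plain K-means with a pluggable distance (illustrative, not parallel)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(max_iterations):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centroids are too
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


if __name__ == "__main__":
    # Made-up rows standing in for standardized financial indicators.
    toy = [[0.9, 0.1, 0.2], [0.8, 0.2, 0.1], [0.1, 0.9, 0.8], [0.2, 0.8, 0.9]]
    for measure in (euclidean, squared_euclidean, manhattan, cosine, tanimoto):
        labels, _ = kmeans(toy, k=2, distance=measure)
        print(measure.__name__, labels)
```

Passing the distance as a function keeps the clustering loop identical across measures, so any difference in the resulting segments comes from the measure alone. Note that the mean-based centroid update is the standard K-means step and is only guaranteed to reduce the objective for squared Euclidean distance; using it with the other measures is a simplification in this sketch.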

Published
2025-04-30
How to Cite
Hai Mo, Niu Yihan, & Zhang Yuejin. (2025). Parallel Clustering Algorithms: Segmenting Chinese A-Share Stocks Using Financial Indicators. Journal of Systems Engineering and Information Technology (JOSEIT), 4(1). https://doi.org/10.29207/joseit.v4i1.6535
Section
Articles