A Study on Prediction of New Followers of Social Media Bloggers Based on Principal Component Clustering and Temporal Difference Regression

Authors

  • Zhiqing Guo
  • Junpeng Yuan
  • Hang Zeng
  • Yiting Qiu
  • Lanqin Wang
  • Jiaxin Huang
  • Chentong Wen
  • Xiaolu Zou
  • Jingmin Lan
  • Tingyan Wang
  • Manni Li
  • Zhe Li

DOI:

https://doi.org/10.54097/628c0y48

Keywords:

Principal component analysis, K-means clustering, temporal difference OLS regression, blogger-user interaction matrix, new follower prediction

Abstract

In recent years, social media platforms have profoundly affected people's social interactions and information acquisition, precisely analysing users' needs to achieve efficient matching of content supply, promoting positive ecological cycles, and ultimately enhancing the competitiveness of platforms and commercial value. This paper focuses on the prediction task of "the number of new followers of each blogger", integrates dynamic Bayesian network and collaborative filtering concepts, constructs the "blogger-user interaction matrix", extracts 9-dimensional sliding window differential features of viewing, liking, commenting, etc. in the past three days, and The high-dimensional user features were compressed to 42 dimensions using principal component analysis. Subsequently, K-means was used to divide the 42 bloggers into three groups, and the time-difference OLS regression models were built separately, and the adjusted R² of the three groups was higher than 0.78, which verified the hypothesis that "similar bloggers have similar attention growth patterns". After the model was tested for robustness and sensitivity, the blogger with the most new followers on 21 July was predicted to be B21 (554 people). The research results can provide a quantitative basis for the platform's accurate push and blogger's operation strategy. This paper is better in data dynamic modelling, reviewing a large amount of literature to establish the best model, with more thorough consideration in the problem. The model passes the robustness test and sensitivity test, and has reference value in platform push.

Downloads

Download data is not yet available.

References

[1] Li Xirui, Lv Bin, Liu Yujie. New passenger flow prediction of traffic and travelling integration service area based on entropy value method [J]. Highway Traffic Science and Technology, 2025, 42(01):203-214.

[2] Liu Yang. PV new installed capacity forecast upward this year [N]. China Securities Journal, 2023-12-16(A05). DOI:10.28162/n.cnki.nczjb.2023.006126.

[3] Zelong Ouyang, Jinqiong Li, Guanju Wang, et al. Prediction of new cases of monkeypox weekly globally based on a time series model [J/OL]. Chinese Journal of Animal Infectious Diseases, 1-9 [2025-07-24]. https://doi.org/10.19958/j.cnki.cn31-2031/s.20230906.004.

[4] Feng Xia, Wang Yao. Future new route discovery based on link prediction [J]. Journal of Beijing University of Aeronautics and Astronautics, 2021, 47(09):1729-1738. DOI:10.13700/j.bh.1001-5965.2020.0335.

[5] Li Yang, Hu Yao, Shang Mingju, et al. New vehicle throughput congestion prediction model [J]. Journal of Guizhou University (Natural Science Edition), 2019, 36(05):21-27. DOI:10.15958/j.cnki.gdxbzrb.2019.05.05.

[6] Li Xiaofei, Wang Bo. Research on the enhancement of Dalian city image communication influence under the new media perspective [J/OL]. Journal of Dalian University, 1-10[2025-07-24]. http://kns.cnki.net/kcms/detail/21.1390.G4.20250717.1526.002.html.

Downloads

Published

27-09-2025

Issue

Section

Articles