Steam Game Data Collection and Visualization Based on Python Crawlers
DOI:
https://doi.org/10.54097/6ykd9269Keywords:
Python crawler, Steam games, Data cleaning, Visualization and analysisAbstract
With the booming development of digital game market, Steam platform, as the world's largest digital game distribution and sales platform, covers a huge amount and multi-dimensional game data. In this paper, based on Python crawler technology, we collect and organize the information of thousands of games on Steam platform, such as price, ratings, reviews, genres and tags, as well as release time. By visualizing and analyzing the game price distribution, the correlation of ratings and reviews, the characteristics of genres and popular tags, and the trend of release time and other dimensions, we reveal the price structure of the Steam game market, the pattern of user ratings, the popular genres and tags preferences, and the development dynamics of the game industry. The results of this paper not only help game developers to accurately grasp the market pricing strategy, understand the positioning and audience characteristics of games in different price ranges, but also provide consumers with more intuitive reference for their purchases, and at the same time, provide academics with empirical research cases for the digital game market.
Downloads
References
[1] De Luisa, A., Hartman, J., Nabergoj, D., Pahor, S., Rus, M., Stevanoski, B., Demšar, J., & Štrumbelj, E. (2021). Predicting the Popularity of Games on Steam. arXiv preprint arXiv:2110.02896. https://arxiv.org/abs/2110.02896
[2] Cunha, L. R., Pessa, A. A. B., & Mendes, R. S. (2024). Shape patterns in popularity series of video games. arXiv preprint arXiv:2406.10241. https://arxiv.org/abs/2406.10241
[3] Batra, S., Sharma, V., Sun, Y., Wang, X., & Wang, Y. (2023). Steam Recommendation System. arXiv preprint arXiv:2305.04890. https://arxiv.org/abs/2305.04890
[4] YANG Fu-Xiang, LIU Yun-Chao. An Overview of Data Cleaning [J]. Computer Application Research, 2002, 19 (3): 1-4.
[5] Guo Zhimao, Zhou Aoying. A review of research on data quality and data cleaning [J]. Journal of Software, 2002, 13 (11): 2019-2026.
[6] LIU Fang, HE Fei. Research on Data Cleaning Based on Cluster Analysis Technique [J]. Computer Engineering and Science, 2005, 27 (6): 29-32.
[7] Liang Wenbin. Research on Data Cleaning Technology and Its Application [D]. Suzhou University, 2005.
[8] Hernandez M A, Stolfo S J. The Merge/Purge Problem for Large Databases [A]. ACM SIGMOD International Conference on Management of Data [C]. 1995: 127-138.
[9] Zhou Yixin. Research and Application of Data Cleaning Algorithm [D]. Qingdao University, 2005.
[10] P. Zhang, P. Dang Election. Similar Duplicate Record Detection Based on Entropy Feature Preferred Group Clustering [J]. Sensors and Microsystems, 2011, 30 (11): 45-48.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Journal of Computer Science and Artificial Intelligence

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.








