Steam Game Data Collection and Visualization Based on Python Crawlers

Siman Wang; Yi Xu; Sai Li; Zhihan Zou; Juan Li

doi:10.54097/6ykd9269

Authors

Siman Wang
Yi Xu
Sai Li
Zhihan Zou
Juan Li

DOI:

https://doi.org/10.54097/6ykd9269

Keywords:

Python crawler, Steam games, Data cleaning, Visualization and analysis

Abstract

With the booming development of digital game market, Steam platform, as the world's largest digital game distribution and sales platform, covers a huge amount and multi-dimensional game data. In this paper, based on Python crawler technology, we collect and organize the information of thousands of games on Steam platform, such as price, ratings, reviews, genres and tags, as well as release time. By visualizing and analyzing the game price distribution, the correlation of ratings and reviews, the characteristics of genres and popular tags, and the trend of release time and other dimensions, we reveal the price structure of the Steam game market, the pattern of user ratings, the popular genres and tags preferences, and the development dynamics of the game industry. The results of this paper not only help game developers to accurately grasp the market pricing strategy, understand the positioning and audience characteristics of games in different price ranges, but also provide consumers with more intuitive reference for their purchases, and at the same time, provide academics with empirical research cases for the digital game market.

Downloads

Download data is not yet available.

References

[1] De Luisa, A., Hartman, J., Nabergoj, D., Pahor, S., Rus, M., Stevanoski, B., Demšar, J., & Štrumbelj, E. (2021). Predicting the Popularity of Games on Steam. arXiv preprint arXiv:2110.02896. https://arxiv.org/abs/2110.02896

[2] Cunha, L. R., Pessa, A. A. B., & Mendes, R. S. (2024). Shape patterns in popularity series of video games. arXiv preprint arXiv:2406.10241. https://arxiv.org/abs/2406.10241

[3] Batra, S., Sharma, V., Sun, Y., Wang, X., & Wang, Y. (2023). Steam Recommendation System. arXiv preprint arXiv:2305.04890. https://arxiv.org/abs/2305.04890

[4] YANG Fu-Xiang, LIU Yun-Chao. An Overview of Data Cleaning [J]. Computer Application Research, 2002, 19 (3): 1-4.

[5] Guo Zhimao, Zhou Aoying. A review of research on data quality and data cleaning [J]. Journal of Software, 2002, 13 (11): 2019-2026.

[6] LIU Fang, HE Fei. Research on Data Cleaning Based on Cluster Analysis Technique [J]. Computer Engineering and Science, 2005, 27 (6): 29-32.

[7] Liang Wenbin. Research on Data Cleaning Technology and Its Application [D]. Suzhou University, 2005.

[8] Hernandez M A, Stolfo S J. The Merge/Purge Problem for Large Databases [A]. ACM SIGMOD International Conference on Management of Data [C]. 1995: 127-138.

[9] Zhou Yixin. Research and Application of Data Cleaning Algorithm [D]. Qingdao University, 2005.

[10] P. Zhang, P. Dang Election. Similar Duplicate Record Detection Based on Entropy Feature Preferred Group Clustering [J]. Sensors and Microsystems, 2011, 30 (11): 45-48.

Steam Game Data Collection and Visualization Based on Python Crawlers

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

License

Cover

Indexing & Abstracting