ISSN :2582-9793

Detecting Social Stock Pumping in the Russian Equity Market Using Machine Learning

Original Research (Published On: 09-Sep-2025 )
DOI : https://doi.org/10.54364/AAIML.2025.53236

Gleb Khaziev

Adv. Artif. Intell. Mach. Learn., 5 (3):4242-4259

1. Gleb Khaziev: National Research University Higher School of Economics (HSE)

Download PDF Here Citation Info via Semantic Scholar

DOI: 10.54364/AAIML.2025.53236

Article History: Received on: 05-Jun-25, Accepted on: 02-Sep-25, Published on: 09-Sep-25

Corresponding Author: Gleb Khaziev

Email: glebhaziev@mail.ru

Citation: Gleb Khaziev. Detecting Social Stock Pumping in the Russian Equity Market Using Machine Learning. Advances in Artificial Intelligence and Machine Learning. 2025; 5(3):236.


Abstract

    

This study investigates the phenomenon of social stock pumping in the Russian equity market and explores effective machine learning models for its detection. Social stock pumping is defined as a market anomaly in which coordinated publications on social media trigger abnormal increases in stock prices and trading volumes without fundamental justification. The paper proposes a methodology for identifying such events based on a combination of behavioral and market indicators. A dataset of 615 social pumping episodes across 104 Russian companies over the period 2019–2025 was constructed. To assess the impact of social media, two proprietary indices were developed: the Russian Social Media Intensive Index (RSMII) and the Russian Social Media Sentiment Index (RSMSI). Five classification models were trained to detect manipulation events: logistic regression, KNN, random forest, SVM and CatBoost. The CatBoost model showed the best performance (AUC-ROC = 0.97, F1 score = 0.91). A comparison with normal trading days confirmed the presence of statistically significant anomalies in prices, trading volumes, and social indicators on pump days. The results demonstrate that machine learning models (particularly CatBoost and KNN) substantially outperform logistic regression in terms of accuracy and recall when detecting social pumping cases. The proposed methodology can be applied by regulators and market participants to monitor informational influence and manage associated risks.

Statistics

   Article View: 1783
   PDF Downloaded: 15