false

Prediction of high-frequency futures return directions based on the mean uncertainty classification methods: An application in China's future market

Prediction of high-frequency futures return directions based on the mean uncertainty classification methods: An application in China’s future market ArXiv ID: 2508.06914 “View on arXiv” Authors: Ying Peng, Yifan Zhang, Xin Wang Abstract In this paper, we mainly focus on the prediction of short-term average return directions in China’s high-frequency futures market. As minor fluctuations with limited amplitude and short duration are typically regarded as random noise, only price movements of sufficient magnitude qualify as statistically significant signals. Therefore data imbalance emerges as a key problem during predictive modeling. From the view of data distribution imbalance, we employee the mean-uncertainty logistic regression (mean-uncertainty LR) classification method under the sublinear expectation (SLE) framework, and further propose the mean-uncertainty support vector machines (mean-uncertainty SVM) method for the prediction. Corresponding investment strategies are developed based on the prediction results. For data selection, we utilize trading data and limit order book data of the top 15 liquid products among the most active contracts in China’s future market. Empirical results demonstrate that comparing with conventional LR-related and SVM-related imbalanced data classification methods, the two mean-uncertainty approaches yields significant advantages in both classification metrics and average returns per trade. ...

August 9, 2025 · 2 min · Research Team

Leveraging Generative Adversarial Networks for Addressing Data Imbalance in Financial Market Supervision

Leveraging Generative Adversarial Networks for Addressing Data Imbalance in Financial Market Supervision ArXiv ID: 2412.15222 “View on arXiv” Authors: Unknown Abstract This study explores the application of generative adversarial networks in financial market supervision, especially for solving the problem of data imbalance to improve the accuracy of risk prediction. Since financial market data are often imbalanced, especially high-risk events such as market manipulation and systemic risk occur less frequently, traditional models have difficulty effectively identifying these minority events. This study proposes to generate synthetic data with similar characteristics to these minority events through GAN to balance the dataset, thereby improving the prediction performance of the model in financial supervision. Experimental results show that compared with traditional oversampling and undersampling methods, the data generated by GAN has significant advantages in dealing with imbalance problems and improving the prediction accuracy of the model. This method has broad application potential in financial regulatory agencies such as the U.S. Securities and Exchange Commission (SEC), the Financial Industry Regulatory Authority (FINRA), the Federal Deposit Insurance Corporation (FDIC), and the Federal Reserve. ...

December 4, 2024 · 2 min · Research Team