TY - GEN
T1 - An Explainable Multimodal Framework for Real-Time Bitcoin Forecasting
AU - Badal, Dipesh
AU - Busalim, Abdelsalam
AU - Lee, Donghyeok
N1 - Publisher Copyright:
© 2026 Copyright held by the owner/author(s).
PY - 2026/2/16
Y1 - 2026/2/16
N2 - High-frequency crypto forecasting requires systems that are accurate, explainable, and designed for human decision-making. Bitcoin presents a unique challenge for Human-Centred AI (HCAI) due to its volatility and sensitivity to heterogeneous technical, fundamental, and sentiment signals. This paper presents an explainable multimodal framework for Bitcoin forecasting at 15-minute resolution. We align five modalities - market data, on-chain metrics, the Fear & Greed Index (FGI), news, and Reddit - onto a unified, leakage-safe 15-minute grid. We evaluate tree-based, sequential, and Multimodal Fusion Block (MFB) models for next-interval log-return prediction using chronological splits. Results show that while short-horizon prediction remains challenging, multimodal features consistently improve over structured baselines, particularly during event-driven periods. To ensure transparency, the framework integrates a dual-layer explanation system: SHapley Additive exPlanations (SHAP) attributions combined with large language model (LLM) narratives, ensuring outputs are both technically faithful and human-accessible. This work unlocks the "black box"of complex predictive architectures, transforming opaque multimodal signals into transparent, actionable decision support for high-frequency trading.
AB - High-frequency crypto forecasting requires systems that are accurate, explainable, and designed for human decision-making. Bitcoin presents a unique challenge for Human-Centred AI (HCAI) due to its volatility and sensitivity to heterogeneous technical, fundamental, and sentiment signals. This paper presents an explainable multimodal framework for Bitcoin forecasting at 15-minute resolution. We align five modalities - market data, on-chain metrics, the Fear & Greed Index (FGI), news, and Reddit - onto a unified, leakage-safe 15-minute grid. We evaluate tree-based, sequential, and Multimodal Fusion Block (MFB) models for next-interval log-return prediction using chronological splits. Results show that while short-horizon prediction remains challenging, multimodal features consistently improve over structured baselines, particularly during event-driven periods. To ensure transparency, the framework integrates a dual-layer explanation system: SHapley Additive exPlanations (SHAP) attributions combined with large language model (LLM) narratives, ensuring outputs are both technically faithful and human-accessible. This work unlocks the "black box"of complex predictive architectures, transforming opaque multimodal signals into transparent, actionable decision support for high-frequency trading.
KW - Bitcoin Forecasting
KW - Deep Learning
KW - Explainable AI
KW - Machine Learning
KW - Multimodal Data
KW - SHAP
UR - https://www.scopus.com/pages/publications/105031784723
U2 - 10.1145/3777490.3777508
DO - 10.1145/3777490.3777508
M3 - Conference contribution
AN - SCOPUS:105031784723
T3 - HCAI-ep 2026 - Proceedings of the 2026 Conference on Human Centered Artificial Intelligence - Education and Practice
SP - 100
EP - 106
BT - HCAI-ep 2026 - Proceedings of the 2026 Conference on Human Centered Artificial Intelligence - Education and Practice
PB - Association for Computing Machinery (ACM)
T2 - 3rd International Conference on Human-Centred AI - Education and Practice, HCAI-ep 2026
Y2 - 21 January 2026 through 22 January 2026
ER -