Abstract
Existing approaches to social media sentiment analysis typically focus on static classification, offering limited foresight into how public opinion evolves. This study addresses that gap by introducing the Multi-Feature Sentiment-Driven Forecasting (MFSF) framework, a novel pipeline that enhances sentiment trend prediction by integrating rich contextual information from text. Using state-of-the-art transformer models on the Sentiment140 dataset, our framework extracts three concurrent signals from each tweet: sentiment polarity, aspect-based scores (e.g., ‘price’ and ‘service’), and topic embeddings. These features are aggregated into a daily multivariate time series. We then employ a SARIMAX model to forecast future sentiment, using the extracted aspect and topic data as predictive exogenous variables. Our results, validated on the historical Sentiment140 Twitter dataset, demonstrate the framework’s superior performance. The proposed multivariate model achieved a 26.6% improvement in forecasting accuracy (RMSE) over a traditional univariate ARIMA baseline. The analysis confirmed that conversational aspects like ‘service’ and ‘quality’ are statistically significant predictors of future sentiment. By leveraging the contextual drivers of conversation, the MFSF framework provides a more accurate and interpretable tool for businesses and policymakers to proactively monitor and anticipate shifts in public opinion.
| Original language | English |
|---|---|
| Pages (from-to) | 670 |
| Number of pages | 1 |
| Journal | Information |
| Volume | 16 |
| Issue number | 8 |
| Early online date | 6 Aug 2025 |
| DOIs | |
| Publication status | Published - 6 Aug 2025 |
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver