Document Type : Research Paper

Authors

1 Ph.D. Candidate in Economics, Semnan University, Semnan, Iran

2 Professor, Economics Department, Semnan University, Semnan, Iran

3 Associate Professor, Economics Department, Semnan University, Semnan, Iran

Abstract

The exchange rate is recognized as a key economic indicator influenced by multiple factors. Some of these factors manifest as measurable economic variables, while others are reflected in political and financial news. A central, unresolved question is whether it is possible to develop a comprehensive and scalable model for exchange rate modeling and forecasting that accounts for all relevant variables and factors. Using a data fusion approach, the present study proposed a comprehensive deep learning–based model supporting multiple data types. To train the model, exchange rate–related news was collected from ten major national and international sources covering the period from 2014 to 2023 (1393–1402 in the Iranian calendar). The data was then combined with exchange rate figures and other economic indicators. To identify the best model, eight machine learning models, two statistical models, and one large language model were trained and evaluated under both regression and classification settings. To mitigate bias and random effects, the study applied time series–aware cross-validation along with repeated training and testing using different random initializations. The results demonstrated that the proposed approach, which directly incorporates all influential factors, significantly outperforms existing methods.

Introduction

Exchange rate fluctuations represent one of the most complex challenges in modern economic analysis, shaped by a dynamic interplay of macroeconomic fundamentals, policy decisions, and informational signals disseminated through the media. Traditional econometric approaches often fail to capture these multidimensional interactions, as they rely primarily on quantitative variables and lagged historical data. As a result, they tend to overlook the qualitative influence of news, market sentiment, and expectations that often precede measurable economic changes. Recent advances in artificial intelligence and machine learning have introduced powerful tools for integrating diverse forms of data—both numerical and textual—into unified predictive systems. The present research tried to propose a comprehensive and extensible model for forecasting exchange rates in Iran, combining structured economic indicators with unstructured news data through a data fusion approach.

Materials and Methods

This study employed a quantitative and applied methodology based on supervised machine learning techniques. The dataset spans the period from April 2014 to March 2023 (1393–1402 in the Iranian calendar). Daily free-market exchange rates were obtained from three verified sources: the National Exchange website, the Gold and Currency Information Network, and the Bonbast platform. Additionally, key macroeconomic indicators—including GDP growth, inflation rate, unemployment rate, trade balance, public debt, foreign reserves, and oil prices—were collected from official statistical repositories. Then the study went on to incorporate qualitative dimensions. In this respect, news articles related to exchange rate dynamics were gathered from ten major national and international media outlets, including Donya-e-Eqtesad, San’at-Madan-Tijarat, Asia Daily, ISNA, Khabaronline, Tabnak, BBC Persian, and Voice of America Persian. Each news item was labeled according to the contemporaneous changes in exchange rates. Data preprocessing involved normalization, outlier removal, and interpolation of missing values for numerical data. Textual data underwent cleaning, tokenization, and embedding using the ParsBERT model (Farahani et al., 2021), which was fine-tuned on domain-specific economic texts to improve contextual representation. Following preprocessing, approximately 388,354 fused samples were constructed. Eight machine learning models (Random Forest, XGBoost, LightGBM, CNN-LSTM, GRU, Bi-GRU, LSTM, and Bi-LSTM), two statistical models (ARIMA and Prophet), and one large language model (GPT-4) were trained and compared under both regression and classification settings. Model evaluation was conducted through time-series–aware cross-validation and repeated random initialization to minimize bias. Performance metrics included Mean Absolute Error (MAE), Mean Squared Error (MSE), Accuracy, and F1-score.

Results and Discussion

The results revealed that models integrating textual and numerical data substantially outperform those trained solely on numerical inputs. Specifically, the inclusion of news embeddings reduced forecasting error by more than 5% across most deep learning architectures. Among the evaluated models, the fine-tuned GPT-4 achieved the highest overall accuracy and the lowest error metrics in both regression and classification tasks. However, considering constraints on interpretability and data security, the Bi-directional Gated Recurrent Unit (Bi-GRU) model was identified as the optimal choice for practical implementation. The Bi-GRU model exhibited strong learning capability in capturing temporal dependencies and contextual relationships between macroeconomic variables and market sentiment. In classification mode, it achieved an F1-score of 0.84 and an accuracy rate of 0.86 when textual data were incorporated. In contrast, traditional statistical models such as ARIMA and Prophet showed limited capacity to reflect short-term market shocks influenced by real-time news.
The findings highlighted the importance of data fusion in financial forecasting. Textual news data provide early signals of market sentiment that often precede observable changes in economic variables. By integrating these heterogeneous data sources, the proposed model can offer a more dynamic and responsive forecasting framework, particularly suited to volatile markets such as Iran’s foreign exchange sector.

Conclusion

This study proposed a comprehensive machine learning–based model that successfully integrates textual and numerical data for exchange rate forecasting in Iran. The results confirmed that data fusion enhances predictive accuracy and robustness, outperforming both conventional econometric methods and single-modality deep learning models. Among the evaluated architectures, Bi-GRU offered the most practical balance between performance, interpretability, and computational efficiency. The findings underscored that incorporating news-driven sentiment and contextual information provides a timely advantage for policy formulation and risk management. Moreover, the modular structure of the proposed model allows for future extensions to other economic domains such as stock market analysis and inflation forecasting. Future studies are recommended to expand the dataset to include social media sentiment and to adopt explainable AI (XAI) techniques to improve interpretability and transparency.

Keywords

Main Subjects

Abounouri, A., Khanalipour, A. & Abbasi, J. (2009). The effect of news on exchange rate volatility in Iran: An application of the ARCH family. Pajouheshnameh Bazargani, 50, 101-120. [In Persian] dor:20.1001.1.17350794.1388.13.50.4.8
Amiri, Farhad, Derakhshani-Daraabi, Kaveh, & Asayesh, Hamid. (2020).
Investigation the Effects of Exchange Rate Fluctuations on the Sub-Sectors Value Added in Iran. Iranian Journal of Applied Economic Studies, 10(39), 247–267. [In Persian]
Bayat, N. (2018). Forecasting exchange rates using recurrent self-organizing maps. Economics and Modern Trade, 13, 55-84. [In Persian]
Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J.D., Dhariwal, P., ... & Amodei, D. (2020). Language models are few-shot learners. Advances in Neural Information Processing Systems, 33, 1877-1901. doi:10.48550/arXiv.2005.14165
Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. (2018). BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv: 1810.04805. doi:10.48550/arXiv.1810.04805
Dridan, R. & Oepen, S. (2012). Tokenization: Returning to a long solved problem—a survey, contrastive experiment, recommendations, and toolkit. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers).
Farahani, M., Gharachorloo, M., Farahani, M. & Manthouri, M. (2021). ParsBERT: Transformer-based model for Persian language understanding. Neural Processing Letters, 53, 3831-3847.
Fatahi, Sh., Ahmadi, A. & Akram-Mirzaei, A. (2013). Comparison of the accuracy of genetic algorithm methods with other exchange rate forecasting methods. Economic Studies and Policies, 96, 113-136. [In Persian]. doi:10.22096/esp.2013.26156
Goudarzi Farahani, G., Adeli, O.A. & Ghorbani, M. (2020). The impact of economic policy uncertainty on exchange rate volatility using the NARDL model approach. Econometric Modeling, 5(4), 147-171. [In Persian]. doi:10.22075/jem.2021.22243.1547
Hashemi-Dizaj, A., Hazari-Neiri, H. & Pourvohedani, R. (2020). Comparing the performance of artificial neural network models for forecasting exchange rates in Iran. Scientific Biannual Journal of Economic Studies and Policies, 7(2), 53-80. [In Persian]. doi:10.22096/esp.2020.43397
Khashaei, Mehdi, Bijari, Mehdi, & Mokhatab-Rafi‘i, Farimah. (2013).
Forecasting the Exchange Rate Using Hybrid Models of Multilayer Perceptrons (MLPs) and Probabilistic Neural Network Classifiers (PNNs). Numerical Methods in Engineering Quarterly, 32(1), 1–14. [In Persian]
Khodaveisi, H. & Molabahrami, A. (2012). Modeling and forecasting exchange rates based on stochastic differential equations. Economic Research, 100(47), 129-144. [In Persian]. doi:10.22059/jte.2012.29257
Heaton, J. (2016). An empirical analysis of feature engineering for predictive modeling. SoutheastCon, 1-6. doi:10.48550/arXiv.1701.07852
Josse, J. & Husson, F. (2012). Handling missing values in exploratory multivariate data analysis methods. Journal de la Société Française de Statistique, 153(2), 79-99.
Khan, W., Daud, A., Khan, K., Muhammad, S. & Haq, R. (2023). Exploring the frontiers of deep learning and natural language processing: A comprehensive overview of key challenges and emerging trends. Natural Language Processing Journal, 100026. doi:10.1016/j.nlp.2023.100026
Li, Y. & Yang, T. (2018). Word embedding for understanding natural language: A survey. Guide to Big Data Applications, 83-104. doi:10.1007/978-3-319-53817-4
Liu, J., Li, T., Xie, P., Du, S., Teng, F. & Yang, X. (2020). Urban big data fusion based on deep learning: An overview. Information Fusion, 53, 123-133. doi:10.1016/j.inffus.2019.06.016
Mansouri-Gorgori, H. & Khodavisi, H. (2019). Exchange rate forecasting: Comparison of logistic growth models with competing models. Economics and Modeling, 10, 141-179. [In Persian]. doi:10.48308/ecoj.10.3.157
Marzban, D.H., Javaheri, B.B. & Akbarian, A. (2005). A comparison between structural econometric models, time series, and neural networks for exchange rate forecasting. Economic Research Journal, 40(2). [In Persian]. dor: 20.1001.1.00398969.1384.40.2.8.6
Mikolov, T., Chen, K., Corrado, G. & Dean, J. (2013). Efficient estimation of word representations in vector space. arXiv preprint arXiv: 1301.3781. doi:10.48550/arXiv.1301.3781
Mohamadi, S., Mujtaba, G., Le, N., Doretto, G. & Adjeroh, D.A. (2023). ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey. arXiv preprint arXiv: 2307.04251.
Nargesian, F., Samulowitz, H., Khurana, U., Khalil, E.B. & Turaga, D.S. (2017). Learning feature engineering for classification. IJCAI, 2529-2535. doi:10.24963/ijcai.2017/352
Potdar, K., Pardawala, T.S. & Pai, C.D. (2017). A comparative study of categorical variable encoding techniques for neural network classifiers. International Journal of Computer Applications, 175(4), 7-9.
Rahimi-Boroujerdi, A. (2000). The optimal exchange system and forecasting exchange rates for Iran's economy. Iranian Economic Research, 5, 40-45. [In Persian].
Seabe, P.L., Moutsinga, C.R.B. & Pindza, E. (2023). Forecasting cryptocurrency prices using LSTM, GRU, and bi-directional LSTM: A deep learning approach. Fractal and Fractional, 7(2), 203.
Sharif-Moghaddam, Sh. & Hashemi, S.A. (2018). Forecasting euro to dollar exchange rate using artificial neural networks. Financial Engineering and Securities Management, 9(37), 399-413. [In Persian].
Shirazi, Homayoun, & Nasrollahi, Khadijeh. (2013). Monetary Models and Exchange Rate Forecasting in Iran: From Theory to Empirical Evidence. Financial and Economic Policies Quarterly, 1(4), 5–24. [In Persian]
Singh, D. & Singh, B. (2020). Investigating the impact of data normalization on classification performance. Applied Soft Computing, 97, 105524. doi:10.1016/j.asoc.2019.105524
Sousa, M.G., Sakiyama, K., de Souza Rodrigues, L., Moraes, P.H., Fernandes, E.R. & Matsubara, E.T. (2019). BERT for stock market sentiment analysis. 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI), 1597-1601. doi.org:10.1109/ICTAI.2019.00231
Taghavi, M. & Khodam, M. (2011). Comparative efficiency of exchange rate theories in predicting exchange rate changes in the international foreign exchange market. Financial Knowledge of Securities Analysis (Financial Studies), 9, 147-192. [In Persian].
Tayebi, S., Moheddinia, N. & Kazemini, M. (2008). Utilizing artificial neural networks in forecasting economic variables and comparison with econometric methods: Forecasting exchange rate trends in Iran. Industrial Engineering and Management (Sharif Special Edition for Engineering Sciences), 43, 104-199. [In Persian]
Villamil, L., Bausback, R., Salman, S., Liu, T.L., Horn, C. & Liu, X. (2023). Improved stock price movement classification using news articles based on embeddings and label smoothing. arXiv preprint arXiv:2301.10458. doi:10.48550/arXiv.2301.10458
Yarmohammadi, M. & Mahmoodvand, R. (2015). Forecasting exchange rates using singular value decomposition analysis. Journal of Economics, 18, 137-146. [In Persian].
Zaranejad, M., Fagh-Majidi, A. & Rezaei, R.A. (2008). Exchange rate forecasting using artificial neural networks and ARIMA models. Quantitative Economics (Economic Studies), 19, 107-130. [In Persian]