Bachelorarbeit, 2022
58 Seiten, Note: 1,0
1 Introduction
2 Method
2.1 State Space Modelling Approach
2.1.1 Linear Gaussian State Space Model
2.1.2 ARIMA State Space Model
2.1.3 SARIMA State Space Model
2.1.4 (S)ARIMAX - Regression with (S)ARIMA errors
2.1.5 Kalman Filter
2.2 No-Change Baseline Model
2.3 Evaluation Metrics
2.3.1 TADDA
2.3.2 Mean Absolute Error (MAE)
2.3.3 Root Mean Square Error (RMSE)
3 Data
3.1 Armed Conflict Location and Event Data Project (ACLED)
3.1.1 Analysis of Conflict Incidence and Country Categorization
3.2 International Monetary Fund - World Economic Outlook Database (IMF-WEO)
3.3 World Bank - World Development Indicators (WB-WDI)
3.4 Variable Overview and Missing Data
4 Implementation in Python
4.1 No-Change Forecaster Class
4.2 State Space ARIMA Forecaster Class
4.3 Grid Search Class
4.4 Automated Model Building Process
5 Results
5.1 Global Model Performances
5.2 Country-Level Model Performances
6 Conclusion
This thesis addresses the "heuristic crisis" in conflict prediction by developing a fully-automated machine learning framework that optimizes econometric state space ARIMA methods to forecast conflict fatalities. By comparing a portfolio of model variants—ranging from standard ARIMA to ARIMAX models incorporating socio-economic and neighbor-conflict indicators—the research aims to create a data-driven system capable of consistently outperforming naive no-change heuristics in predictive accuracy.
2.1 State Space Modelling Approach
In contrast to the classical Box-Jenkins modelling of ARIMA processes, I rely on the state space approach, which allows the formulation of equivalent state space ARIMA equations that can be converted back to the classical ARIMA equations. The State Space approach is kept very general and allows the modeling of any kind of system, which is why it has a wide range of applications in the natural sciences as well as in engineering. For time series forecasting, state space modeling allows a reinterpretation of the underlying problem as the evolution of a system over time: Instead of directly modeling the time series of an observed variable yt, yt is linked to the unknown internal states αt of the system and then the temporal state evolution of the system is modeled. The training of a state space model corresponds to learning the unknown state components based on the observation time series yt. During forecasting, an estimate for a future state can be generated based on all previous states, which in turn can be used to derive the corresponding future value of the observed variable.
1 Introduction: Outlines the urgent challenge of conflict prediction, the limitations of current naive heuristics, and the potential for a more sophisticated, automated state space ARIMA approach.
2 Method: Provides the mathematical foundation for the automated state space modelling framework and the different ARIMA, SARIMA, and ARIMAX model specifications, including the Kalman filter implementation.
3 Data: Describes the three main data sources—ACLED, IMF-WEO, and World Bank WDI—used to generate conflict history indicators and socio-economic exogenous variables.
4 Implementation in Python: Details the modular object-oriented Python implementation, covering the preprocessing pipeline, grid search cross-validation, and the automated model building process.
5 Results: Analyzes the predictive performance of the 16 state space ARIMA variants at both global and country levels, highlighting the significance of monthly vs. quarterly data and the impact of exogenous variables.
6 Conclusion: Summarizes the key findings, confirming that properly tuned time series models can compete with or outperform state-of-the-art conflict prediction models, and discusses directions for future research.
Conflict Prediction, State Space ARIMA, Machine Learning, ACLED, Econometrics, Crisis Early Warning, TADDA Metric, Time Series Analysis, Exogenous Predictors, Spill-over Effect, Hyperparameter Optimization, Kalman Filter, Data-Driven Forecasting, Conflict Fatalities, Development in Reverse.
The work focuses on developing an automated, data-driven framework to improve conflict prediction by using state space ARIMA methods to forecast the number of conflict fatalities at the country level.
Key themes include the comparison of different ARIMA-based model configurations, the impact of socio-economic and neighborhood-conflict indicators on forecast accuracy, and the evaluation of models against naive baseline heuristics.
The primary goal is to exceed the predictive accuracy of the simple "no-change" model—which has historically been difficult for sophisticated models to beat—in the context of conflict escalation and de-escalation.
The research uses econometric state space ARIMA methods in a pipeline that includes imputation, standardization, and principal component analysis (PCA), combined with an extending window cross-validation for hyperparameter tuning.
The main body details the theoretical state space framework, the data preparation from ACLED and socio-economic sources, the modular Python implementation of the modeling pipeline, and an extensive evaluation of model performances.
Core keywords include Conflict Prediction, State Space ARIMA, Machine Learning, ACLED, Econometrics, Crisis Early Warning, and the TADDA metric.
The research finds that monthly models perform significantly better than quarterly models because quarterly aggregation leads to a substantial loss of information and increases uncertainty in the model coefficients.
The research implements a strict pipeline where independent preprocessing (imputation, standardization, and PCA) is performed separately on training and test sets within an extending window cross-validation to prevent information from the future from influencing the model training.
The thesis finds that the rigid implementation of annual seasonality used in this research actually interferes with prediction accuracy, likely acting as noise rather than helpful seasonal signals in most country-level time series.
Der GRIN Verlag hat sich seit 1998 auf die Veröffentlichung akademischer eBooks und Bücher spezialisiert. Der GRIN Verlag steht damit als erstes Unternehmen für User Generated Quality Content. Die Verlagsseiten GRIN.com, Hausarbeiten.de und Diplomarbeiten24 bieten für Hochschullehrer, Absolventen und Studenten die ideale Plattform, wissenschaftliche Texte wie Hausarbeiten, Referate, Bachelorarbeiten, Masterarbeiten, Diplomarbeiten, Dissertationen und wissenschaftliche Aufsätze einem breiten Publikum zu präsentieren.
Kostenfreie Veröffentlichung: Hausarbeit, Bachelorarbeit, Diplomarbeit, Dissertation, Masterarbeit, Interpretation oder Referat jetzt veröffentlichen!

