A Comparative Evaluation of Multi-Layer Perceptron, Moving Average, and Autoregressive Integrated Moving Average Models for Forecasting Carbon Dioxide Levels
This project aims to evaluate and compare the performance of three forecasting models for predicting monthly CO2 levels at the Mauna Lua observatory:
- Multi-Layer Perceptron (MLP) : A neural network architecture capable of learning complex relationships within the data.
- Moving Average (MA) : A simple statistical method that averages the past
Kobservations to predict the nextTvalues. - Autoregressive Integrated Moving Average (ARIMA) : A more advanced statistical model that incorporates both past values and past errors to forecast future observations.
- Data Acquisition : Obtain the monthly Mauna Lua CO2 dataset.
- Data Preprocessing : Address missing values (if any) and normalize the data for improved model training.
- Model Implementation :
- MLP : Develop an MLP architecture with
Kinput neurons representing the pastKCO2 readings and one output neuron predicting the nextTvalue. Training the model using an appropriate optimizer and loss function (e.g., Adam optimizer, Mean Squared Error loss). - MA : Implementing a rolling window approach to calculate the average of the past
Kobservations for each prediction. - ARIMA : Utilize a library like statsmodels to establish an optimal ARIMA model.
- MLP : Develop an MLP architecture with
- Evaluation :
- Split the data into training and testing sets.
- Train each model on the training set.
- Generate forecasts for the next
Tvalues on the testing set using each model. - Evaluate the performance of each model using metrics like Mean Squared Error (MSE), Root Mean Squared Error (RMSE)
- Visualize the actual CO2 values against the predicted values for each model to compare their accuracy.
- Results and Discussion :
- Analyze the performance metrics and visualizations to identify the most effective forecasting model for this dataset and configuration (
KandT). - Discuss the potential reasons for the observed results, considering factors like model complexity (MLP's ability to capture non-linear relationships), seasonality in the data (potential advantage for ARIMA), and the impact of data quality and length on model training.
- Analyze the performance metrics and visualizations to identify the most effective forecasting model for this dataset and configuration (
This project will provide valuable insights into the suitability of different forecasting methods for CO2 prediction on the Mauna Lua dataset. By comparing the performance of MLP, MA, and ARIMA, we can gain a better understanding of the trade-offs between model complexity and accuracy in this specific context.