FutureOrders - Forecasting Demand With AI
FutureOrders - Forecasting Demand With AI
Demand forecasting is one of the main activities of supply chain management that helps
companies predict customer demand, manage inventory, and make optimal allocation of
resources. With greater global market complexity, accurate demand forecasting has become
more critical than ever. Companies employ forecasting to prevent stockouts, reduce excess
inventory, and drive overall operating efficiency to its maximum level. A well-functioning
demand forecasting system generates customer satisfaction through ensured product
availability while reducing holding costs concurrently.
Traditional demand forecasting techniques, such as moving averages and exponential
smoothing, have been used for decades. However, these approaches often fail to capture
complex patterns, especially in datasets exhibiting high seasonality, trends, or sudden
fluctuations. In response to these limitations, modern machine learning and deep learning
techniques have emerged as powerful alternatives. By leveraging vast amounts of historical
data, these advanced methods can identify intricate relationships and patterns that traditional
models might overlook.
Machine learning-based demand forecasting integrates a variety of algorithms, including
decision trees, support vector machines, neural networks, and ensemble methods like
XGBoost. Each of these techniques has its strengths, depending on the nature of the data and
the specific business context. Additionally, hybrid models combining statistical and machine
learning approaches have gained popularity due to their ability to balance interpretability and
predictive power.
In this project, we focus on forecasting daily demand using the "Daily Demand Forecasting
Orders" dataset. We explore a range of forecasting models, from statistical methods like
ARIMA to machine learning techniques such as LSTM and XGBoost. By evaluating the
performance of these models, we aim to identify the most effective approach for predicting
demand fluctuations accurately. This study offers important information on the application of
machine learning in demand forecasting and different approaches with advantages and
disadvantages.
Machine learning demand forecasting involves a range of algorithms, such as decision trees,
support vector machines, neural networks, and ensemble techniques such as XGBoost. Each
has its advantages, depending on the type of data and the particular business situation.
Furthermore, hybrid models where statistical and machine learning methods are used together
have also become widely applied because they can find the optimal trade-off between
interpretability and predictive capability.
2
3.1 Data Collection The dataset consists of daily order records, including various features
influencing demand. Key attributes include:
• Week of the month (First, second, third, fourth, or fifth week)
• Day of the week (Monday to Friday)
• Non-urgent order count
• Urgent order count
• Order type (A, B, and C)
• Fiscal sector orders
• Orders from the traffic controller sector
• Banking orders (Three categories)
• Target variable: Total orders
3.3 Model Selection & Training Multiple regression-based and tree-based machine learning
models were developed and validated for demand forecasting. The models are as below:
• Linear Regression: A basic model that identifies linear trends with respect to demand.
• Ridge Regression & Lasso Regression: Regularized linear models reducing overfitting.
• Decision Tree Regressor: A non-linear model that separates data based on feature values.
• Random Forest Regressor: An ensemble model consisting of multiple decision trees that
enhance the accuracy of prediction.
• Gradient Boosting & XGBoost Regressors: Advanced boosting models known for their
strong predictive power and ability to handle complex relationships..
3.5 Feature Importance Analysis To understand which factors influence demand the most,
we analyzed feature importance scores from tree-based models (Random Forest, Gradient
Boosting, and XGBoost). These insights can help businesses prioritize key demand drivers in
decision-making.
3.7 Model Evaluation Performance measures utilized to compare the models are:
• Mean Absolute Error (MAE): Measures the average magnitude of errors in predictions.
• Mean Squared Error (MSE): Computes the average squared difference between actual
and predicted values.
• Root Mean Squared Error (RMSE): Provides an interpretable measure of prediction
accuracy.
7
Performance Metrics:
• Best Model: XGBoost
o Mean Absolute Error (MAE): 24.25
o Root Mean Squared Error (RMSE): 40.67
o R² Score: 0.84
o Cross-Validation R² Mean: 0.9041
10