ArticleText 63744 2 10 20220502
ArticleText 63744 2 10 20220502
net/publication/360313350
CITATIONS READS
13 2,379
4 authors:
All content following this page was uploaded by Chau Thi Diem Le on 02 May 2022.
Thi Diem Chau LE1, 2, Duc Duy NGUYEN3, Judit OLÁH1, 4*, Miklós PAKURÁR5
1, 5Faculty
of Economics and Business, University of Debrecen, Debrecen, Hungary
2Dept of Quality Assurance, Ho Chi Minh City University of Technology – VNUHCM, Ho Chi Minh, Vietnam
3School of Business and Management, LUT University, Lappeenranta, Finland
4Faculty of Applied Sciences, WSB University, Dabrowa Górnicza, Poland
Abstract. The demand for daily food purchases has increased dramatically, especially during the Covid-19 pandemic. This
requires suppliers to face a huge and complex problem of delivering products that meet the needs of their customers on
a daily basis. It also puts great pressure on managers on how to make day-to-day decisions quickly and efficiently to both
satisfy customer requirements and satisfy capacity constraints. This study proposes a combination of the cluster-first –
route-second method and k-means clustering algorithm to deal with a large Vehicle Routing Problem with Time Windows
(VRPTW) in the logistics and transportation field. The purpose of this research is to assist decision-makers to make quick
and efficient decisions, based on optimal costs, the number of vehicles, delivery time, and truck capacity efficiency. A dis-
tribution system of perishable goods in Vietnam is used as a case study to illustrate the effectiveness of our mathematical
model. In particular, perishable goods include fresh products of fish, chicken, beef, and pork. These products are packed
in different sizes and transferred by vehicles with 1000 kg capacity. Besides, they are delivered from a depot to the main 39
customers of the company with arrival times following customers’ time window. All of the data are collected from a logis-
tics company in Ho Chi Minh city (Vietnam). The result shows that the application of the clustering algorithm reduces the
time for finding the optimal solutions. Especially, it only takes an average of 0.36 s to provide an optimal solution to a large
Vehicle Routing Problem (VRP) with 39 nodes. In addition, the number of trucks, their operating costs, and their utiliza-
tion are also shown fully. The logistics company needs 11 trucks to deliver their products to 39 customers. The utilization
of each truck is more than 70%. This operation takes the total costs of 6586215.32 VND (Vietnamese Dong), of which, the
transportation cost is 1086215.32 VND. This research mainly contributes an effective method for enterprises to quickly
find the optimal solution to the problem of product supply.
Keywords: vehicle routing problem, logistics, time window, cluster, route, k-means clustering, transportation.
Notations
Variables and functions:
c – transportation unit cost per km [VND/km]; ei – the earliest time that customer can receive
C – set of customers; the products;
Ck – the be a set of K centers in the k-means clus- f – refrigeration unit cost per km [VND/km];
tering model; jj – the latest time that customer can receive the
CTk – maximum permissible load of truck k [kg]; products;
d ‒ dimensional real vector; G – graph;
dij – distance from node i to node j [km]; Kij – maximum spending time between 2 nodes;
Di – demand of customer i; L – labour cost per truck per day [VND];
dist (di, ck) – the Euclidean distance between a data point pi – processing time of customer i;
di and the cluster center ck; Sj – the set of samples that belong to the kth
Ei – estimated volume of products of customer i; cluster;
Sik – an integer variable that present the time of from academics and managers Zhang et al. 2019). In the
vehicle k starts to service a customer i; practical, VRP appears as several classes of additional
tij – moving time from node i to node j; constraints, such as considering customer time windows,
(ji, ei) – the time window at customer i where ji the route lengths, the limits on vehicle capacity, etc. In which,
earliest time that customer i can receive the the VRPTW is an important variant of the VRP and has
products and ei is the latest time that cus- been paid attention from researchers (Wang et al. 2020).
tomer i can receive the products; The VRPTW aims to determine the optimal set of routes
n – speed [km/h]; for a set of restricted capacity identical vehicles and sat-
V – vehicle; isfy the time window constraints (El-Sherbeny 2010).
VTk – maximum volume of truck k [m3]; Each customer is served exactly once by a vehicle within
xijk – a binary variable that equals to 1 if vehicle k the time window defined by the earliest and latest times.
drives directly from node i to node j, other- Each vehicle trip starts and finishes at the given depot.
wise, it is 0; The locations, demand, and time windows of customers
X – a set of observations. are known as a priority. In the VRPTW, the vehicle could
Abbreviations: arrive at the customer’s location before the earliest time
ASEAN – Association of South-East Asian Nations; and wait without cost or a penalty cost for example park-
GDP – gross domestic product; ing slots cost until service becomes allowable. However,
IMF – International Monetary Fund; the vehicle must not arrive at customers after the latest
LIFO – last in first out; time. The time window constraints in the VRPTW are re-
MILP – mixed integer linear programming; quired strictly satisfied, however, in many practical prob-
MIP – mixed integer programming; lems, these constraints can be violated. There are 2 types
VND – Vietnamese Dong; of VRPs that allow violating the time windows constraints:
VRP – vehicle routing problem; (1) VRP with flexible time windows and (2) VRP with soft
VRPTW – VRP with time windows. time windows.
There are different methodologies for determining the
Introduction solutions of VRP and its variants such as exact algorithm,
heuristic methods, and hierarchical method (Comert et al.
Logistics costs are costs associated with the process of dis- 2018). In the hierarchical method, the problem is system-
tribution and circulation of goods, which have a direct atically split into different levels for determining solutions.
impact on the efficiency of businesses, localities, and the In this research, one of the hierarchical methods named
national economy. Logistics costs are paid by businesses cluster-first – route-second was employed. This study in-
for supply activities in the market. Therefore, it can ac- troduces a practical problem of vehicle routing in the field
celerate or inhibit product consumption and influence of logistics and transportation. This problem mainly takes
the competitiveness of enterprises and the whole econo- into account three constraints, including time window,
my. Logistics costs mainly consist of transportation costs, vehicle capacity, and product type. The aims of this study
warehousing, packing and losses, inventory, and order are to determine the number of vehicles and minimize to-
processing, and administrative (Rushton et al. 2022). tal logistics costs. In which, logistics costs include labour,
In Vietnam, logistics costs are much higher than in fuel, and refrigeration costs. The MIP model is built to
many countries ASEAN region and around the world such describe the product delivery network with 39 nodes. The
as Thailand, Singapore, and so on. According to the data clustering algorithm is used to solve the proposed MIP
of the World Bank, logistics costs including transporta- model. The results show that 11 trucks are used to serve
tion, storage, customs clearance, and so on in Vietnam 39 customers and the truck utilization efficiency is more
are about 20.9…25.0% of GDP (VILAS 2021). In which, than 70%. These optimal results are found in an average
transportation costs account for about 60%, a high cost of 0.36 s.
compared to developed countries. This cost is higher than This paper consists of several sections as follows. Sec-
Thailand’s 6%, Malaysia’s 12%, and 3 times higher than tion 1 is the literature review, which describes the over-
Singapore’s. According to the IMF, in the US economy, lo- view of the research gaps of previous studies. A math-
gistics costs account for 9.9% of the country’s GDP ($921 ematical model of the VRP and methodology are shown
billion in 2000). High logistics cost situations could cause in Sections 2 and 3, respectively. Section 4 presents a case
a low level of enterprises’ competitiveness. Therefore, re- study, including data collection, assumptions, and results.
ducing logistics costs, especially transportation costs, is a The finally is the conclusions.
big challenge for Vietnam’s logistics industry.
VRP is formed to minimize the transportation cost. 1. Literature review
The development of VRP is based on the multiple trave-
ling salesman problems (Toth, Vigo 2014; Xu et al. 2018). VRP is an important topic considered by many managers
VRP played a vital role in the field of logistics and trans- in the logistics and transportation fields. According to Cao
portation. Consequently, VRP attracts a lot of attention and Yang (2017), there are two basic types of VRP, includ-
Transport, 2022, 37(1): 17–27 19
ing pure VRPs and their variations. In which, pure VRP hicles in their objective functions. Kantawong and Praves-
refers to as the distribution of cargo from one destination jit (2020) implemented the optimization of distance and
to many other locations (Yao et al. 2019). The solutions of vehicle service cost. Londoño et al. (2021) also optimized
VRP bring more benefits for companies such as shortest distance in addition to minimizing transportation costs
routes and time, the number of necessary vehicles, and op- and penalty fees. Qi and Hu (2020) found the solution
timal vehicle capacity. These results contribute to reduce to optimize three types of cost in their cost function, in-
the total costs of logistics and increase businesses’ profit cluding the costs of fuel, refrigeration, and cargo damage.
(Le et al. 2020; Zhu et al. 2020). Simultaneously, they also optimize the number of vehicles
In recent years, VRP has gained more interest from used in their problem. Tasar et al. (2019) determined the
academics. There are many characteristics of VRP that are optimal number of vehicles to minimize the fixed and fuel
solved by different mathematical models, and their results costs. Similarly, Aggarwal and Kumar (2019) established
meet the aims of businesses (Table 1). Table 1 shows that the objective to drop their transportation cost and cost
the differences in VRP characteristics between previous for the availability of vehicles and determine the optimal
studies focus on objectives, problem description, and vehicle number. Unlike introduced studies, Afifi et al.
mathematical model and algorithm. First of all, the cost (2016) and Pérez-Rodríguez, Hernández-Aguirre (2019)
is the most concern of previous studies while time, dis- focused on reducing time and distance in their problem,
tance, and the number of vehicles have less appearance in respectively.
their objective function. Zhu and Hu (2019) considered Secondly, problem description in almost previous stud-
the cost of opening routes in the objective equation (Ruiz ies includes network configure, constraint, and data. Net-
et al. 2019). Zhang et al. (2017) and Spliet and Desaulniers works configure from previous studies are sets of nodes
(2015) introduced the transportation cost as a factor to be ranged from 6 nodes to 200 nodes. These problems mainly
reduced in their objective function. In a more complicated deal with two constraints: time window and vehicle capac-
way, Birim (2016) considered both transportation cost and ity. In which, a time window is a more popular constraint
fixed vehicle cost. Zhu and Hu (2019) minimized three than the vehicle capacity. According to Bachu et al. (2021),
types of costs that include fuel cost, labour cost, and ve- building a model to respond to time constraints is a major
hicle depreciation cost. Some studies consider both costs challenge. Thus, almost all previous studies consider one
and other factors such as distance and the number of ve- of two constraints when determining the optimal solution.
Problem description
Objective
network Constraint Data Mathematical
Study
no of dis- configuration time vehicle kind of experi- model and algorithm
cost time reality
vehicle tance (N – nodes) window capacity products ence
Qi, Hu (2020) ü ü 13 N ü MIP, heuristic
Kantawong, MIP, artificial bee
ü ü 100 N ü ü
Pravesjit (2020) colony
Londoño et al.
ü ü 51 N ü MILP, local search
(2021)
Aggarwal, Kumar
ü ü 60 N ü ü MIP
(2019)
Pérez-Rodríguez,
an estimation of
Hernández- ü 6N ü ü
distribution
Aguirre (2019)
Tasar et al. (2019) ü ü 100 N ü MIP, heuristic
MILP, response
Zhu, Hu (2019) ü 200 N ü ü
surface method
MIP, biased
Ruiz et al. (2019) ü 14 N ü ü
random – key genetic
tabu search, the
Zhang et al. (2017) ü 27 N ü ü
artificial bee colony
MILP, simulated
Birim (2016) ü 10 N ü ü
annealing
MIP, simulated
Afifi et al. (2016) ü 30 N ü ü
annealing
Spliet, Desaulniers MIP, exact branch –
ü 60 N ü ü ü
(2015) price – cut algorithm
MIP, clustering
Current study ü ü 39 N ü ü ü ü
algorithm
20 T. D. C. Le et al. Clustering algorithm for a vehicle routing problem with time windows
tions are considered in this mathematical model. They presents that customers need to be visited exactly once by
include: (1) the current total number of vehicles meet the a vehicle. On other words, one customer is only served
total customer demand per day and (2) each customer by one truck. Equation (3) states that the used vehicle
orders the number of products for the day not to exceed could not exceed the number of available vehicles. Equa-
the capacity of one vehicle. As a result, each vehicle is ar- tions (4) and (5) mean vehicle capacity need to be satis-
ranged on a route to serve customers and carry a variety of fied. This means that transporting products to customers
suitable products. This means that each vehicle can serve is not exceed the maximum capacity of a truck. In which,
many customers in its trip. By contrast, one customer only the vehicle capacity is calculated by both permissible load
receives products from only one vehicle at one time. with the unit of a [kg] and volume with the unit of [m3].
The VRP time windows is the VRPTW is formed by Equation (6) ensures that every vehicle leaves at the de-
a set of V vehicles, a set of nodes that includes a set of pot zero to a customer (Equation (7)) and then finishes
customer C and a driving-out depot, and a returning de- the trip at the depot n + 1 (Equation (8)). Equation (9)
pot. All of the nodes together generate a directed graph presents that every vehicle from i to j cannot arrive at j
G = (V , C ). Customers are denoted from 1 to n with n is before Sik + tij. Where tij is the spending time from i to
the number of customers, the driving – out depot and re- j that is equal to total moving time from i to j and the
turning depot are presented by 0 and n + 1, respectively. moving time at customer j. The moving time is calculated
The network is a set of N nodes that consist of C + 2 based on the distance between i and j and vehicle velocity
nodes. An arc ( i, j ), where i ≠ j is an arc from node i to (Equation (10)). Equation (11) defines a value of Kij that
node j. No outgoing arc at node n + 1 and no arc come to helps linearize a non-convex optimization. Finally, Equa-
node 0. However, in some cases, node 0 and node n + 1 tion (12) ensures that time windows are observed. Equa-
are in the same location. tions (13) and (14) show type of variables.
min ∑∑∑ (c + f ) ⋅ dij ⋅ xijk + L ⋅ ∑∑ x0 jk (1)
2.2. k-means clustering algorithm
k∈V i∈N j∈N k∈V j∈C
subject to: k-means clustering is a well-known clustering method
that is used to form n observations to k clusters, where
∑∑ xijk = 1, ∀i ∈ C ; (2) k is known as a priority. The objective of the algorithm is
k∈V j∈C to minimize the objective function and to separate each
∑∑ x0 jk ≤ V , ∀k ∈V , ∀j ∈ N ; (3) compact class as far as possible. k-means clustering could
be explained as below (Khan, Ahmad 2004).
k∈V j∈C
Given X = {x1 , x2 , ..., xn } is a set of observations,
∑ Di ⋅ ∑ xijk ≤ CTk , ∀k ∈V ; (4)
and each observation is a d ‒ dimensional real vec-
i∈C j∈C
tor. CCk = {c1 , c2 , ..., cn } is the be a set of K centers.
∑ Ei ⋅ ∑ xijk ≤ VTk , ∀k ∈V ; (5) S j = {d | d is member of cluster k} be the set of samples
i∈C j∈C that belong to the kth cluster.
∑ x0 jk = 1, ∀k ∈V ; (6)
n
pacity of the truck needs to be taken into account. There- circles. The notations for customers in the map begin with
fore, Comert et al. (2018) introduced the equation for C, for example, C1 is the name of customer 1. The distance
determining the number of k in k-means clustering in a matrix among depots and customers is calculated based
capacitated VRP problem. The number of clusters is based on latitudes and longitudes of their locations from Google
on the capacity of trucks and demand: Maps.
total demand The logistics company has 15 1000 kg reefer trucks
number of clusters = . (17) that can flexibly deliver goods in Ho Chi Minh city urban
truck capacity
in rush hours and off-peak hours. Every day, the planning
The number of clusters is round up to an integer department in the warehouse get orders from customers
number. For example, in this research, the total demand and assigns customers for trucks. Next, the driver sorted,
is 10020 kg, and a truck that has a capacity is 1000 kg. As loaded, and delivered to customer. The product in a truck
a result, the number of clusters is calculated is 10.020 kg is arranged and grouped by customers and follows the
then the decimal rate is round up to 11. Once the number LIFO rule. The velocity of each truck is assumed 45 km/h.
of clusters is determined, the k-means algorithm is per- The working time is from 8:30 to 15:30 h. The earli-
formed in the data. The related customers are assigned to est time and the latest time for serving by trucks and the
clusters has total demand could not exceed the capacity. processing time are varied by customers.
Set number base value is specified by considering the con- In this research, the truck cost is focused on labour
dition: truck utilization is 70% truck. After the initial set cost, fuel cost, and refrigeration cost. Each truck is op-
of members in each cluster is generated from the k-means erated by a driver, where the salary was assumed to be
clustering algorithm based on the customer locations. The 500000 VND/day. The salary is calculated based on the
cluster demand is checked whether it is satisfied with the salary provided by the human resource department. Based
truck capacity and truck utilization condition (less or on the manufacturer’s technical report, each truck with
more than 700 to 1000 kg), the member of each group full load cost 2551 VND/km for fuel. The refrigeration
could be modified and capacity controlled repeated until cost per km is assumed to equal 20% of the fuel cost.
set satisfied the conditions.
3.2. Assumptions
3. A case study
Assumptions are:
3.1. Data collection »» the problem is static;
»» the model uses one type of reefer truck;
In this research, a transportation network from a logis- »» the products of customers are packed and arranged
tics company in Ho Chi Minh city (Vietnam) is consid- before being shipped;
ered. The company provides warehousing and distribu- »» the number of customers, customer time window,
tion services for 39 main customers in Ho Chi Minh city. customer demand, processing time at customer lo-
As shown in Figure 2, a red star is used to represent the cations are deterministic and they are known as a
depot, which also is the warehouse and parking place, priority;
whereas the locations of customers are presented by blue »» the fuel cost and refrigeration cost are deterministic;
»» customer demand is changed day-by-day;
»» every truck can start at the depot at j0 = 0 and re-
turn the depot at e0 = 420 (7 working h);
»» each customer may be serviced by one tractor truck;
»» backhauls are not permitted;
»» 15 vehicles can meet the total demand of customers
every day;
»» each customer’s demand per day is less than or equal
to 1000 kg.
Figure 3. Customer and cluster locations from capacitated k-means clustering (source: current study)
Table 5. Computational results for different numbers of k from Table 6. An example demand set for problems have
classical k-means clustering algorithm (source: current study) 9 customers (source: current study)