Sensors 22 04968 v2
Sensors 22 04968 v2
Article
Predicting Patient Length of Stay in Australian Emergency
Departments Using Data Mining
Sai Gayatri Gurazada 1 , Shijia (Caddie) Gao 1, * , Frada Burstein 1 and Paul Buntine 2
1 Faculty of Information Technology, Monash University, Clayton, Melbourne, VIC 3800, Australia;
sgur0006@student.monash.edu (S.G.G.); frada.burstein@monash.edu (F.B.)
2 Eastern Health Clinical School Monash University, Box Hill, Melbourne, VIC 3128, Australia;
paul.buntine@easternhealth.org.au
* Correspondence: caddie.gao@monash.edu; Tel.: +61-3-9903-2411
Abstract: Length of Stay (LOS) is an important performance metric in Australian Emergency De-
partments (EDs). Recent evidence suggests that an LOS in excess of 4 h may be associated with
increased mortality, but despite this, the average LOS continues to remain greater than 4 h in many
EDs. Previous studies have found that Data Mining (DM) can be used to help hospitals to manage this
metric and there is continued research into identifying factors that cause delays in ED LOS. Despite
this, there is still a lack of specific research into how DM could use these factors to manage ED LOS.
This study adds to the emerging literature and offers evidence that it is possible to predict delays in
ED LOS to offer Clinical Decision Support (CDS) by using DM. Sixteen potentially relevant factors
that impact ED LOS were identified through a literature survey and subsequently used as predictors
to create six Data Mining Models (DMMs). An extract based on the Victorian Emergency Minimum
Dataset (VEMD) was used to obtain relevant patient details and the DMMs were implemented using
the Weka Software. The DMMs implemented in this study were successful in identifying the factors
that were most likely to cause ED LOS > 4 h and also identify their correlation. These DMMs can be
used by hospitals, not only to identify risk factors in their EDs that could lead to ED LOS > 4 h, but
Citation: Gurazada, S.G.; Gao, S.;
also to monitor these factors over time.
Burstein, F.; Buntine, P. Predicting
Patient Length of Stay in Australian
Keywords: clinical decision support; data mining models; emergency department; length of stay;
Emergency Departments Using Data
predictive data mining; Weka
Mining. Sensors 2022, 22, 4968.
https://doi.org/10.3390/s22134968
2. Literature Survey
This section firstly discusses the general processes in an Australian ED followed by
suitable DMMs that can be used in the ED LOS context. It also discusses the factors that are
likely to risk delays in the ED as suggested by other studies. Both the DMMs and factors
discussed in this section were identified by conducting a systematic literature survey based
on their suitability to the Australian ED context. PubMed and Scopus were the primary
databases used to identify literature for this study. Additionally, some other government
Sensors 2022, 22, 4968 3
primary databases used to identify literature for this study. Additionally, some other
websites were ernment
used when websites
necessary.were used when
PubMed necessary.
and Scopus were PubMed and Scopus
the primary databases were
usedthe primar
tabases used for the identification of studies for this research.
for the identification of studies for this research. The studies were identified using three The studies were iden
using three sets of search terms (ST) (i = 3). The studies
sets of search terms (ST) (i = 3). The studies were initially identified from the databases were initially identified fro
databases and additionally from other sources such
and additionally from other sources such as Google scholar, government websites and as Google scholar, government
references wheresitesapplicable.
and references
All the where applicable.
studies were thenAll the studies
screened were then
to remove screened
duplicates andto remov
to ensure that plicates
they were and to ensure
relevant that
to the they were
research. Therelevant
studies to werethealso
research.
filteredTheoutstudies
based were al
tered out based on quality (quality of journal/conference,
on quality (quality of journal/conference, must be peer-reviewed). The number of studies must be peer-reviewed
number
(ni ) obtained from eachofofstudies (ni) obtained
these searches from each
is denoted of these
in Figure searches
1. There wasisnodenoted in Figure 1.
restriction
was no restriction enforced on the publication year since
enforced on the publication year since DM for CDSS, ED LOS and data quality are relatively DM for CDSS, ED LOS and
older concepts still applicable in present context. This literature survey explores studies literatur
quality are relatively older concepts still applicable in present context. This
that use DM asvey partexplores
of CDSS studies that use using
in healthcare DM asstudies
part of selected
CDSS in from healthcare
searchusing
i = 1.studies
These selected
search
studies were also usedi =to1.determine
These studies wereappropriate
the most also used tokey determine the most
performance appropriate
indicators for key p
interpreting ourmance indicators
results. We survey for potential
interpreting our affecting
factors results. We LOS survey potentialED
in Australian factors affecting L
context
using studies from i = 2. Since we acknowledge the significance of data quality on accuracysignifican
Australian ED context using studies from i = 2. Since we acknowledge the
data quality
of results, we explored theon accuracy
impact of results,
of data qualitywein explored the impact
similar studies usingofliterature
data quality
fromin similar
i = 3. ies using literature from i = 3.
Figure
Figure 1. Systematic 1. Systematic
literature surveyliterature survey
to identify to identify
suitable studies.suitable studies.
urgent) by ED staff based on the acuity [7]. Following this, staff is scheduled, and finally,
relevant internal departments are consulted, and further patient testing and assessments
are conducted.
This final stage was found to take the longest time to complete, usually affecting ED
LOS. There have been many efforts to streamline the ED process, for example introducing
measures to reduce the amount of paperwork at each stage. However, there have been no
efforts into reengineering the ED process and it has remained the same for many years.
Although limited, there has been some research into how DM can be used as part of CDS
to make the processes in EDs more efficient. Some instances of DM used as part of CDSS in
EDs are predicting patient pathway, predicting patient admissions, pathology ordering,
customer relationship management (CRM), and predicting ED LOS [21]. Several factors
were found to increase ED LOS and are discussed in the following sections.
LOS [33,36,37]. It was also hypothesized that patients arriving by ambulance were more
likely to be presenting with severe conditions compared to those arriving by other means
of transport such as personal cars. This is why the arrival mode of a patient at the hospital
could determine their ED LOS [2,36].
Triage category is another important factor that has been found to impact ED LOS [38].
All Australian EDs follow a standardized triage category system which ranges from
1–6 with decreasing level of patient severity. All patients are assigned a code by the
triage nurse upon arrival at the ED based on their presenting condition. As a result of this,
medical attention is given sooner to those patients who appeared to be presenting with
more severe conditions (triage 1–3) than those with triage category above 3. This could
mean that patients assigned a less critical triage might end up having to wait longer to be
treated [36]. Time taken for doctors to first examine a patient and delays in diagnostics
is indicative of ED crowding. This also could indicate the resource availability for that
ED [34].
Some studies suggested that patient gender and ethnicity are also factors affecting ED
LOS, even though there was no sufficient evidence to prove this [4,39,40]. For instance,
it was found that ethnicity only impacted ED LOS in cases where there were language
barriers or communication issues [41]. Despite this, both these factors were considered for
this study. Other factors such as patient admission and requiring an interpreter were also
found to impact ED LOS. This is because those requiring admission or interpreters might
have to wait longer for diagnosis or scans due to severity in condition or complexities in
communication, respectively [6,42]. Even though other factors such as patient insurance
and number of staff available in the ED were found to impact ED LOS, they are considered
to be out of scope for this study as they were not included in the dataset [6,37,43]. Many
of these factors are either directly available in the VEMD or can be derived from VEMD
attributes. These factors can be used as predictors to build PDMMs as detailed in the
following sections.
3. Research Methodology
This section discusses the dataset used in this study along with an overview of the
dataset quality. Following this, steps involved in the pre-processing and analysis of these
dataset are discussed.
3.1. Dataset
A de-identified data extract containing all elements from VEMD plus additional
administrative data were obtained from three locations of Healthcare Service A. Data
collected at the three locations of Healthcare Service A for the year 2019 was obtained and
used for this study. The data obtained for this study was collected before the COVID-19
pandemic and does not take into account factors such as shortage of resources or any other
special circumstances.
Along with the standard VEMD attributes, the dataset obtained for this study had
several additional attributes introduced by staff for internal uses. Many of these attributes
are either irrelevant to this research or of low data quality. For example, an attribute called
“presenting condition” had over 7000 unique values which were either misspellings or
a variation of the same values. Including this attribute in the analysis may have been
beneficial to the study by helping improve model utility and performance [6]. Despite this,
due to its low quality, it was removed from the analysis. This issue is consistent with the
findings from other studies that reported human-related data entry issues to be the biggest
reason for quality issues of the VEMD [45]. Data quality ultimately determines the accuracy
and reliability of a PDMM. Through exploration of the dataset and cleaning, the quality of
data can be improved [46]. This pre-processing of the dataset to improve data quality is
discussed further in Section 3.3.
Figure 2. Portion of the decision tree. Note that the dotted lines indicate that the branch is connected
Figure 2. Portion of the decision tree. Note that the dotted lines indicate that the branch is connected
to the rest of the tree. The ovals contain decisions, and the rounded rectangles contain the class.
to the rest of the tree. The ovals contain decisions, and the rounded rectangles contain the class.
For this study, a pruned DT with a confidence factor of 0.25 was produced. There
4. Results
was no significant difference in model
A total of 173,005 EDperformance
presentationswhen the DT
recorded was left
by three unpruned
locations or
of Healthcare Service A
reduced error pruning was used. Following this, the RF algorithm was implemented.
in the year 2019 were used in this study. Of these presentations, 38.62% reported having an RF
models provideED accurate
LOS > 4results
h, whilewhen bagging
61.38% is performed
reported an ED LOSas≤it4is h.not
Theprone
average to time
overfit-it took for a doctor
ting [29]. This istobecause the RF works by computing several
first examine a patient was 73.88 min. Six PDMMs were implementedsmall decision trees before for this study. Of
producing one final
thesedecision.
six models, Hence,
the RF
J48 models
DT, LR,can NB,beandrunZeroR
on thealgorithms
complete dataset, unlike
were implemented using the
DTs, which canPS overfit. The “bag size percent” value can be changed in RF model
feature on Weka. The accuracy for these models was computed as the average accuracy settings
before execution. For thisover
obtained study, the default
10 iterations “bag
with size percent”
different value[55,56].
seed values of 100 Thewas ROCused.and number of
Next, the K-NNcorrectly
model called the LazyIBK model on Weka was implemented.
classified instances known as the accuracy of the model were Similar to adopted as the
the RF, the K-NN algorithm does not require training [57,58]. A k-value is to be
primary performance metrics in this study. They were identified to be suitable to be used specified
in the model settings
in the ED before
LOSexecution.
context [4,6].Increasing or decreasing the k-value impacts the
model performance.The It was
firstreported that larger
model, ZeroR, hadk-values
an average improved
baseline model
accuracyperformances
of 61.41% and Standard
for larger datasets [56,59].(SD)
Deviation Forof this study,
0.154. TheaROC k-valuevalueof for
1 was
this used.
modelKwas values
0.05 upoutto of50theinmaximum value
increments of 5ofwere tested, but these models had an insignificant change in
1. The next model, J48, had an average accuracy of 72.10%, SD of 0.1, and an ROC value accuracy.
Following this, ofboth the This
0.762. LR and NB models
model’s accuracy wereandimplemented using thehigher
ROC are significantly PS method.
than the Thebaseline accuracy.
results obtainedThe from these models are presented in Section 4.
DT had a total of 944 leaves which were generated based on test cases. One part of this
DT can be seen in Figure 2.
4. Results In this figure, we see that a total of 23,087 patients experienced an ED LOS >4 h when
their
A total of 173,005 first
EDdoctor visit lasted
presentations longer
recorded by than
threethe average
locations time and were
of Healthcare admitted into the
Service
inpatient ward. On the other hand, if the patient waited
A in the year 2019 were used in this study. Of these presentations, 38.62% reported having less than the average time to first
be examined by a doctor but required a CT scan,
an ED LOS > 4 h, while 61.38% reported an ED LOS ≤ 4 h. The average time it took for a the age of the patient determines their ED
LOS. In this
doctor to first examine case, 9988
a patient was patients
73.88 min. agedSix>74
PDMMsstayedwerelonger.
implemented for this
The NBthe
study. Of these six models, model yielded
J48 DT, LR, an NB,average
and ZeroRaccuracy of 70.23%
algorithms werewith an SD of 0.167 and an ROC
implemented
using the PS feature on Weka. The accuracy for these models was computed as thean
of 0.758. The LR model yielded an average accuracy 71.33% with SD of 0.155 and ROC
average
value of 0.773. Both these models had an accuracy
accuracy obtained over 10 iterations with different seed values [55,56]. The ROC and num-and ROC value lower than the J48 model.
ber of correctly classified instances known as the accuracy of the model were adopted as the LazyIBK
However, they were still higher than the accuracy and ROC of ZeroR. Next,
model wasmetrics
the primary performance implemented
in thiswith study.k =They
1. This model
were had antoaccuracy
identified be suitableof 74.04%,
to be along with an
ROC value
used in the ED LOS context [4,6]. of 0.82. For the LazyIBK model, several k-values starting from k = 1 to k = 50
were tested
The first model, ZeroR, in increments
had an average of 5. The modelaccuracy
baseline had an average
of 61.41%accuracy of ≈72% when k values
and Standard
other than 1 were tested out. Finally, the RF model
Deviation (SD) of 0.154. The ROC value for this model was 0.05 out of the maximum value had an accuracy of 74.024% with an
ROC value of 0.81. Both the RF and LazyIBK had the highest ROC and accuracies of all
of 1. The next model, J48, had an average accuracy of 72.10%, SD of 0.1, and an ROC value
Sensors 2022, 22, 4968 9 of 15
the models. The models implemented using PS did not show any significant change in
accuracy when varying the seed values. This is why there is no significant difference in
SD for these models. The f-measure, recall and precision of each of these models was also
computed. The ZeroR model did not produce any recall or precision as it focuses only on
the majority class, which is “≤4” in this study. The overall performance metrics form the
data analysis can be found in Table 2.
5. Discussion
This study found that more than one-third of the ED presentations in the year 2019
had an ED LOS > 4 h. This was slightly higher than what was reported by other Australian
EDs [4,6]. Of the six models that were implemented in this study, RF and LazyIBK models
had the best model performance. The LazyIBK model had an ROC value of 0.82 and the RF
had an ROC of 0.81. An ROC value of 1 indicates a perfect test with any values close to
0.7 considered to be acceptable. ROC values above 0.8 for medical research are considered
to indicate excellent model performance [60]. The other three models, J48, LR, and NB had
acceptable ROC curve values, while the ZeroR model could be considered an imperfect test.
ROC values reported in other medical studies range from 0.6 to 0.86 [4,61,62]. The ROC
values of both the RF and LazyIBK were consistent with what was reported by another
study conducted in the Australian ED context [63].
LazyIBK, which is a Weka implementation of K-NN used in this study, had nearly the
same accuracy of 74% as RF, while J48, DT, NB and LR performed with a slightly lower
accuracy. The J48 DT, RF, and NB models had higher accuracies when compared to some
studies in the LOS context, which reported accuracies around 63–72% using variations on
classification [18,64]. There is only one other published study that used J48 DT to predict
LOS in the Australian ED context [6]. This study reported an accuracy of 85% which is
higher than our results. The inclusion of factors such as “presenting condition”, which
were removed from this study due to data quality issues could be a reason contributing to a
higher model accuracy in their study [63]. Consistent with this finding, model performance
for RF and DT in this study had increased to around 83% and 82%, respectively, with
the inclusion of the attribute “presenting condition”. Despite identifying this factor to be
significant in determining ED LOS, it was not included in the final study analysis and results
due to its poor quality. In our dataset, this attribute had nearly 7000 distinct values, many
misspellings or redundancies which could not be cleaned without domain knowledge.
This study found that the performance of all the models improved after the data
were pre-processed when compared to using the data in its original form. This confirms
pre-processing data to improve its quality is an important step when implementing DMMs
as suggested in literature [46]. The J48 DT implemented in this study was successful in
identifying 944 possible outcomes based on the factors used in the study. This DT helped
identify correlation between various factors used in analysis (see Figure 2 in Section 3.4).
Out of the 16 attributes used in the analysis, only six attributes were found to have a
significant impact on patient ED LOS. These include patient age, time taken for the doctor
to first see a patient, the patient mode of arrival, triage category, need for admission, and
performing CT scans. This study found that patients aged between 64 and 74 and patients
older than 74 were more likely to have an ED LOS greater than 4 h when compared to the
others. This is consistent with other studies that suggested that older patients were likely to
have a longer ED LOS [26,36,65]. The average time taken for a doctor to first see a patient
Sensors 2022, 22, 4968 10 of 15
was found to be 73.88 min. These results were consistent with the other Australian EDs,
but higher than some outside Australia [7,66]. This research found that around 66.7% of
the patients who were admitted into the hospital and waited longer than the average time
to first be seen by a doctor had an ED LOS greater than 4 h. Additionally, those patients
who had waited a greater than average time to first be seen by a doctor and also required a
mental health review were found to have an ED LOS greater than 4 h. This is consistent
with studies that suggested that patients who wait longer to be seen by a physician, require
admission or require a mental health review experience longer ED LOS [6,47].
Studies also suggest that a longer waiting time to be seen by a doctor could be
indicative of overcrowding [34,35,67,68]. Hospitals can customize these PDMMs to make
critical decisions for reducing delays in diagnosis based on their present ED conditions. It
was also found that around 43% of those who arrived by a mode of transport other than an
ambulance, police vehicle, or community transport had an ED LOS which was less than or
equal to 4 h. Those patients who arrived at the ED in any ambulance (air or road) had a
higher probability of experiencing an ED LOS of greater than 4 h. This is consistent with
findings from previous research [2,36].
Previous studies suggest that patients who required additional diagnostics were more
likely to experience longer ED LOS [69,70]. This was found to be true only for patients
requiring CT scans. This finding is consistent with other studies that suggest that CT scans
often cause delays in ED LOS [37]. Other tests such as MRIs, ultrasounds, pathology, and
X-rays were found to have the least impact on patient ED LOS out of all 16 factors. This is
contradictory to other research, which found that MRI scans and pathology such as blood
tests were equally responsible for delays in ED LOS as CT scans [69,71]. Based on the data
used in this study, the number of CT scans that were performed were significantly higher
than any other diagnostics tests. This could explain why the results from this study indicate
only the connection of CT scans to delayed ED LOS compared to any other diagnostics.
This may also confirm the hypothesis that CT scans are often over-prescribed in EDs [37,72].
Many studies also reported that patients with less severe triage codes (above 3) ex-
perience delays in diagnosis and treatment which impacted their LOS [6,47]. Contrary to
this, our study found that more patients with triage category 3, who were aged between
50–64 and those aged >74 experienced ED LOS > 4 h compared to the rest. In the dataset
used for this research, triage category 3 had a significantly higher number of patients than
triage categories above 3 when ED LOS exceeded 4 h. Furthermore, the DT yielded an
outcome of “>4” for triage category, mostly when patient age was considered. This is
consistent with other studies that suggest that triage category impacts ED LOS based on
the age of a patient [36].
The DT obtained in this study indicated a total of 944 possible outcomes and attribute
relationships in the form of DT branches. Based on these relationships, it was also found
that the admission of a patient into the hospital is classified based on factors such as age,
triage category, and mode of arrival. This is why it was determined to have an impact on
patient ED LOS. This is consistent with previous studies that considered age to determine
patient admission and the delays in ED LOS due to these factors [6,63]. Along with
confirming this, our study was also able to determine that factors such as triage category
and mode of arrival to also determine patient admission and subsequent delays in ED LOS.
Factors such as indigenous status, gender, preferred language, and needing an interpreter
were found to have the least impact on ED LOS. Although most studies found language
not to be significant, some still suggested that language barriers caused ED delays [41].
This study also confirmed that using the column “presenting condition” resulted in
improved model performance as suggested by other studies [6,63]. We found data quality
to be a significant factor in our research and believe that adopting measures to govern data
quality would have a major impact on improving future research and the overall usability
of data available [44]. One way to govern data to improve its quality is to standardize the
information being collected. By establishing the purpose for collecting data could help
Sensors 2022, 22, 4968 11 of 15
decide rules around its collection and storage. Adopting these practices will be beneficial
to not only hospitals but also to those using these data for research [73].
impact ED LOS. This is because a shortage in staff and beds results in delayed patient
diagnosis and testing [4,78,79]. Diagnostic results handover time was also found to be
significant in increasing ED LOS. The handover time can be defined as the time taken for
the diagnostician to handover the results of the test (for example, CT scan results) to the
doctor [4]. Future work could include these factors as part of their research.
Author Contributions: Formal analysis, S.G.G. and P.B.; Investigation, S.G.G., S.G. and P.B.; Method-
ology, S.G., F.B. and P.B.; Project administration, F.B.; Resources, P.B.; Supervision, S.G. and F.B. All
authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: The study was registered as an audit with the Eastern Health
Office of Research and Ethics (QA21-010, 16 March 2021).
Informed Consent Statement: Not applicable.
Data Availability Statement: Restrictions apply to the availability of these data. Data was obtained
from Healthcare Service A and are available from the authors with the permission of Healthcare
Service A.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Forster, A.J.; Stiell, I.; Wells, G.; Lee, A.J.; Van Walraven, C. The Effect of Hospital Occupancy on Emergency Department Length
of Stay and Patient Disposition. Acad. Emerg. Med. 2003, 10, 127–133. [CrossRef] [PubMed]
2. Kusumawati, H.I.; Magarey, J.; Rasmussen, P. Analysis of factors influencing length of stay in the Emergency Department in
public hospital, Yogyakarta, Indonesia. Australas. Emerg. Care 2019, 22, 174–179. [CrossRef] [PubMed]
3. Welch, S.J.; Asplin, B.R.; Stone-Griffith, S.; Davidson, S.J.; Augustine, J.; Schuur, J.; Alliance, E.D.B. Emergency Department
Operational Metrics, Measures and Definitions: Results of the Second Performance Measures and Benchmarking Summit.
Ann. Emerg. Med. 2011, 58, 33–40. [CrossRef] [PubMed]
4. Khanna, S.; Boyle, J.; Good, N.; Lind, J. New emergency department quality measure: From access block to National Emergency
Access Target compliance. Emerg. Med. Australas. 2013, 25, 565–572. [CrossRef] [PubMed]
5. Sullivan, C.; Staib, A.; Khanna, S.; Good, N.M.; Boyle, J.; Cattell, R.; Heiniger, L.; Griffin, B.; Bell, A.J.; Lind, J.; et al. The National
Emergency Access Target (NEAT) and the 4-hour rule: Time to review the target. Med. J. Aust. 2016, 204, 354. [CrossRef]
[PubMed]
6. Rahman, A.; Honan, B.; Glanville, T.; Hough, P.; Walker, K. Using data mining to predict emergency department length of stay
greater than 4 hours: Derivation and single-site validation of a decision tree algorithm. Emerg. Med. Australas. 2020, 32, 416–421.
[CrossRef]
7. Australian Institute of Health and Welfare. Emergency Department Care. 2018. Available online: https://www.aihw.gov.au/
getmedia/9ca4c770-3c3b-42fe-b071-3d758711c23a/aihw-hse-216.pdf.aspx (accessed on 16 June 2021).
8. Kienbacher, C.L.; Steinacher, A.; Fuhrmann, V.; Herkner, H.; Laggner, A.N.; Roth, D. Factors influencing door-to-triage- and
triage-to-patient administration-time. Australas. Emerg. Care 2022, in press. [CrossRef]
9. Van Der Linden, M.C.; Khursheed, M.; Hooda, K.; Pines, J.M.; Van Der Linden, N. Two emergency departments, 6000 km apart:
Differences in patient flow and staff perceptions about crowding. Int. Emerg. Nurs. 2017, 35, 30–36. [CrossRef]
10. Hou, X.-Y.; Chu, K. Emergency department in hospitals, a window of the world: A preliminary comparison between Australia
and China. World J. Emerg. Med. 2010, 1, 180–184.
11. Health and Human Services. Victorian Emergency Minimum Dataset (VEMD) Manual: 2020–2021, 25th ed.; DHHS: Melbourne,
VIC, Australia, 2020. Available online: https://www2.health.vic.gov.au/hospitals-and-health-services/data-reporting/health-
data-standards-systems/data-collections/vemd (accessed on 16 June 2021).
12. Women’s Health Australia. Australian Longitudinal Study on Women’s Health. 2020. Available online: https://alswh.org.au/
for-data-users/linked-data-overview/state-linked-data/ (accessed on 7 June 2022).
13. Gill, S.D.; Lane, S.E.; Sheridan, M.; Ellis, E.; Smith, D.; Stella, J. Why do ‘fast track’ patients stay more than four hours in
the emergency department? An investigation of factors that predict length of stay. Emerg. Med. Australas. 2018, 30, 641–647.
[CrossRef]
14. Hall, M.; Frank, E.; Holmes, G.; Pfahringer, B.; Reutemann, P.; Witten, I.H. The WEKA data mining software: An update.
ACM SIGKDD Explor. Newsl. 2009, 11, 10–18. [CrossRef]
15. Holmes, G.; Donkin, A.; Witten, I.H. Weka: A machine learning workbench. In Proceedings of the ANZIIS’94-Australian New
Zealand Intelligent Information Systems Conference, Brisbane, QLD, Australia, 29 November–2 December 1994; pp. 357–361.
16. Graham, B.; Bond, R.; Quinn, M.; Mulvenna, M. Using data mining to predict hospital admissions from the emergency department.
IEEE Access 2018, 6, 10458–10469. [CrossRef]
Sensors 2022, 22, 4968 13 of 15
17. Benbelkacem, S.; Kadri, F.; Chaabane, S.; Atmani, B. A data mining-based approach to predict strain situations in hospital
emergency department systems. In Proceedings of the 10ème Conférence Francophone de Modélisation, Optimisation et
Simulation-MOSIM’14, Nancy, France, 5–7 November 2014.
18. Azari, A.; Janeja, V.P.; Mohseni, A. Predicting hospital length of stay (PHLOS): A multi-tiered data mining approach. In
Proceedings of the IEEE 12th International Conference on Data Mining Workshops, Brussels, Belgium, 10 December 2012;
pp. 17–24.
19. Christ, M.; Grossmann, F.; Winter, D.; Bingisser, R.; Platz, E. Modern triage in the emergency department. Dtsch. Ärzteblatt Int.
2010, 107, 892–898. [CrossRef] [PubMed]
20. Ceglowski, A.; Churilov, L.; Wassertheil, J. Knowledge discovery through mining emergency department data. In Proceedings of
the 38th Annual Hawaii International Conference on System Sciences, Big Island, HI, USA, 3–6 January 2005.
21. Uriarte, A.G.; Zúñiga, E.R.; Moris, M.U.; Ng, A.H. How can decision makers be supported in the improvement of an emergency
department? A simulation, optimization and data mining approach. Oper. Res. Health Care 2017, 15, 102–122. [CrossRef]
22. Hand, D.J. Data mining: New challenges for statisticians. Soc. Sci. Comput. Rev. 2000, 18, 442–449. [CrossRef]
23. Middleton, B.; Sittig, D.; Wright, A. Clinical Decision Support: A 25 Year Retrospective and a 25 Year Vision. Yearb. Med. Inform.
2016, 25, S103–S116.
24. Musen, M.; Middleton, B.; Greenes, R. Clinical decision-support systems. In Biomedical Informatics; Shortliffe, E.H., Cimino, J.J.,
Eds.; Springer: London, UK, 2014; pp. 643–674.
25. Hiscock, H.; Neely, R.J.; Lei, S.; Freed, G. Pediatric mental and physical health presentations to emergency departments, Victoria,
2008–2015. Med. J. Aust. 2018, 208, 343–348. [CrossRef]
26. Chaou, C.-H.; Chen, H.-H.; Chang, S.-H.; Tang, P.; Pan, S.-L.; Yen, A.M.-F.; Chiu, T.-F. Predicting Length of Stay among
Patients Discharged from the Emergency Department—Using an Accelerated Failure Time Model. PLoS ONE 2017, 12, e0165756.
[CrossRef]
27. Hachesu, P.R.; Ahmadi, M.; Alizadeh, S.; Sadoughi, F. Use of Data Mining Techniques to Determine and Predict Length of Stay of
Cardiac Patients. Health Inform. Res. 2013, 19, 121–129. [CrossRef]
28. Kantardzic, M. Data Mining: Concepts, Models, Methods, and Algorithms; John Wiley & Sons: Hoboken, NJ, USA, 2011.
29. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [CrossRef]
30. Jabbar, M.; Deekshatulu, B.; Chndra, P. Alternating decision trees for early diagnosis of heart disease. In Proceedings of the
International Conference on Circuits, Communication, Control and Computing, Bangalore, India, 21–22 November 2014; pp.
322–328.
31. Alyahya, M.S.; Hijazi, H.H.; Alshraideh, H.A.; Al-Nasser, A.D. Using decision trees to explore the association between the length
of stay and potentially avoidable readmissions: A retrospective cohort study. Inform. Health Soc. Care 2017, 42, 361–377. [CrossRef]
32. Liu, P.; Lei, L.; Yin, J.; Zhang, W.; Naijun, W.; El-Darzi, E. Healthcare data mining: Prediction inpatient length of stay. In
Proceedings of the 3rd International IEEE Conference Intelligent Systems, London, UK, 4–6 September 2006; pp. 832–837.
33. Biber, R.; Bail, H.J.; Sieber, C.; Weis, P.; Christ, M.; Singler, K. Correlation between age, emergency department length of stay and
hospital admission rate in emergency department patients aged ≥70 years. Gerontology 2013, 59, 17–22. [CrossRef] [PubMed]
34. Brick, C.; Lowes, J.; Lovstrom, L.; Kokotilo, A.; Villa-Roel, C.; Lee, P.; Lang, E.; Rowe, B.H. The impact of consultation on length of
stay in tertiary care emergency departments. Emerg. Med. J. 2014, 31, 134–138. [CrossRef] [PubMed]
35. Asaro, P.V.; Lewis, L.M.; Boxerman, S.B. The impact of input and output factors on emergency department throughput.
Acad. Emerg. Med. 2007, 14, 235–242. [CrossRef] [PubMed]
36. Casalino, E.; Wargon, M.; Peroziello, A.; Choquet, C.; Leroy, C.; Beaune, S.; Pereira, L.; Bernard, J.; Buzzi, J.-C. Predictive factors
for longer length of stay in an emergency department: A prospective multicentre study evaluating the impact of age, patient’s
clinical acuity and complexity, and care pathways. Emerg. Med. J. 2013, 31, 361–368. [CrossRef] [PubMed]
37. Gardner, R.L.; Sarkar, U.; Maselli, J.H.; Gonzales, R. Factors associated with longer ED lengths of stay. Am. J. Emerg. Med. 2007, 25,
643–650. [CrossRef]
38. McCusker, J.; Karp, I.; Cardin, S.; Durand, P.; Morin, J. Determinants of emergency department visits by older adults: A systematic
review. Acad. Emerg. Med. Off. J. Soc. Acad. Emerg. Med. 2003, 10, 1362–1370. [CrossRef]
39. Rashid, A.; Brooks, T.R.; Bessman, E.; Mears, S.C. Factors Associated with Emergency Department Length of Stay for Patients
With Hip Fracture. Geriatr. Orthop. Surg. Rehabil. 2013, 4, 78–83. [CrossRef]
40. Liew, D.; Liew, D.; Kennedy, M.P. Emergency department length of stay independently predicts excess inpatient length of stay.
Med. J. Aust. 2003, 179, 524–526. [CrossRef]
41. Wechkunanukul, K.; Grantham, H.; Damarell, R.; Clark, R.A. The association between ethnicity and delay in seeking medical care
for chest pain: A systematic review. JBI Evid. Synth. 2016, 14, 208–235. [CrossRef]
42. Ramirez, D.; Engel, K.G.; Tang, T.S. Language Interpreter Utilization in the Emergency Department Setting: A Clinical Review.
J. Health Care Poor Underserved 2008, 19, 352–362. [CrossRef]
43. Karaca, Z.; Wong, H.S.; Mutter, R.L. Duration of patients’ visits to the hospital emergency department. BMC Emerg. Med. 2012,
12, 15. [CrossRef] [PubMed]
44. Tayi, G.; Ballou, D. Examining data quality. Commun. ACM 1998, 41, 54–57. [CrossRef]
45. Marson, R.; Taylor, D.M.; Ashby, K.; Cassell, E. Victorian Emergency Minimum Dataset: Factors that impact upon the data quality.
Emerg. Med. Australas. 2005, 17, 104–112. [CrossRef] [PubMed]
Sensors 2022, 22, 4968 14 of 15
46. Dasu, T.; Johnson, T. Exploratory Data Mining and Data Cleaning; John Wiley & Sons: Hoboken, NJ, USA, 2003.
47. Yoon, P.; Steiner, I.; Reinhardt, G. Analysis of factors influencing length of stay in the emergency department. Can. J. Emerg. Med.
2003, 5, 155–161. [CrossRef]
48. Lee, K.; Palsetia, D.; Narayanan, R.; Patwary, M.M.A.; Agrawal, A.; Choudhary, A. Twitter trending topic classification. In
Proceedings of the IEEE 11th International Conference on Data Mining Workshops, Vancouver, BC, Canada, 11 December 2011;
pp. 251–258.
49. Agresti, A. Categorical Data Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2003.
50. Nishisato, S. Analysis of Categorical Data: Dual Scaling and Its Applications; Toronto University Press: Toronto, ON, Canada, 1980.
51. Esling, P.; Agon, C. Time-series data mining. ACM Comput. Surv. (CSUR) 2012, 45, 12. [CrossRef]
52. Reitermanova, Z. Data splitting. In WDS’10 Proceedings of Contributed Papers, Prague, Czech Republic, 1–4 June 2010; pp. 31–36.
53. Bhargava, N.; Sharma, G.; Bhargava, R.; Mathuria, M. Decision tree analysis on J48 algorithm for data mining. Int. J. Adv. Res.
Comput. Sci. Softw. Eng. 2013, 3, 1114–1119.
54. Rodríguez, J.D.; Pérez, A.; Lozano, J. Sensitivity analysis of kappa-fold cross validation in prediction error estimation. IEEE Trans.
Pattern Anal. Mach. Intell. 2010, 32, 569–575. [CrossRef]
55. Smith, T.C.; Frank, E. Introducing machine learning concepts with WEKA. In Statistical Genomics; Humana Press: New York, NY,
USA, 2016; pp. 353–378.
56. Witten, I.H. Data Mining with Weka. Available online: https://cs.famaf.unc.edu.ar/~{}laura/Weka_workshop/Slides.pdf
(accessed on 16 June 2021).
57. An, S.; Hu, Q.; Wang, C.; Guo, G.; Li, P. Data reduction based on NN-k NN measure for NN classification and regression. Int. J.
Mach. Learn. Cybern. 2021, 12, 1649–1665. [CrossRef]
58. O’farrell, M.; Lewis, E.; Flanagan, C.; Lyons, W.; Jackman, N. Comparison of k-NN and neural network methods in the
classification of spectral data from an optical fibre-based sensor system used for quality control in the food industry. Sens. Actuators
B Chem. 2005, 111, 354–362. [CrossRef]
59. Zhang, S.; Li, X.; Zong, M.; Zhu, X.; Wang, R. Efficient kNN Classification with Different Numbers of Nearest Neighbors. IEEE
Trans. Neural Netw. Learn. Syst. 2017, 29, 1774–1785. [CrossRef]
60. Mandrekar, J.N. Receiver Operating Characteristic Curve in Diagnostic Test Assessment. J. Thorac. Oncol. 2010, 5, 1315–1316.
[CrossRef] [PubMed]
61. Sesen, M.B.; Nicholson, A.; Banares-Alcantara, R.; Kadir, T.; Brady, M. Bayesian Networks for Clinical Decision Support in Lung
Cancer Care. PLoS ONE 2013, 8, e82349. [CrossRef]
62. Horng, S.; Sontag, D.A.; Halpern, Y.; Jernite, Y.; Shapiro, N.I.; Nathanson, L.A. Creating an automated trigger for sepsis clinical
decision support at emergency department triage using machine learning. PLoS ONE 2017, 12, e0174708. [CrossRef] [PubMed]
63. Rendell, K.; Koprinska, I.; Kyme, A.; Ebker-White, A.A.; Dinh, M. The Sydney Triage to Admission Risk Tool (START2) using
machine learning techniques to support disposition decision-making. Emerg. Med. Australas. 2018, 31, 429–435. [CrossRef]
[PubMed]
64. Livieris, I.E.; Dimopoulos, I.F.; Kotsilieris, T.; Pintelas, P. Predicting length of stay in hospitalized patients using SSL algorithms.
In Proceedings of the 8th International Conference on Software Development and Technologies for Enhancing Accessibility and
Fighting Info-Exclusion, New York, NY, USA, 20–22 June 2018; pp. 16–22.
65. Cheng, I.; Taylor, D.; Schull, M.J.; Zwarenstein, M.; Kiss, A.; Castren, M.; Brommels, M.; Yeoh, M.; Kerr, F. Comparison of
emergency department time performance between a Canadian and an Australian academic tertiary hospital. Emerg. Med.
Australas. 2019, 31, 605–611. [CrossRef] [PubMed]
66. Ahmad, B.A.; Khairatul, K.; Farnaza, A. An assessment of patient waiting and consultation time in a primary healthcare clinic.
Malays. Fam. Physician Off. J. Acad. Fam. Physicians Malays. 2017, 12, 14–21.
67. Morley, C.; Unwin, M.; Peterson, G.M.; Stankovich, J.; Kinsman, L. Emergency department crowding: A systematic review of
causes, consequences and solutions. PLoS ONE 2018, 13, e0203316. [CrossRef]
68. Ono, T.; Tamai, A.; Takeuchi, D.; Tamai, Y.; Iseki, H.; Fukushima, H.; Kasahara, S. Predictors of length of stay in a ward for
demented elderly: Gender differences. Psychogeriatrics 2010, 10, 153–159. [CrossRef]
69. Kocher, K.E.; Meurer, W.J.; Desmond, J.S.; Nallamothu, B.K. Effect of Testing and Treatment on Emergency Department Length of
Stay Using a National Database. Acad. Emerg. Med. 2012, 19, 525–534. [CrossRef]
70. Mentzoni, I.; Bogstrand, S.T.; Faiz, K.W. Emergency department crowding and length of stay before and after an increased
catchment area. BMC Health. Serv. Res. 2019, 19, 506. [CrossRef]
71. Francis, A.J.; Ray, M.J.; Marshall, M.C. Pathology processes and emergency department length of stay: The impact of change.
Med. J. Aust. 2009, 190, 665–669. [CrossRef] [PubMed]
72. Ullrich, M.; LaBond, V.; Britt, T.; Bishop, K.; Barber, K. Influence of emergency department patient volumes on CT utilization rate
of the physician in triage. Am. J. Emerg. Med. 2021, 39, 11–14. [CrossRef] [PubMed]
73. Information Management Group. March. Data Quality Guideline (Version 1). 2018. Available online: https://www.vic.gov.au/
sites/default/files/2019-07/IM-GUIDE-09-Data-Quality-Guideline.pdf (accessed on 16 June 2021).
74. Fryer, D.; Strümke, I.; Nguyen, H. Shapley values for feature selection: The good, the bad, and the axioms. IEEE Access 2021, 9,
144352–144360. [CrossRef]
Sensors 2022, 22, 4968 15 of 15
75. Bollepalli, S.C.; Sahani, A.K.; Aslam, N.; Mohan, B.; Kulkarni, K.; Goyal, A.; Singh, B.; Singh, G.; Mittal, A.; Tandon, R.; et al. An
Optimized Machine Learning Model Accurately Predicts In-Hospital Outcomes at Admission to a Cardiac Unit. Diagnostics 2022,
12, 241. [CrossRef] [PubMed]
76. Arnaud, É.; Elbattah, M.; Gignon, M.; Dequen, G. Deep learning to predict hospitalization at triage: Integration of structured data
and unstructured text. In Proceedings of the 2020 IEEE International Conference on Big Data, Atlanta, GA, USA, 10–13 December
2020; pp. 4836–4841.
77. Baier, N.; Geissler, A.; Bech, M.; Bernstein, D.; Cowling, T.E.; Jackson, T.; van Manen, J.; Rudkjøbing, A.; Quentin, W. Emergency
and urgent care systems in Australia, Denmark, England, France, Germany and the Netherlands—Analyzing organization,
payment and reforms. Health Policy 2019, 123, 1–10. [CrossRef]
78. Di Somma, S.; Paladino, L.; Vaughan, L.; Lalle, I.; Magrini, L.; Magnanti, M. Overcrowding in emergency department: An
international issue. Intern. Emerg. Med. 2015, 10, 171–175. [CrossRef]
79. Harris, A.; Sharma, A. Access block and overcrowding in emergency departments: An empirical analysis. Emerg. Med. J. 2010, 27,
508–511. [CrossRef]