Abstract
We present a multi-device and multi-modal dataset, called WEEE, collected from 17 participants while they were performing different physical activities. WEEE contains: (1) sensor data collected using seven wearable devices placed on four body locations (head, ear, chest, and wrist); (2) respiratory data collected with an indirect calorimeter serving as ground-truth information; (3) demographics and body composition data (e.g., fat percentage); (4) intensity level and type of physical activities, along with their corresponding metabolic equivalent of task (MET) values; and (5) answers to questionnaires about participants’ physical activity level, diet, stress and sleep. Thanks to the diversity of sensors and body locations, we believe that the dataset will enable the development of novel human energy expenditure (EE) estimation techniques for a diverse set of application scenarios. EE refers to the amount of energy an individual uses to maintain body functions and as a result of physical activity. A reliable estimate of people’s EE thus enables computing systems to make inferences about users’ physical activity and help them promoting a healthier lifestyle.
Measurement(s) | Heart rate • Accelerometer • Oxygen Consumption • Gyroscope • Respiratory Rate |
Technology Type(s) | PPG/ECG sensors • Accelerometer sensors • VO2 Master Analyzer • Gyroscope sensors • Wahoo ticker |
Sample Characteristic - Organism | Healthy subjects |
Sample Characteristic - Environment | Laboratory setting |
Similar content being viewed by others
Background & Summary
Human energy expenditure (EE) refers to the amount of energy an individual uses to maintain essential body functions (respiration, circulation, digestion) and as a result of physical activity1. Knowledge regarding the expended energy or calories could help people (e.g., athletes, obese, diabetic) to plan their physical activity for leading a healthier lifestyle2. Additionally, it could be used to enable nutrition coaching for weight management purposes3. Devising methods for EE estimation (EEE) is a key enabler of the mentioned intervention strategies and it is the core goal of the dataset presented in this paper.
The gold-standard EE measurement methods are direct calorimetry–which measures body heat while the subject is inside a chamber–, indirect calorimetry–that consists of a mouth piece worn for respiratory gases analysis–and doubly labeled water–which measures carbon dioxide production during the interval between first and last body water samples3,4,5. Such techniques require the use of cumbersome and expensive equipment and are not feasible to measure EE in free-living conditions for specific activities on a minute by minute basis. Measuring EE in real-world scenarios in a fine-grained manner would enable obtaining valuable information regarding people’s physical activity and providing personalized and timely recommendations.
Considering the cost and practical limitations of gold-standard methods combined with the proliferation of ubiquitous computing3, several researchers started exploring the use of mobile and wearable devices for EEE6,7,8. Such devices are suitable for continuous monitoring of EE because they are unobtrusive and do not hamper the natural behavior of the user in free-living conditions. Additionally, they have the potential to provide a cheap and reliable solution to this problem. Despite the considerable research progress in sensor-based EEE, several challenges remain open. In particular, it is not evident which type of sensor, body position or combination thereof would enable reliable EEE. Also, there is a lack of studies investigating the quality of data and how it influences the robustness of EEE. Such investigations are impeded by the lack of sensor-diverse, multimodal and publicly available datasets, which could potentially enable the development of more accurate EEE techniques4,7. While there exist commercial wearable devices that measure EE (mainly using demographics data and accelerometer sensor), it is not clear how they compare to gold-standard measurements (e.g., indirect calorimetry) and new sensor-based techniques (e.g., physiological sensors).
To overcome such barriers and foster further developments in EEE, in this paper, we introduce a new, multimodal dataset collected from 17 participants using 7 wearable devices, each containing multiple sensors. The goal of the dataset is to enable the design and development of new sensor-based EEE techniques during rest and physical activity. To this goal, we design and run a data collection protocol, which consists of three activities, such as resting, cycling and running, each performed for 10 minutes. We picked these activities because they involve movements of different intensity levels (e.g., light, moderate and vigorous). In addition, they require full-, half- or no-body movement, which are representative of physical activities performed in everyday life, as discussed in3. Each physical activity was performed at two intensity levels to cover a wider range of movement intensity and explore the EE changes during such intensities. For instance, participants ran at two different speeds for 5 minutes each.
The dataset is collected using an indirect calorimeter, a headband, earbuds, two chest-belts (a commercial and a gold-standard device), and three wristbands (a research-grade and two commercial devices). At least one or more devices include the following sensor data: oxygen consumption (VO2), fraction of oxygen in expired breath (FeO2), air moved by the lungs (Ve), volume breathed in a breath (Tv), breaths per minute (BR), humidity (H), temperature (T), pressure (P), acceleration (ACC), gyroscope (GYRO), photoplethysmography (PPG), electrocardiography (ECG), electrodermal activity (EDA), skin temperature (TEMP) and electroencephalography (EEG) and information derived from sensors such as e.g., heart rate (HR), heart rate variability (HRV), breathing rate (BR), body posture and more.
Table 1 presents an overview of existing datasets in the literature that enable EE modeling using sensor data. Only two of the existing datasets are publicly available for download, e.g.3,9, marked with “Yes” in the “Publicly Available” column of the table. In comparison to these datasets, our dataset contains a higher number of unique data sources (in total 18). Further, it is the only dataset that contains ACC and HR from multiple body locations, such as the ear, wrist, and chest, which allows researchers to investigate the development of novel techniques for EE estimation. Only Bouarfa et al.10 investigated the use of ACC placed on the ear to estimate EE. However, estimating EE from ACC and HR data collected from the ear has not yet been explored. Additionally, WEEE contains data from both medical grade devices (e.g., Zephyr Bioharness) and commercial devices (e.g., Fitbit sense and Apple Watch), which enables the comparison of HR measurements between such devices.
Methods
To enable multimodal EE modeling, we design a controlled experiment and ask participants to perform a set of pre-defined activities. We opt for a controlled study because, despite its constraints, it enables running detailed analysis of the phenomenon under investigation and it is suitable for the replicability of the data collection procedure. In this section, we provide details about the participants, data collection setup and protocol, and the collected data.
Participants
We recruited 17 participants (12 males and 5 females) using snowball sampling11. Participants were of age between 23 and 41 years old (MEAN = 30, STD = 5) and with an average BMI of 24.5 (STD = 2.9). The study was conducted following the ethical regulations at our institution. All the participants signed an informed consent form and agreed their data to be used for research purposes. Participants were instructed to wear comfortable attire for the experiment. Also, we asked participants to be in a rested and fasting state by refraining from endurance training for 24 hours prior the study and avoiding caffeine, tobacco, alcohol, and food intake 3 hours before the experiment. Participants were compensated with a £20.- gift card.
Setup
As a preparation for each experiment, we charged the devices and visually verified that the clock of each device matched the same time reference to ensure synchronization among the devices. This included checking for the date, time (in terms of hours, minutes and seconds) and time zone. Before the experiment, participants completed a set of questionnaires regarding their eating habits, sleep, stress and physical activity level. Before starting the experiment, we asked the participants to step up on the QardioBase smart scale (https://www.qardio.com/qardiobase-smart-scale-iphone-android/) to measure body composition metrics (e.g., weight, muscle percentage). We then placed the devices as follows on the participant: VO2 Master Analyzer (https://vo2master.com/) on the face, Nokia Bell Labs earbuds12,13 on the right ear, Muse S headband (https://choosemuse.com/muse-s/) on the head, Empatica E4 wristband14 on the non-dominant hand, Zephyr BioHarness chestbelt (https://www.zephyranywhere.com/) and Wahoo Tickr chest strap (https://eu.wahoofitness.com/devices/heart-rate-monitors) on the chest, Fitbit Sense watch (https://www.fitbit.com/global/us/products/smartwatches/sense) and Apple watch (https://www.apple.com/apple-watch-series-6/index.html) on the dominant hand. Figure 1 presents an overview of the study setup, devices used and their location. We ensured proper attachment of the face mask and calibration of the flow sensor, as recommended in3. Muse S headband, Zephyr Bioharness and Wahoo chestbelts were moisturized with water before attaching them to participant’s body. The earbuds are a multi-sensory earable device under development by the Nokia Bell Labs, which has been already tested in12,13,15. The VO2 Master Analyzer device has a smaller size than the major portable metabolic analyzer brands, which makes it a suitable option for VO2 measurements. Montoye et al.16 have shown acceptable validity and reliability of this device in comparison to gold-standard measurements. Furthermore, the VO2 Master Analyzer is compatible with other devices, such as, e.g., the Wahoo Tickr– validated in17–, which makes it easier for the researchers to obtain additional data (e.g., heart rate) together with VO2 measurements. The Zephyr BioHarness chest belt contains an ECG sensor, which provides heart rate measurements. Nazari et al.18 have shown evidence of the reliability and validity of heart rate measurements across multiple contexts using this device. The Zephyr BioHarness has been used also in other studies3,19,20,21. The Empatica E4 device is a watch-like, multi-sensor device. It is light, easy to use and comfortable to wear, which makes it suitable to monitor people’s energy expenditure. Additionally, the Empatica E4 provides the raw sensor data as well as encrypts the data during transfer and does not store user’s personal data, which is convenient to preserve the privacy of the study participants. The Empatica E4 has been extensively used in the literature for energy expenditure estimation21, but also other tasks15,22,23. We chose the Fitbit and Apple watch devices because they are among the most popular smartwatches available in the market, as shown in a recent article by The Economist magazine in24. Also, they have shown high accuracy for measuring heart rate during physical activities considered in our work (e.g., cycling, running)25. We chose the Muse S device because it is a portable and unobtrusive brain-sensing headband and has been previously validated in the literature15,26,27.
Procedure
Figure 2 depicts an overview of the study protocol. Participants followed a predefined set of activities, similar to28,29, grouped into three parts: resting, cycling and running. During resting, participants were asked to sit on a chair and stand on their feet, for 5 minutes each, to obtain physiological data during a resting state. After that, they cycled in an indoor bike and run on a treadmill, for 10 minutes in each activity. Both cycling and running activity were performed in two intensity levels, each of 5 minutes. We used a window of 5 minutes for each activity to reach a steady state EE, as recommended in3. The intensities of these activities were selected by the participants to represent their individual habits, as suggested in previous work30,31. The total duration of the experiment was 30 minutes. For consistency, the bicycle resistance and treadmill inclination were kept the same for all participants.
Data collection protocol. Speed 1 and Speed 2 during cycling and running refer to low and high speed levels. Such levels were chosen by the individual based on their fitness level. The average speed during the low and high levels of cycling was 15.64 (STD = 3.33) and 24.17 (STD = 4.94), and for running was 5.93 (STD = 1.48) and 8.58 (STD = 2.55).
We picked resting, cycling and running activities because these activities involve movements of different intensity levels (e.g., light, moderate and vigorous). For instance, sitting or standing requires no or light movement, cycling requires half-body or moderate movement and running full body or vigorous movement. We run the protocol from low to high intensity to avoid the impact of high activities into low intensity ones.
Collected data
We collect five types of data: sensor data, respiratory gases, demographics and body composition, activity data and questionnaires data explained as follows.
Sensor and respiratory gases
Table 2 shows an overview of the characteristics of devices used to collect WEEE dataset. The table presents the device used, device location, the type of data that was collected for each device as well as paper(s) that validated the sensor readings of the device. The table shows that WEEE contains data from 8 different devices (including an indirect calorimeter serving as ground-truth information) placed on 5 unique body locations. Some of the sensors (e.g., ACC, PPG) are available in more than one body location (e.g., ear, wrist, chest).
Demographics and body composition
To collect body composition and demographics data, we use QardioBase smart scale. In particular, we collect participants’ gender, age, height, weight, percentage of body fat, muscle, bone, water and body mass index (BMI). Muscle mass percentage is calculated as the percentage of muscle in the body as compared to total body weight. Table 3 shows the mean (standard deviation) of the demographics and body composition data for all participants as well as for participants with female or male body types. The range of BMI is 20 to 30 kg/m2 (MEAN = 24.5, STD = 2.9).
Activity data
We derive labels regarding the activity performed from the protocol. Also, we kept notes of the intensity level (speed) of each activity. To enable further comparisons, we include the metabolic equivalent of a task (MET) values for each activity type based on intensity as defined in the compendium of physical activities32.
Questionnaires
We assess participant’s physiological and physical state before the experiment using validated questionnaires. In particular, we evaluate their sleep quality level over the past month using the Pittsburgh Sleep Quality Index (PSQI)33 and sleepiness level before the experiment using the Stanford Sleepiness Scale (SSS)34. Participants also report their stress level using the Perceived Stress Scale (PSS)35, physical activity level using the International Fitness Scale (IFIS)36, the readiness for physical activity using the Physical Activity Readiness (PAR-Q)37, and How healthy is your diet? to measure the nutritional value of their diet, which have an impact on EE.
Data Records
The raw data can be found at Zenodo38 and the dataset is available for download at this link: https://doi.org/10.5281/zenodo.6420886. Data of each participant has been anonymized with an alphanumeric format of P#, to which we refer to as participant identifier, and is placed on separate folders named with participant identifier (e.g., P1). The dataset contains a folder for each participant and some other files described as follows: Demographics.csv contains demographics (e.g., gender, age) and body composition data (e.g., BMI, percentage of fat, muscle, water, bone) for each participant in an anonymous format, Study_Information.csv contains the start and end time of each study condition (e.g., start time of the sitting or cycling activity), speed of cycling/running and MET information for each activity, Questionnaires folder contains the answers to the pre-study questionnaires regarding participants’ physiological state. Within each participant folder, there are five other folders, namely, VO2, EARBUDS, E4, ZEPHYR, and MUSE, which contain the raw data obtained from each device during data collection. Table 4 provides an overview and description of the main files inside a participant folder.
Missing data
The MUSE S device data of participant P02 is missing due to a malfunction in the streaming of the sensor data to the third-party app MindMonitor (https://mind-monitor.com/), which we used to collect the data. Part of the VO2 data of P03 and P12 during the cycling condition and of P16 during the running condition was lost due to issues with the indirect calorimeter VO2 sensor.
Technical Validation
We evaluate the technical validity of the dataset, i.e., whether the sensor measure what they are expected, in three ways: (1) by providing descriptive statistics of the data in comparison to the device manuals, (2) by investigating the relationship between physiological signals collected from different body locations and (3) comparing the changes in sensor data for different physical activities, as suggested in39.
Table 5 presents descriptive statistics of the collected data for each device together with reference values obtained from the devices’ manuals. These statistics support the validity of the dataset because the minimum and maximum values obtained from the sensors are within the expected range for the majority of the sensors. For instance, the minimum (47) and maximum (209) HR values measured with ZEPHYR are inside the expected range of [25:240]. Similarly, the minimum (0.87) and maximum (1.91) values of ACC sensor measured with the E4 devices are within the ±2 range. These observations confirm that the data in WEEE dataset are as expected according to the devices’ manuals. We observe that the minimum HR derived from the E4 and earbuds fall below the expected minimum, this could be due to the presence of motion artifacts in PPG signal from which HR is derived. We recommend careful identification and removal of artifacts in the PPG signal before further analysis.
To further evaluate the validity of our dataset, we explore the association between physiological signals collected from different body locations. Given that HR and ACC data are available from multiple body positions, we investigate the relationship between such data collected from different body positions. To perform this analysis, we compute Pearson product-moment correlation when data samples conform to a Gaussian distribution and Spearman rank correlation otherwise, as a common procedure in the literature40. We use Shapiro-Wilk test to verify whether the data conforms a Gaussian distribution. We test the p-values against both p < 0.05 threshold as well as the corrected threshold (\({p}_{c}=\frac{p}{n}=0.01\), where n refers to body locations or devices and is equal to 5), to account for the Bonferroni correction41. Figure 3 presents the heatmap with correlations coefficients between sensor data collected from different devices. As expected, we observe that the motion data (e.g., ACC, GYRO) collected from the ear, chest or wrist is significantly positively correlated to each other (p < 0.01).
We further explore the difference in sensor data for each physical activity. Figure 4 shows the distribution of EE measured using the indirect calorimetry (left), HR (middle) and GYRO (right) data measured using earbuds. As expected the average amount of EE during activities with high intensity movements is higher than for those with low intensity movements. For instance, the average EE during running or cycling are higher than during resting activities (e.g., sitting and standing). We observe similar patterns for HR and GYRO sensor data. This exploration of the data further confirms the validity and reliability of the collected data.
Usage Notes
The WEEE dataset38 is available for download at this link: https://doi.org/10.5281/zenodo.6420886. The dataset website is https://wearableenergyexpenditure.github.io/. To analyze the dataset, we recommend using existing libraries for preprocessing and cleaning the physiological signals. In particular, the possible libraries that could be used are: HeartPy (https://python-heart-rate-analysis-toolkit.readthedocs.io/en/latest/) to extract heart rate data from PPG or ECG sensors, NeuroKit (https://neurokit2.readthedocs.io/en/latest/index.html) and BrainFlow (https://github.com/brainflow-dev/brainflow) to analyze EEG, PPG, ECG, and other kinds of data from physiological sensors available in the WEEE dataset, cvxEDA (https://github.com/lciti/cvxEDA) for decomposing the EDA signal into the phasic and tonic components, EDArtifact (https://github.com/shkurtagashi/EDArtifact) for exploring, preprocessing and identifying artifacts in EDA data and EDAExplorer (https://github.com/MITMediaLabAffectiveComputing/eda-explorer) to extract peaks from EDA signals and extract features from the ACC sensor.
The WEEE dataset fosters research and development of new solutions to problems as follows:
-
Device/Sensor Fusion: The dataset contains raw measurements from sensors in multiple devices placed on the head, ear, wrist and chest. Thanks to its large number of wearable devices and sensor types, the dataset enables exploration of which sensor (device) or combination thereof enables a more accurate measurement of EE. For instance, the dataset enables exploring different sensor (device) fusion strategies such as e.g., stacking sensor channels one after the other, multi-input architecture, ensemble methods, and feature concatenation.
-
Sensor Location: Researchers may further explore how the sensor position impacts the EEE. To the best of our knowledge, our dataset enables for the first time using heart rate and motion data collected from the ear for EEE and comparing it to the same data sources collected from other body positions.
-
Individual Characteristics: The literature has shown that age, gender, body size and composition have an impact in EE. For instance, individuals with a larger body require a higher amount of energy than those with smaller body size because of the amount of tissues4. Our dataset enables a systematic, data-driven exploration of the impact of such individual characteristics in EEE.
-
Context Information: Several researchers have shown that combining human activity recognition and EEE generally leads to better EEE6. Our dataset contains information about the type of activity that participants performed and its intensity level, which allows researchers investigating methods to simultaneously recognize the activity type, intensity level and EEE as well as understand their impact in EEE.
-
Physiological Conditions: Investigating the impact of physiological conditions, e.g., physical activity level, diet, stress, and sleep in the overall EEE.
-
Data Quality: Exploration of the impact of data quality (e.g., presence of noise and missing data) in the overall EEE. For instance, researchers could develop new methods to leverage the data from available sensors to handle noisy data, missing data points, missing sensor or device problems.
-
State-of-the-art Comparison: The dataset also contains HR measurements from ECG sensor measured with Zephyr Bioharness and PPG sensor measured with research-grade devices (e.g., Empatica E4) and commercial devices (e.g., Fitbit and Apple Watch). This enables benchmarking existing HR-based EE measurement methods with new ones. Further, our dataset contains EE measurements from indirect calorimetry, which is one of the gold-standard measurement techniques for EE as well as METs derived from the compendium of physical activities based on activity type and intensity level. This enables the comparison of sensor-based EEE with gold-standard techniques.
While the WEEE data set opens up novel opportunities for computing systems that monitor energy expenditure, our approach presents some limitations and opportunities for further improvements. The first limitation stems from the low number of physical activities investigated. We opted for this decision to avoid having a long experiment protocol and to avoid causing fatigue to our study participants. Future work should consider extending our approach by adding more various physical activities. Even if our data set contains 3 activities, each of these activities has been performed in two intensity levels, which make the data set diverse in terms of types of activities and intensity levels.
Indirect calorimetry data
The data collected from the indirect calorimetry can be used as a ground truth in future analysis. To prepare indirect calorimetry data for the analysis, the VO2 data should first be cleaned, for instance, by removing the values when VO2 sensor did not record any data (e.g., VO2 = 0). Then VO2 data should be converted to EE using equations from the literature e.g., in4.
Earbuds data
To use the data collected from earbuds, one should first convert the raw ACC data to milli-g by multiplying it with 0.061 and the raw GYRO data to milli-dps (degrees per second) by multiplying with 17.5. This is to convert the raw data coming from the sensor from integer format to a more usable format (i.e., milli-g and milli-dps). Then remove the direct current (DC) offset from the GYRO data by applying a Butterworth band-pass filter. To clean the PPG signal, one could apply a Butterworth band-pass filter and then extract HR using the NeuroKit library mentioned before.
Wristband data
To clean ACC and TEMP data, we suggest to apply a central moving average filter with a window of 1 minute, similar to23. Then to compute the ACC magnitude. The EDA data should be cleaned using a first order Butterworth low-pass filter with a cut-off frequency of 0.6 Hz, similar to42. The EDA data can further be dicomposed into the tonic–the slowly changing component–and phasic–characterized by skin conductance responses (SCRs) or peaks that occur as a result of a stimuli–components, using the cvxEDA method proposed by Greco et al.43. To clean the PPG data, a first order Butterworth FIR filter with a cut-off frequency of 5 Hz should be applied, as suggested in44. The HR data can then be derived from PPG using the NeuroKit library45.
Questionnaire data
Figures 5 to 15 present a summary of the answers received from all the participants for the PSQI, SSS, IFIS, PSS and “How healthy is your diet?” questionnaires. Such data can be used as additional information regarding the physical and physiological state of participants before the experiment.
PSQI questionnaire33–Answers to the question “During last week, how would you rate your sleep quality overall?”.
SSS questionnaire34–Answers to the question “Please report your degree of sleepiness at the moment”.
IFIS questionnaire36–Answers to the question “Please try to think about your level of physical fitness (compared to your friends) and choose the right option”.
PSS questionnaire35–Answers to the question “In the last week, how often have you…”.
How healthy is your diet?36–Answers to the item “Eating habits”.
How healthy is your diet?36–Answers to the item “Fruit and vegetables”.
How healthy is your diet?36–Answers to the item “Fat”.
How healthy is your diet?36–Answers to the item “Starchy foods”.
How healthy is your diet?36–Answers to the item “Sugar”.
How healthy is your diet?36–Answers to the item “Salt”.
Other data
The data from Wahoo Ticker and Zephyr BioHarness are preprocessed and provided at a 1 Hz granularity. For these reason, data from such devices can be used as is.
Code availability
We provide the raw csv data files obtained during the data collection structured by user and device identifier. We did not implement any custom code to generate or process the data.
References
Heaney, J. Energy: Expenditure, intake, lack of. In Gellman, M. D. & Turner, J. R. (eds.) Encyclopedia of Behavioral Medicine, 699–700, https://doi.org/10.1007/978-1-4419-1005-9_454 (Springer New York, 2013).
Pande, A. et al. Using smartphone sensors for improving energy expenditure estimation. IEEE Journal of Translational Engineering in Health and Medicine 3, 1–12 (2015).
Álvarez-Garca, J. A., Cvetković, B. & Luštrek, M. A survey on energy expenditure estimation using wearable devices. ACM Computing Surveys (CSUR) 53, 1–35 (2020).
Hills, A. P., Mokhtar, N. & Byrne, N. M. Assessment of physical activity and energy expenditure: An overview of objective measures. Frontiers in Nutrition 1, 5 (2014).
Psota, T. & Chen, K. Measuring energy expenditure in clinical populations: Rewards and challenges. European Journal of Clinical Nutrition 67, 436–442 (2013).
Albinali, F., Intille, S., Haskell, W. & Rosenberger, M. Using wearable activity type detection to improve physical activity energy expenditure estimation. In Proceedings of the 12th ACM International Conference on Ubiquitous Computing, 311–320 (2010).
Altini, M., Penders, J., Vullers, R. & Amft, O. Combining wearable accelerometer and physiological data for activity and energy expenditure estimation. In Proceedings of the 4th Conference on Wireless Health, 1–8 (2013).
Kinnunen, H., Tanskanen, M., Kyröläinen, H. & Westerterp, K. Wrist-worn accelerometers in assessment of energy expenditure during intensive training. Physiological measurement 33, 1841 (2012).
Kinnunen, H. et al. Training-induced changes in daily energy expenditure: Methodological evaluation using wrist-worn accelerometer, heart rate monitor, and doubly labeled water technique. PLoS One 14, e0219563 (2019).
Bouarfa, L. et al. Predicting free-living energy expenditure using a miniaturized ear-worn sensor: An evaluation against doubly labeled water. IEEE Transactions on Biomedical Engineering 61, 566–575 (2013).
Manohar, N., MacMillan, F., Steiner, G. Z. & Arora, A. Recruitment of research participants. Handbook of Research Methods in Health Social Sciences; Liamputtong, P. Ed 71–98 (2018).
Ferlini, A. et al. In-ear PPG for vital signs. IEEE Pervasive Computing (2021).
Truong, H., Montanari, A. & Kawsar, F. Non-invasive blood pressure monitoring with multi-modal in-ear sensing. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6–10 (IEEE, 2022).
Garbarino, M., Lai, M., Bender, D., Picard, R. W. & Tognetti, S. Empatica E3–A wearable wireless multi-sensor device for real-time computerized biofeedback and data acquisition. In Proceedings of the International Conference on Wireless Mobile Communication and Healthcare (MobiHealth 2014) (2014).
Kalanadhabhatta, M., Min, C., Montanari, A. & Kawsar, F. FatigueSet: A multi-modal dataset for modeling mental fatigue and fatigability. In Proceedings of the Pervasive Computing Technologies for Healthcare: 15th EAI International Conference (PervasiveHealth 2021) (2022).
Montoye, A. H., Vondrasek, J. D. & James B Hancock, I. Validity and reliability of the vo2 master pro for oxygen consumption and ventilation assessment. International Journal of Exercise Science 13, 1382 (2020).
Gilgen-Ammann, R. et al. Validation of ambulatory monitoring devices to measure energy expenditure and heart rate in a military setting. Physiological Measurement 42, 085008 (2021).
Nazari, G. et al. Psychometric properties of the zephyr bioharness device: A systematic review. BMC Sports Science, Medicine and Rehabilitation 10, 1–8 (2018).
Cvetković, B., Milić, R. & Luštrek, M. Estimating energy expenditure with multiple models using different wearable sensors. IEEE journal of biomedical and health informatics 20, 1081–1087 (2015).
Cvetković, B., Kaluža, B., Milić, R. & Luštrek, M. Towards human energy expenditure estimation using smart phone inertial sensors. In International Joint Conference on Ambient Intelligence, 94–108 (Springer, 2013).
Gjoreski, H., Kaluža, B., Gams, M., Milić, R. & Luštrek, M. Context-based ensemble method for human energy expenditure estimation. Applied Soft Computing 37, 960–970 (2015).
Gashi, S. et al. The role of model personalization for sleep stage and sleep quality recognition using wearables. IEEE Pervasive Computing (2022).
Sadeghi, R., Banerjee, T., Hughes, J. C. & Lawhorne, L. W. Sleep quality prediction in caregivers using physiological signals. Computers in Biology and Medicine 110, 276–288 (2019).
The economist, technology quarterly: The quantified self. https://www.economist.com/technology-quarterly. Accessed: May, 2021.
Hajj-Boutros, G., Landry-Duval, M.-A., Comtois, A. S., Gouspillou, G. & Karelis, A. D. Wrist-worn devices for the measurement of heart rate and energy expenditure: A validation study for the Apple Watch 6, Polar Vantage V and Fitbit Sense. European Journal of Sport Science 1–13 (2022).
Krigolson, O. E., Williams, C. C., Norton, A., Hassall, C. D. & Colino, F. L. Choosing MUSE: Validation of a low-cost, portable EEG system for ERP research. Frontiers in Neuroscience 11, 109 (2017).
Chan, K. S. N., Srisurangkul, C., Depaiwa, N. & Pangkreung, S. Detection of driver drowsiness from eeg signals using wearable brain sensing headband. Journal of Research and Applications in Mechanical Engineering 9 (2021).
Alzahrani, A. et al. A multi-channel opto-electronic sensor to accurately monitor heart rate against motion artefact during exercise. Sensors 15, 25681–25702 (2015).
Klass, M., Faoro, V. & Carpentier, A. Assessment of energy expenditure during high intensity cycling and running using a heart rate and activity monitor in young active adults. PloS one 14, e0224948 (2019).
Gilgen-Ammann, R. et al. Energy expenditure estimation from respiration variables. Scientific reports 7, 1–7 (2017).
Parak, J., Uuskoski, M., Machek, J. & Korhonen, I. Estimating heart rate, energy expenditure, and physical performance with a wrist photoplethysmographic device during running. JMIR mHealth and uHealth 5, e7437 (2017).
Ainsworth, B. E. et al. Compendium of physical activities: An update of activity codes and met intensities. Medicine and science in sports and exercise 32, S498–S504 (2000).
Buysse, D. J., Reynolds, C. F. III, Monk, T. H., Berman, S. R. & Kupfer, D. J. The Pittsburgh Sleep Quality Index: A new instrument for psychiatric practice and research. Psychiatry Research 28, 193–213 (1989).
Shahid, A., Wilkinson, K., Marcu, S. & Shapiro, C. M. Stanford Sleepiness Scale (SSS). In STOP, THAT and One Hundred Other Sleep Scales, 369–370 (Springer, 2011).
Cohen, S. et al. Perceived stress scale. Measuring Stress: A Guide for Health and Social Scientists 10, 1–2 (1994).
Ortega, F. B. et al. The International Fitness Scale (IFIS): Usefulness of self-reported fitness in youth. International Journal of epidemiology 40, 701–711 (2011).
Thomas, S., Reading, J. & Shephard, R. J. Revision of the physical activity readiness questionnaire (par-q). Canadian Journal of Sport Sciences (1992).
Gashi, S., Min, C., Montanari, A., Santini, S. & Kawsar, F. A multi-device and multi-modal dataset for wearable human energy expenditure estimation. Zenodo https://doi.org/10.5281/zenodo.6420886 (2022).
Alhomayani, F. & Mahoor, M. H. OutFin, A multi-device and multi-modal dataset for outdoor localization based on the fingerprinting approach. Scientific Data 8, 1–14 (2021).
Field, A. & Hole, G. How to Design and Report Experiments (Sage, 2002).
Armstrong, R. A. When to use the Bonferroni correction. Ophthalmic and Physiological Optics 34, 502–508 (2014).
Gashi, S. et al. Detection of artifacts in ambulatory electrodermal activity data. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 4, 1–31 (2020).
Greco, A., Valenza, G., Lanata, A., Scilingo, E. P. & Citi, L. cvxEDA: A convex optimization approach to electrodermal activity processing. IEEE Transactions on Biomedical Engineering 63, 797–804 (2015).
Schmidt, P., Reiss, A., Dürichen, R. & Laerhoven, K. V. Wearable affect and stress recognition: A review. ArXiv abs/1811.08854 (2018).
Makowski, D. et al. NeuroKit2: A Python toolbox for neurophysiological signal processing. Behavior Research Methods https://doi.org/10.3758/s13428-020-01516-y (2021).
Kurpad, A., Raj, R., Maruthy, K. & Vaz, M. A simple method of measuring total daily energy expenditure and physical activity level from the heart rate in adult men. European Journal of Clinical Nutrition 60, 32–40 (2006).
Amelard, R., Hedge, E. T. & Hughson, R. L. Temporal prediction of oxygen uptake dynamics from wearable sensors during low-, moderate-, and heavy-intensity exercise. arXiv preprint arXiv:2105.09987 (2021).
Altini, M., Penders, J., Vullers, R. & Amft, O. Estimating energy expenditure using body-worn accelerometers: A comparison of methods, sensors number and positioning. IEEE Journal of Biomedical and Health Informatics 19, 219–226 (2014).
Chen, S., Lach, J., Amft, O., Altini, M. & Penders, J. Unsupervised activity clustering to estimate energy expenditure with a single body sensor. In 2013 IEEE International Conference on Body Sensor Networks, 1–6 (IEEE, 2013).
Paraschiakos, S. et al. Activity recognition using wearable sensors for tracking the elderly. User Modeling and User-Adapted Interaction 30, 567–605 (2020).
Menghini, L. et al. Stressing the accuracy: Wrist-worn wearable sensor validation over different conditions. Psychophysiology 56, e13441 (2019).
Author information
Authors and Affiliations
Contributions
All authors participated in the design of the study protocol. A.M., C.M. and S.G. conducted the experiments. F.K., S.S., A.M. and C.M. supervised the project. S.G. prepared a first draft of the manuscript. All authors edited, revised and approved the final version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gashi, S., Min, C., Montanari, A. et al. A multidevice and multimodal dataset for human energy expenditure estimation using wearable devices. Sci Data 9, 537 (2022). https://doi.org/10.1038/s41597-022-01643-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-022-01643-5
- Springer Nature Limited