Skip to main content

Advertisement

Log in

Weighted scaling approach for metabolomics data analysis

  • Original Paper
  • Modern Methods and Applications for Biodiversity Studies
  • Published:
Japanese Journal of Statistics and Data Science Aims and scope Submit manuscript

Abstract

Systematic variation is a common issue in metabolomics data analysis. Therefore, different scaling and normalization techniques are used to preprocess the data for metabolomics data analysis. Although several scaling methods are available in the literature, however, choice of scaling, transformation and/or normalization technique influences the further statistical analysis. It is challenged to choose the appropriate scaling technique for downstream analysis to get accurate results or to make proper decision. Moreover, the existing scaling techniques are sensitive to outliers or extreme values. To fill the gap, our objective is to introduce a robust scaling approach that is not influenced by outliers as well as provides more accurate results for downstream analysis. Here, we introduced a new weighted scaling approach that is robust against outliers; however, no additional outlier detection/treatment step is needed in data preprocessing and also compared it with the conventional scaling and normalization techniques through artificial and real metabolomics datasets. We evaluated the performance of the proposed method in comparison to the other existing conventional scaling techniques using metabolomics data analysis in both the absence and presence of different percentages of outliers. Results show that in most cases, the proposed scaling technique is a better performer than the traditional scaling methods in both the absence and presence of outliers. The proposed method improves the further downstream metabolomics analysis. The R function of the proposed robust scaling method is available at https://github.com/nishithkumarpaul/robustScaling/blob/main/wscaling.R

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

Two datasets are available at the National Institute of Health (NIH) Common Fund’s National Metabolomics Data Repository (NMDR) website (https://www.metabolomicsworkbench.org/data/index.php). The breast cancer dataset was produced by GC–TOF-MS and processed by ChromaTOF software (v. 2.32) using the blood sample of 134 subjects.

References

Download references

Acknowledgements

We thanks and acknowledge the Ministry of Science and Technology, Bangladesh for the National Science and Technology (NST) fellowship.

Author information

Authors and Affiliations

Authors

Contributions

BB analyzed the data, drafted the manuscript, and executed the statistical analysis. NK and MAA worked to develop the weighted scaling approach for metabolomics data analysis. NK, MAA, and MAH coordinated and supervised the project. All authors carefully read and finally approved the manuscript.

Corresponding authors

Correspondence to Nishith Kumar or Md. Ashad Alam.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 15554 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Biswas, B., Kumar, N., Hoque, M.A. et al. Weighted scaling approach for metabolomics data analysis. Jpn J Stat Data Sci 6, 785–802 (2023). https://doi.org/10.1007/s42081-023-00205-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s42081-023-00205-2

Keywords

Navigation

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy