Remember Me
Or use your Academic/Social account:


You have just completed your registration at OpenAire.

Before you can login to the site, you will need to activate your account. An e-mail will be sent to you with the proper instructions.


Please note that this site is currently undergoing Beta testing.
Any new content you create is not guaranteed to be present to the final version of the site upon release.

Thank you for your patience,
OpenAire Dev Team.

Close This Message


Verify Password:
Verify E-mail:
*All Fields Are Required.
Please Verify You Are Human:

OpenAIRE is about to release its new face with lots of new content and services.
During September, you may notice downtime in services, while some functionalities (e.g. user registration, login, validation, claiming) will be temporarily disabled.
We apologize for the inconvenience, please stay tuned!
For further information please contact helpdesk[at]openaire.eu

fbtwitterlinkedinvimeoflicker grey 14rssslideshare1
Loo, Ruey Leng; Zou, Xin; Holmes, Elaine; Nicholson, Jeremy K. (2016)
Publisher: ACS Publications
Languages: English
Types: Article
Subjects: QA
We propose a novel data-driven approach aiming to reliably distinguish discriminatory metabolites from nondiscriminatory metabolites for a given spectroscopic data set containing two biological phenotypic subclasses. The automatic spectroscopic data categorization by clustering analysis (ASCLAN) algorithm aims to categorize spectral variables within a data set into three clusters corresponding to noise, nondiscriminatory and discriminatory metabolites regions. This is achieved by clustering each spectral variable based on the r(2) value representing the loading weight of each spectral variable as extracted from a orthogonal partial least-squares discriminant (OPLS-DA) model of the data set. The variables are ranked according to r(2) values and a series of principal component analysis (PCA) models are then built for subsets of these spectral data corresponding to ranges of r(2) values. The Q(2)X value for each PCA model is extracted. K-means clustering is then applied to the Q(2)X values to generate two clusters based on minimum Euclidean distance criterion. The cluster consisting of lower Q(2)X values is deemed devoid of metabolic information (noise), while the cluster consists of higher Q(2)X values is then further subclustered into two groups based on the r(2) values. We considered the cluster with high Q(2)X but low r(2) values as nondiscriminatory, while the cluster with high Q(2)X and r(2) values as discriminatory variables. The boundaries between these three clusters of spectral variables, on the basis of the r(2) values were considered as the cut off values for defining the noise, nondiscriminatory and discriminatory variables. We evaluated the ASCLAN algorithm using six simulated (1)H NMR spectroscopic data sets representing small, medium and large data sets (N = 50, 500, and 1000 samples per group, respectively), each with a reduced and full resolution set of variables (0.005 and 0.0005 ppm, respectively). ASCLAN correctly identified all discriminatory metabolites and showed zero false positive (100% specificity and positive predictive value) irrespective of the spectral resolution or the sample size in all six simulated data sets. This error rate was found to be superior to existing methods for ascertaining feature significance: univariate t test by Bonferroni correction (up to 10% false positive rate), Benjamini-Hochberg correction (up to 35% false positive rate) and metabolome wide significance level (MWSL, up to 0.4% false positive rate), as well as by various OPLS-DA parameters: variable importance to projection, (up to 15% false positive rate), loading coefficients (up to 35% false positive rate), and regression coefficients (up to 39% false positive rate). The application of ASCLAN was further exemplified using a widely investigated renal toxin, mercury II chloride (HgCl2) in rat model. ASCLAN successfully identified many of the known metabolites related to renal toxicity such as increased excretion of urinary creatinine, and different amino acids. The ASCLAN algorithm provides a framework for reliably differentiating discriminatory metabolites from nondiscriminatory metabolites in a biological data set without the need to set an arbitrary cut off value as applied to some of the conventional methods. This offers significant advantages over existing methods and the possibility for automation of high-throughput screening in "omics" data.
  • The results below are discovered through our pilot algorithms. Let us know how we are doing!

    • (1) Holmes, E.; Loo, R. L.; Stamler, J.; Bictash, M.; Yap, I. K.; Chan, Q.; Ebbels, T.; De Iorio, M.; Brown, I. J.; Veselkov, K. A.; Daviglus, M.
    • Nature 2008, 453, 396−400.
    • (2) Chan, E. C.; Koh, P. K.; Mal, M.; Cheah, P. Y.; Eu, K. W.; Backshall, A.; Cavill, R.; Nicholson, J. K.; Keun, H. C. J. Proteome Res.
    • (3) Bylesjo, M.; Rantalainen, M.; Cloarec, O.; Nicholson, J. K.; Holmes, E.; Trygg, J. J. Chemom. 2006, 20, 341−351.
    • (4) Cloarec, O.; Dumas, M. E.; Trygg, J.; Craig, A.; Barton, R. H.; Lindon, J. C.; Nicholson, J. K.; Holmes, E. Anal. Chem. 2005, 77, 517− 26.
    • (5) Coen, M.; Hong, Y. S.; Clayton, T. A.; Rohde, C. M.; Pearce, J.
    • T.; Reily, M. D.; Robertson, D. G.; Holmes, E.; Lindon, J. C.; Nicholson, J. K. J. Proteome Res. 2007, 6, 2711−9.
    • (6) Su, S. L.; Duan, J. A.; Wang, P. J.; Liu, P.; Guo, J. M.; Shang, E.; Qian, D. W.; Tang, Y. P.; Tang, Z. X. J. Proteome Res. 2013, 12, 852− 865.
    • (7) Nevedomskaya, E.; Pacchiarotta, T.; Artemov, A.; Meissner, A.; van Nieuwkoop, C.; van Dissel, J. T.; Mayboroda, O. A.; Deelder, A.
    • M. Metabolomics 2012, 8, 1227−1235.
    • (8) Mehmood, T.; Liland, K. H.; Snipen, L.; Saebo, S. Chemom. Intell.
    • Lab. Syst. 2012, 118, 62−69.
    • (9) Wiklund, S.; Johansson, E.; Sjostrom, L.; Mellerowicz, E. J.; Edlund, U.; Shockcor, J. P.; Gottfries, J.; Moritz, T.; Trygg, J. Anal.
    • Chem. 2008, 80, 115−22.
    • (10) Wang, L.; Hou, E.; Wang, L.; Wang, Y.; Yang, L.; Zheng, X.; Xie, G.; Sun, Q.; Liang, M.; Tian, Z. Anal. Chim. Acta 2015, 854, 95−105.
    • (11) Patel, S.; Ahmed, S. J. Pharm. Biomed. Anal. 2015, 107C, 63−74.
    • (12) Coen, M.; Goldfain-Blanc, F.; Rolland-Valognes, G.; Walther, B.; Robertson, D. G.; Holmes, E.; Lindon, J. C.; Nicholson, J. K. J.
    • Proteome Res. 2012, 11, 2427−40.
    • (13) Clayton, T. A.; Baker, D.; Lindon, J. C.; Everett, J. R.; Nicholson, J. K. Proc. Natl. Acad. Sci. U. S. A. 2009, 106, 14728−33.
    • (14) Winnike, J. H.; Li, Z.; Wright, F. A.; Macdonald, J. M.; O'Connell, T. M.; Watkins, P. B. Clin. Pharmacol. Ther. 2010, 88, 45− 51.
    • (15) Smith, E. A.; Macfarlane, G. T. Microb. Ecol. 1997, 33, 180−8.
    • (16) Cloarec, O.; Dumas, M. E.; Craig, A.; Barton, R. H.; Trygg, J.; Hudson, J.; Blancher, C.; Gauguier, D.; Lindon, J. C.; Holmes, E.; Nicholson, J. Anal. Chem. 2005, 77, 1282−9.
    • (17) Crockford, D. J.; Holmes, E.; Lindon, J. C.; Plumb, R. S.; Zirah, S.; Bruce, S. J.; Rainville, P.; Stumpf, C. L.; Nicholson, J. K. Anal.
    • Chem. 2006, 78, 363−71.
    • (18) Posma, J. M.; Garcia-Perez, I.; De Iorio, M.; Lindon, J. C.; Elliott, P.; Holmes, E.; Ebbels, T. M.; Nicholson, J. K. Anal. Chem.
    • (19) Zou, X.; Holmes, E.; Nicholson, J. K.; Loo, R. L. Anal. Chem.
    • (20) McDonald, J. H. Handbook of Biological Statistics, 2nd ed.; Sparky House Publishing: Baltimore, 2009.
    • (21) Chadeau-Hyam, M.; Ebbels, T. M.; Brown, I. J.; Chan, Q.; Stamler, J.; Huang, C. C.; Daviglus, M. L.; Ueshima, H.; Zhao, L.; Holmes, E.; Nicholson, J. K.; Elliott, P.; De Iorio, M. J. Proteome Res.
    • (22) Muncey, H. J.; Jones, R.; De Iorio, M.; Ebbels, T. M. BMC Bioinf. 2010, 11, 496.
    • (23) Wishart, D. S.; Jewison, T.; Guo, A. C.; Wilson, M.; Knox, C.; Liu, Y.; Djoumbou, Y.; Mandal, R.; Aziat, F.; Dong, E.; Bouatra, S.; Sinelnikov, I.; Arndt, D.; Xia, J.; Liu, P.; Yallou, F.; Bjorndahl, T.; Perez-Pineiro, R.; Eisner, R.; Allen, F.; Neveu, V.; Greiner, R.; Scalbert, A. Nucleic Acids Res. 2013, 41, D801−7.
    • (24) Holmes, E.; Cloarec, O.; Nicholson, J. K. J. Proteome Res. 2006, 5, 1313−20.
    • (25) Nicholson, J. K.; Timbrell, J. A.; Sadler, P. J. Mol. Pharmacol.
    • (26) Nicholson, J. K.; Kendall, M. D.; Osborn, D. Nature 1983, 304, 633−5.
    • (27) Gartland, K. P.; Bonner, F. W.; Nicholson, J. K. Mol. Pharmacol.
    • (28) Holmes, E.; Bonner, F. W.; Nicholson, J. K. Comp. Biochem.
    • Physiol., Part C: Pharmacol., Toxicol. Endocrinol. 1996, 114, 7−15.
    • (29) Holmes, E.; Bonner, F. W.; Sweatman, B. C.; Lindon, J. C.; Beddell, C. R.; Rahr, E.; Nicholson, J. K. Mol. Pharmacol. 1992, 42, 922−930.
    • (30) Dasgupta, S.; Freund, Y. IEEE Trans. Inf. Theory 2009, 55, 3229−3242.
    • (31) Dieterle, F.; Ross, A.; Schlotterbeck, G.; Senn, H. Anal. Chem.
    • (32) Cho, H. W.; Kim, S. B.; Jeong, M. K.; Park, Y.; Miller, N. G.; Ziegler, T. R.; Jones, D. P. Int. J. Data Min. Bioin. 2008, 2, 176−92.
    • (33) Clayton, T. A.; Lindon, J. C.; Cloarec, O.; Antti, H.; Charuel, C.; Hanton, G.; Provost, J. P.; Le Net, J. L.; Baker, D.; Walley, R. J.; Everett, J. R.; Nicholson, J. K. Nature 2006, 440, 1073−1077.
  • No related research data.
  • Discovered through pilot similarity algorithms. Send us your feedback.

Share - Bookmark

Funded by projects

  • RCUK | Urinary metabolic phenotyp...

Cite this article

Cookies make it easier for us to provide you with our services. With the usage of our services you permit us to use cookies.
More information Ok