LOGIN TO YOUR ACCOUNT

Username
Password
Remember Me
Or use your Academic/Social account:

CREATE AN ACCOUNT

Or use your Academic/Social account:

Congratulations!

You have just completed your registration at OpenAire.

Before you can login to the site, you will need to activate your account. An e-mail will be sent to you with the proper instructions.

Important!

Please note that this site is currently undergoing Beta testing.
Any new content you create is not guaranteed to be present to the final version of the site upon release.

Thank you for your patience,
OpenAire Dev Team.

Close This Message

CREATE AN ACCOUNT

Name:
Username:
Password:
Verify Password:
E-mail:
Verify E-mail:
*All Fields Are Required.
Please Verify You Are Human:
fbtwitterlinkedinvimeoflicker grey 14rssslideshare1
Jacques, R.M.; Fieller, N.R.J.; Ainscow, E.K. (2012)
Publisher: Taylor & Francis
Languages: English
Types: Article
Subjects:
The current paradigm for the identification of candidate drugs within the pharmaceutical industry typically involves the use of high throughput screens (HTS). High content screening (HCS) is the term given to the process of using an imaging platform to screen large numbers of compounds for some desirable biological activity. Classification methods have important applications in high content screening experiments where they are used to predict which compounds have the potential to be developed into new drugs. In this paper a new classification method is proposed for batches of compounds where the rule is updated sequentially using information from the classification of previous batches. This methodology accounts for the possibility that the training data are not a representative sample of the test data and that the underlying group distributions may change as new compounds are analysed. This technique is illustrated on an example data set using linear discriminant analysis, k-nearest neighbour and random forest classifiers. Random Forests are shown to be superior to the other classifiers and are further improved by the additional updating algorithm in terms of an increase in the number of true positives as well as decreasing the number of false positives.
  • The results below are discovered through our pilot algorithms. Let us know how we are doing!

    • [2] [5] [6] [8] [9] [15] C. Svetnik, A. Liaw, C. Tong, J.C. Culberson, R.P. Sheridan and B.P. Feuston, Random Forest: A classification and regression tool for compound classification and QSAR modelling, J. Chem. Inf. Comput. Sci. 43 (2003), pp. 1947-1958.
    • [16] W.N. Venables and B.D. Ripley, Modern Applied Statistics with S. (Fourth Edition), Springer, New York, 2002.
    • [17] X. Zhou, X. Chen, K.L. Liu, S. Lyman, R. King and S. Wong, Time-Lapse Cell Cycle Quantitative Data Analysis Using Gaussian Mixture Models, in Life Science Data Mining, World Scientific, Singapore, 2007.
  • No related research data.
  • No similar publications.

Share - Bookmark

Cite this article