| Help |
| File Format |
|
Experiment file should be tab-delimited with or without a header. Missing values are allowed (see example below, YBL001C ).
If there is a header it must have a title for each column. The first column is a row id (must be shorter than 30 characters, without space/blank character).
Example :
AutoClass can handle an “unlimited” number of lines and a maximum of 999 columns (default setting of AutoClass; the maximum number of columns may be increased upon user’s request). |
| Outputs |
Note that currently, the cdt are annotated only for Saccharomyces cerevisiae genes (the row id is used for other type of data). |
| [top] |
| Tutorial |
AutoClass@IJM provides an web interface to the powerful clustering software, AutoClass, developed by
the Ames Research Center at N.A.S.A. AutoClass is an unsupervised Bayesian classification system which
has many powerful features:
Our web interface aim at simplifying the use of AutoClass by:
|
| Input files |
AutoClass can handle three different types of data:
|
| Error parameter |
|
As quoted in the AutoClass documentation files:
"The fundamental question in all of this is: "To what extent do you
believe the numbers that are to be given to AutoClass?" AutoClass will
run quite happily with whatever it is given. It is up to the user to
decide what is meaningful and what is not.
[...]
For practical purposes:
Note that: the impact of the error parameter on classification is highly dependent on the structure of the dataset and as mentionned above, should be set according to the confidence into the data to be classified. For example, in some experiments with real scalar data ranging from 0 to 10 000, the datum 80 may be considered as not different from 100 (and an error of 0.2 is acceptable), in other experiments, with real scalar data ranging from 0 to 10, the datum 1 may be known to be very different from 1.1, and an error of 0.01 is required. If the error parameter entered by user is too large with respect to the data, the error message generated by AutoClass is interpreted and a e-mail is sent to the user with AutoClass log file attached. |
| Submit your files |
|
| [top] |
| Example files |
|
We provide two example files for you to test AutoClass@IJM (these examples can be loaded here): |
| Example One |
|
A file with real values : The data are from Yoshimoto's paper: Yoshimoto H. et al. (2002) "Genome-wide analysis of gene expression regulated by the calcineurin/Crz1p signaling pathway in Saccharomyces cerevisiae.J Biol Chem. 277(34):31079-88(GEO DataSet: GSE3456). |
| Example Two |
|
Two files: one with real values and the other with discrete values as test examples for clustering of heterogeneous data.
The data come from the French National Institut for Health Watch (I.N.V.S. http://www.invs.sante.fr ): one file reporting incidence and mortality rate for cancer (real values) and one file reporting the location of the primary cancer and the gender of the populations (discrete values) (source: Belot A. et al. (2008) "Cancer incidence and mortality in France over the period 1980-2005."Rev Epidemiol Sante Publique. 56(3):159-75). |
| [top] |
| References |
|
| [top] |
| Citing AutoClass@IJM | |
| We kindly ask users to cite the following paper when publishing results derived of the use of AutoClass@IJM : | |
|
AutoClass@IJM: a powerful tool for Bayesian classification of heterogeneous data in biology Fiona Achcar; Jean-Michel Camadro; Denis Mestivier Nucleic Acids Research 2009; doi: 10.1093/nar/gkp430 | |
| [top] | |
| Development | |
To ask a question or report a problem, please contact Achcar F. or Mestivier D. | |
| ({achcar,mestivier}[AT]ijm.univ-paris-diderot.fr) | |
| 1) "Modelling in Integrative Biology" group | 2) "Protein Engineering and Metabolic Control" group |
| Institut Jacques Monod | |
| [top] | |