Ames Mutagenicity Predictions - Random Forest

Service is currently down to license upgrades

This page uses random forest model to predict a structure (specified in SMILES format) is mutagenic or not (according to the Ames test), based on 166 bit MACCS fingerprints. The model is based on a 4337 compound dataset studied by Kazius et al. (J. Med. Chem, 2005, 48(1), 312-320), which was divided approximately equally (1.2:1) between mutagens and nonmutagens.

OOB error was 15.39% and test set error was 15.32%

Note, this page is a proof-of-concept of model deployment using R web services coupled to a remote R engine. The model development did not investigate other features and only really tuned the number of descriptors searched at each split.

Paste SMILES, one to a line


Some example SMILES:
    CCOc1ccc(cc1)C2CC(c3cccc(Cl)c3)n4nc(N)nc4N2
    CCOC1(OCC)N=C(N)C2(C#N)C(c3cccc(OC)c3OC)C12C#N
    Nc1nc2NC(CC(c3ccc(Cl)cc3)n2n1)c4ccc(F)cc4
    COc1ccc(cc1)C2CC(c3ccc(Cl)cc3)n4nc(N)nc4N2
    COc1cccc(c1)C2CC(c3ccc(Cl)cc3Cl)n4nc(N)nc4N2
    COc1ccc(cc1)C2CC(c3ccc(Cl)cc3Cl)n4nc(N)nc4N2
    CC(C)COC(=O)C1C2OC3(CN(C4CCCC4)C(=O)C13)C=C2
    CN1C(=O)N(C)C2=C(C1=O)C(NS(=O)(=O)c3ccccc3)(C(=O)N2)C(F)(F)F
    FC(F)(F)C1(NS(=O)(=O)c2ccccc2)N=C3SCCN3C1=O