Science Workflows

Taverna Workflows

We are developing computational workflows using our web service infrastructure and the open-source Taverna workflow tool. The emphasis is on developing workflows which encapsulate important processes in chemoinformatics and drug design, use diverse kinds of information together in novel ways, and which are of demonstrated scientific merit.

Below are descriptions of some of the workflows that we have developed, along with example output.

Workflow 1 - Finding relationships between compounds and proteins

NIH SIM SEARCH -> FILTER -> OMEGA -> FRED -> JMOL/HTML
					

Download Taverna Workflow File

Examples of workflow output

This workflow is a sequence of performing a similarity search on the NIH DTP Human Tumor data, filtering the results based on Pharmacokinetic properties (FILTER), converting to 3D (OMEGA), docking into a pre-defined protein (FRED) and visualizing (JMOL). This workflow opens up various possibilities, including:

Workflow 2 - HTS data organization and flagging

NIH SCREEN RETRIEVE -> FILTER -> TOXICITY FLAG -> SERIES GENERATION (Divkm) -> VISUALIZATION (VOPlot, 2Dviewer)
					

Example of Workflow Output

This workflow demonstrates how screening data can be flagged and organized for human analysis. The compounds and data values for a particular screen are retrieved, and then are filtered to remove compounds with reactive groups, etc. ToxTree is used to flag the potential toxicities of compounds. Divkmeans is used to add a column of cluster numbers. Finally, the results are visualized using VOPlot and the 2D viewer applet.

Sample Workflows - Some Part of the Above, Prototypes, or Test Workflows

ToxTreeBrief of ToxTreeServer

ToxTreeVerbose of ToxTreeServer

NIH Data Mining Dataflow