Generator of cost-effective GMO testing strategies
Description
GMOtrack is a command line utility that implements the GMOtrack algorithm. It generates cost-effective testing strategies for traceability of genetically modified organisms (GMO). Given a table of GMOs (along with the probabilities of their presence, the genetic elements present in their genome and a linear cost function) GMOtrack computes the optimal set of screening assays for a two-phase testing strategy.
GMOtrack is distributed free under GPL and can be downloaded from this web page.
Publications
Kralj Novak P., Gruden K., Morisset D., Lavracč., Šebih D., Rotter A., Žel J.
GMOtrack: generator of cost-effective GMO testing strategies
Journal of AOAC International 2009, Nov-Dec;92(6):1739-1746
[pdf], supplementary material [pdf], [txt]
Download
Windows executable: GMOtrack.zip
Download data below
Real-time PCR analysis costs: PCRcosts.pdf is a document with the cost schema
For developers: Python source code GMOtrack-sources.zip which is developed with Orange version 0.9.64.
Installation
Unzip the file GMOtrack.zip. GMOtrack is a command-line utility so run it from the command line as described below. To start the command line, you can use the run.bat script, which is enclosed in the GMOtrack.zip archive.
Usage
GMOtrack is a command-line utility. GMOtrack usage:
GMOtrack filename [-m int][-c float][-k float][-n float]
- -mmaximum assays for first phase constraint, default m=5
- -cminimum coverage (in %) of the first phase, default c=80
- -kcoefficient for the cost function g, default k=21.18
- -nconstant for the cost function g, default n=91.82
The program returns for each number i up to m the combination of i screening assays with the lowest expected total cost.
Sample output
Data
We have prepared the sample datasets GMO_EU.zip.
They were used for the experiments in the paper.
dataset | year | frequency | # GMOs | # screening assays |
GMO_EU_1997.tab | 1997 | all | 7 | 18 |
GMO_EU_1999.tab | 1999 | all | 15 | 22 |
GMO_EU_2002.tab | 2002 | all | 17 | 24 |
GMO_EU_2005.tab | 2005 | all | 20 | 24 |
GMO_EU_2008.tab | 2008 | all | 22 | 25 |
GMO_EU_2008-food.tab | 2008 | food | 22 | 25 |
GMO_EU_2008-feed.tab | 2008 | feed | 22 | 25 |
GMO_EU_beyond2008.tab | beyond 2008 | all | 37 | 32 |
Here we describe how you can prepare your own data to be used with GMOtrack. We recommend to take one of the available datasets (above) and modify it in a spreadsheet program (e.g. MS Excel). Be careful when saving, since the file must be a “Tab delimited” text file and the filename must end with “.tab”.
GMOtrack uses the “.tab” data format with some additional specifics.
The data must be in a tab separated text file with following format:
- File head:
- The first line is reserved for column names: meta attribute names and assay names.
- the first three columns must be named “GMOname”, “species” and “probability”.
- The second line denotes the data type:
- d for nominal values
- c for numbers
- The third line denotes column type:
- m for meta attributes (GMOname, species and probability)
- empty for assays
- The first line is reserved for column names: meta attribute names and assay names.
- File body:
- Each line is the file’s body is one GMO.
The first column represents GMO names.
The second column must be species.
The following columns are the responses of the GMO to the assay:- 1 means positive response,
- 0 means negative response.
- Each line is the file’s body is one GMO.
Example input data
Funding
Funding was provided by the Slovenian Research Agency (P4-0165, P2-0103); the Slovenian Ministry of Agriculture, Forestry and Food; and the Slovenian Ministry of the Environment and Spatial Planning (V4-0314).