Eff3ctidor

What is Effectidor's output?

How do I interpret the results?

How is Effectidor evaluated and where can I see its predicting accuracy?

What input does Effectidor require and what is it used for?

Effectidor has one obligatory input - an ORFs file.
This is a FASTA file including all the genome ORFs. See instructions for downloading this file here.
Some of the ORFs in this file (effectors and non-effectors) will be used to train the machine-learning algorithms, and based on the trained classifier, the main output - prediction for each ORF - will be performed.
In addition, it is recommended to supply a known Effectors file, as appearing in the ORFs file. Alternatively, a homology search against an internal effectors dataset will be performed to constitute the "known effectors" ORFs for the learning process.

In the advanced options you can supply data that will result in additional features to feed the machine-learning and improve the results. These data include:

Host proteome archive. Protein FASTA files with the proteome of a known host of the studied bacterium. Multiple files can be included. All these files should be compressed in a single zip archive.
This input will be used for homology searches. As effectors interact with host proteins for their function, we expect them to have eukaryotic domains, that will be recognized in this homology search.
Archive of proteomes of closely related bacteria without T3SSs. This archive may contain several proteome records, each in a seperate FASTA file. These FASTA files should be compressed in a single zip archive. A homology search will be performed against each of these proteomes. As these bacteria are closely related to the studied bacterium, the vast majority of the proteins in the studied bacterium are expected to have an ortholog in these proteomes. Nevertheless, since they do not encode a T3SS, effectors are not expected to have orthologs in these proteomes. Thus, these features are usually very informative for the machine-learning.
GFF3 file(s). These files will be used to compute genome organization features.
Full genome FASTA files. The full genome will be used to search for regulatory elements in the promoter region of each ORF. Speciffically, we allow searching for the following motifs: PIP-box, relevant for Xanthomonas, Ralstonia, and Acidovorax. hrp-box, relevant for Pseudomonas syringae and plant pathogens of the Enterobacteria family. mxiE-box, relevant for Shigella. exs-box, relevant for Pseudomonas aeruginosa. tts-box which is relevant for rhizobia.

What is Effectidor's output?

How do I interpret the results?

How is Effectidor evaluated and where can I see its predicting accuracy?

What input does Effectidor require and what is it used for?

What is the expected running time of Effectidor?

How long will my results be saved in the servers?

Why do you need my email?

Can I use Effectidor to run the analysis on several genomes simultaneously?