• The file AllUnconfirmedTargetsRanked.xls contains a spreadsheet with the columns corresponding to TFs. Genes are ordering from most likely to least likely of being a new target for the TF based on the semi-supervised method that uses expression data, motif information, and knowledge of genes with curated direct evidence. Genes which already have direct evidence of being regulated by the TF curated in EcoCyc 11.5 are excluded from this list. An entry in a cell is "gene|predictedsign" where "gene" is the blattner number of the gene and predictedsign is '+' if the TF primarily serves as an activator of the gene and '-' if the TF primarily serves as a repressor.
  • The file TopPredictions.xls is the subset of predictions from AllUnconfirmedTargetsRanked.xls that was used in the Aerobic-Anaerobic application. The number of new predictions is equivalent to the number of genes wtih confirmed direct evidence.
  • The file ConfirmedTargets.xls contains those TF-gene interactions supported by direct evidence in EcoCyc 11.5. The file is organized into columns by TFs as the files above. The format of an entry is "gene|predictedsign|annotatedsign", where the presence of the entry in gene in a column implies direct evidence the TF regulates the gene. predictedsign is defined as above, while for annotatedsign a '+' means only a curated direct evidence activator relationship, a '-' means only a curated direct evidence repressor relationship, otherwise the sign is '+-'. Genes are ordered using the same method as used to order new predictions.