Gene Catalogs
KEGG GENES is a collection of gene catalogs for all complete genomes, generated from publicly available resources, mostly NCBI RefSeq.
These catalogs are subject to SSDB computation and KO assignment (gene annotation) by the KOALA tool.
Two supplementary gene catalogs, KEGG DGENES for draft genomes of some eukaryotes and KEGG EGENES for EST datasets (mostly of plants), are given automatic KO assignment by KAAS, with GENES used as the reference data set.
A fourth type of gene catalog, MGENES, covers metagenomes (see also KEGG GENOME) and is annotated automatically.
The viral gene catalog, VGENES, is not yet fully integrated into the KEGG system.
Gene Annotation
The annotation of KEGG GENES involves assignment of KO identifiers (K numbers).
Internally, this is done with the KOALA and GFIT annotation tools, which are based on the SSDB database (see: Ortholog Annotation in KEGG).
The annotation of KEGG DGENES and EGENES is done automatically by the KAAS program. EGENES itself is generated from EST datasets by the EGassembler program. Both programs are publicly available.
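As a rough illustration of what assigning KO identifiers against a reference data set can look like, the sketch below transfers the K number of the best-scoring reference hit to each query gene. This is a deliberately simplified stand-in and is not the actual KOALA, GFIT, or KAAS procedure. The function name `assign_ko_by_best_hit`, the `min_score` cutoff, and all gene identifiers, scores, and gene-to-KO pairings in the example are made up for illustration.

```python
from typing import Dict, Iterable, Tuple


def assign_ko_by_best_hit(
    hits: Iterable[Tuple[str, str, float]],
    reference_ko: Dict[str, str],
    min_score: float = 0.0,
) -> Dict[str, str]:
    """Assign each query gene the K number of its best-scoring annotated hit.

    hits         -- (query_gene, reference_gene, similarity_score) tuples
    reference_ko -- reference gene -> K number (the annotated reference set)
    min_score    -- hypothetical cutoff; hits scoring below it are ignored
    """
    best: Dict[str, Tuple[float, str]] = {}
    for query, ref, score in hits:
        # Skip weak hits and hits to reference genes without a K number.
        if score < min_score or ref not in reference_ko:
            continue
        # Keep only the highest-scoring annotated hit per query gene.
        if query not in best or score > best[query][0]:
            best[query] = (score, reference_ko[ref])
    return {query: ko for query, (_, ko) in best.items()}


# Toy input: identifiers, scores, and gene-to-KO pairings are invented.
example_hits = [
    ("q1", "refA:g1", 250.0),
    ("q1", "refB:g7", 310.0),
    ("q2", "refA:g2", 40.0),
    ("q3", "refC:g9", 500.0),
]
example_reference = {"refA:g1": "K00001", "refB:g7": "K00002", "refA:g2": "K00003"}
assignments = assign_ko_by_best_hit(example_hits, example_reference, min_score=50.0)
```

In this toy run, `q2` is left unassigned because its only hit falls below the cutoff, and `q3` is left unassigned because its hit has no K number in the reference set.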
Annotate genomes using KEGG
KAAS: automatic annotation (KO assignment) and pathway reconstruction
[reference]
Generate EST consensus contigs
EGassembler: automatic assembly of ESTs to generate consensus contigs
[reference]
Search similar sequences in GENES
BLAST: sequence similarity search by BLAST
FASTA: sequence similarity search by FASTA
Gene Name Conversion
KEGG GENES entries can be retrieved using identifiers from external databases, such as NCBI-GeneID (Entrez Gene ID), NCBI-gi, and UniProt accession numbers.
Cross-reference lists are available at the FTP site.
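A minimal sketch of how such a cross-reference list might be used for gene name conversion is given below. It assumes a two-column, tab-delimited layout with the KEGG GENES identifier first and the external identifier second. That layout is an assumption and should be checked against the actual files. The function name `load_cross_references` and every identifier in the example are made up.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def load_cross_references(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Map each external identifier to the KEGG GENES identifiers listed for it.

    Assumed line format (not confirmed by the source):
        <KEGG GENES id> TAB <external database id>
    """
    mapping: Dict[str, List[str]] = defaultdict(list)
    for line in lines:
        line = line.strip()
        # Ignore blank lines and comment lines.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 2:
            continue  # skip malformed lines rather than guessing
        kegg_id, external_id = fields[0], fields[1]
        # One external ID may correspond to several KEGG GENES entries.
        mapping[external_id].append(kegg_id)
    return dict(mapping)


# Toy input with invented identifiers, standing in for lines read from a file.
example_lines = [
    "xxx:gene1\tncbi-geneid:111\n",
    "# comment line",
    "",
    "xxx:gene2\tuniprot:P00000",
    "yyy:g5\tncbi-geneid:111",
]
xrefs = load_cross_references(example_lines)
kegg_ids = xrefs.get("ncbi-geneid:111", [])
```

Because the function accepts any iterable of lines, an open file handle can be passed in directly in place of the example list.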
Last updated: February 12, 2010