PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG033457t1
Common NameTCM_033457
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GeBP
Protein Properties Length: 188aa    MW: 21812.2 Da    PI: 10.2441
Description GeBP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG033457t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1DUF57393.22.9e-2939130298
            DUF573   2 lfqrlwseeDeivlLqGlidfkaktgkspsddidafyefvkksisfkvsksqlveKirrLKkKfkkkvkkaksgkepsfskehdqkifelskk 94 
                        fqr+++e+De+++L +++++  k+ ++ps+di++fy+f+ ksi+++v+k ql +Ki+rLKkKf+k+ k      + +f ++h qkif ls+ 
  Thecc1EG033457t1  39 PFQRVFNEDDEVAVLEAILEYSIKKVTNPSADINGFYDFIMKSIHVNVTKAQLKDKIKRLKKKFRKNAKG-----NRTFLHSHAQKIFYLSNT 126
                       59****************************************************************9999.....789*************** PP

            DUF573  95 iWgs 98 
                       iWg+
  Thecc1EG033457t1 127 IWGQ 130
                       **96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF045043.4E-2540129IPR007592Protein of unknown function DUF573
Sequence ? help Back to Top
Protein Sequence    Length: 188 aa     Download sequence    Send to blast
MTTTTIHANC SSDSNEVKCP KRKASAVVET YREDEKKSPF QRVFNEDDEV AVLEAILEYS  60
IKKVTNPSAD INGFYDFIMK SIHVNVTKAQ LKDKIKRLKK KFRKNAKGNR TFLHSHAQKI  120
FYLSNTIWGQ EVKGKEAMEV DMGESRATSE QKWRKLEIAE LEVFLQRKKL IVEQAKLMLK  180
RLKYEHK*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
195103KRLKKKFRK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021299423.11e-109probable transcription factor At4g00390
TrEMBLA0A061FBD51e-135A0A061FBD5_THECC; Uncharacterized protein
STRINGEOY141831e-136(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM31328181
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G61730.13e-12DNA-binding storekeeper protein-related transcriptional regulator
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]