PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG013821t2
Common Name TCM_013821
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 335 aa    MW: 37800.6 Da    pI: 4.4651
Description HD-ZIP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG013821t2  genome  CGD     View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     58.6   1e-18    96     149  3          56
                       --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   3 kRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       k+++++ +q++ Le+ Fe ++++  e++ +LAk lgL+ rq+ +WFqNrRa++k
  Thecc1EG013821t2  96 KKRRLSVDQVQFLEKSFEVENKLEPERKVQLAKDLGLQPRQIAIWFQNRRARWK 149
                       566899***********************************************9 PP

2    HD-ZIP_I/II  127.8  4.6e-41  95     186  1          92
       HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreelk 92 
                       ekkrrls +qv++LE+sFe e+kLeperKv+la++Lglqprq+a+WFqnrRAR+ktkqlEkdyeaL+++y++lk++ ++L ke+++L+ee+ 
  Thecc1EG013821t2  95 EKKRRLSVDQVQFLEKSFEVENKLEPERKVQLAKDLGLQPRQIAIWFQNRRARWKTKQLEKDYEALQASYNSLKADYDNLLKEKDKLKEEVL 186
                       69**************************************************************************************9985 PP
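
The Homeobox alignment above pairs the HMM consensus row with the matching stretch of Thecc1EG013821t2. As a minimal sketch of how to read it, the snippet below counts the columns where the two rows carry the same residue, ignoring case. The two strings are copied from the alignment block; the function name `count_identities` is only illustrative and is not part of any PlantTFDB tooling.

```python
# Rows copied from the Homeobox alignment block
# (HMM positions 3-56 against protein positions 96-149).
hmm_consensus = "kRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek"
query_segment = "KKRRLSVDQVQFLEKSFEVENKLEPERKVQLAKDLGLQPRQIAIWFQNRRARWK"


def count_identities(consensus: str, query: str) -> int:
    """Count aligned columns whose residues are identical, ignoring case."""
    if len(consensus) != len(query):
        raise ValueError("aligned rows must have equal length")
    return sum(c.upper() == q.upper() for c, q in zip(consensus, query))


identities = count_identities(hmm_consensus, query_segment)
aligned_columns = len(query_segment)
print(f"{identities} of {aligned_columns} aligned columns are identical")
```

The count corresponds to the letters shown in the middle row of the alignment; the `+` symbols there mark conservative substitutions, which this sketch does not score.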

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689          1.37E-18  85     153  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.2      91     151  IPR001356    Homeobox domain
SMART            SM00389           6.1E-19   94     155  IPR001356    Homeobox domain
Pfam             PF00046           5.4E-16   96     149  IPR001356    Homeobox domain
CDD              cd00086           1.84E-17  96     152  No hit       No description
Gene3D           G3DSA:1.10.10.60  4.4E-19   98     158  IPR009057    Homeodomain-like
PRINTS           PR00031           1.2E-5    122    131  IPR000047    Helix-turn-helix motif
PROSITE pattern  PS00027           0         126    149  IPR017970    Homeobox, conserved site
PRINTS           PR00031           1.2E-5    131    147  IPR000047    Helix-turn-helix motif
Pfam             PF02183           1.7E-15   151    192  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 335 aa
MAGGRVYPSN TSSAGGSNNL SVLLQSQRVP SSSEPLDPLF IPGSSPSSFL VSGRRSMVSF  60
EDVHRANIAN RSFFRTFDQE ENGDEDLDEY FHQPEKKRRL SVDQVQFLEK SFEVENKLEP  120
ERKVQLAKDL GLQPRQIAIW FQNRRARWKT KQLEKDYEAL QASYNSLKAD YDNLLKEKDK  180
LKEEVLQLTD KLLVKEKEKG NSELSDVNKL SQEPPQKLVV AETASEGEES KVSVVACKQE  240
DISSAKSDIF DSDSPHYTDG VHSSLLEAAD SSYPFEPDQS DLSQDEEDNL SKGLLHPPSY  300
IFPKLEDDEY SDPPASSCNF GFPVEDHAFW SWAY*
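
The Start and End columns of the Signature Domain table can be checked directly against the sequence block above. The following is a minimal sketch, assuming those columns are 1-based and inclusive (the alignment rows are consistent with that reading); `clean_sequence` and `slice_1based` are hypothetical helper names introduced here for illustration.

```python
import re

# Sequence block as displayed in the record: 10-residue groups,
# running position counters, and a trailing "*" stop symbol.
displayed = """
MAGGRVYPSN TSSAGGSNNL SVLLQSQRVP SSSEPLDPLF IPGSSPSSFL VSGRRSMVSF  60
EDVHRANIAN RSFFRTFDQE ENGDEDLDEY FHQPEKKRRL SVDQVQFLEK SFEVENKLEP  120
ERKVQLAKDL GLQPRQIAIW FQNRRARWKT KQLEKDYEAL QASYNSLKAD YDNLLKEKDK  180
LKEEVLQLTD KLLVKEKEKG NSELSDVNKL SQEPPQKLVV AETASEGEES KVSVVACKQE  240
DISSAKSDIF DSDSPHYTDG VHSSLLEAAD SSYPFEPDQS DLSQDEEDNL SKGLLHPPSY  300
IFPKLEDDEY SDPPASSCNF GFPVEDHAFW SWAY*
"""


def clean_sequence(text: str) -> str:
    """Keep only letters: drops spaces, newlines, counters, and the '*'."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive."""
    return seq[start - 1:end]


protein = clean_sequence(displayed)
homeobox = slice_1based(protein, 96, 149)  # Homeobox row of the domain table
hd_zip = slice_1based(protein, 95, 186)    # HD-ZIP_I/II row of the domain table

print(len(protein), "residues")
print("Homeobox   :", homeobox)
print("HD-ZIP_I/II:", hd_zip)
```

The cleaned sequence holds one residue fewer than the stated length of 335 aa; the stated figure would match if the trailing `*` is counted, though that is an inference from the displayed text and not something the record states.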
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    143    151  RRARWKTKQ
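
To relate the NLS coordinates to the protein sequence, one option is to search for the motif and report its position under both indexing conventions. This is a sketch only: the variable names are illustrative, and the remark about zero-based offsets is an assumption drawn from the numbers, not something the record documents.

```python
# Protein sequence from the Sequence section, with the grouping spaces
# removed and the trailing "*" omitted.
protein = (
    "MAGGRVYPSN TSSAGGSNNL SVLLQSQRVP SSSEPLDPLF IPGSSPSSFL VSGRRSMVSF"
    "EDVHRANIAN RSFFRTFDQE ENGDEDLDEY FHQPEKKRRL SVDQVQFLEK SFEVENKLEP"
    "ERKVQLAKDL GLQPRQIAIW FQNRRARWKT KQLEKDYEAL QASYNSLKAD YDNLLKEKDK"
    "LKEEVLQLTD KLLVKEKEKG NSELSDVNKL SQEPPQKLVV AETASEGEES KVSVVACKQE"
    "DISSAKSDIF DSDSPHYTDG VHSSLLEAAD SSYPFEPDQS DLSQDEEDNL SKGLLHPPSY"
    "IFPKLEDDEY SDPPASSCNF GFPVEDHAFW SWAY"
).replace(" ", "")

nls_motif = "RRARWKTKQ"  # Sequence column of the NLS table

start0 = protein.find(nls_motif)      # zero-based offset of the first residue
if start0 < 0:
    raise ValueError("motif not found in sequence")
end0 = start0 + len(nls_motif) - 1    # zero-based offset of the last residue
start1, end1 = start0 + 1, end0 + 1   # the same span, 1-based and inclusive

print(f"zero-based inclusive: {start0}-{end0}")
print(f"one-based inclusive : {start1}-{end1}")
```

Under the zero-based reading the span agrees with the Start and End values in the NLS table, whereas the Signature Domain coordinates agree with the sequence under a 1-based reading, so it is worth confirming the convention before combining the two tables.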
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007037212.1  0.0      PREDICTED: homeobox-leucine zipper protein HAT5
TrEMBL  A0A061FXE8      0.0      A0A061FXE8_THECC; Homeobox protein, putative isoform 2
STRING  EOY21713        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM54528143
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G01470.1  8e-42    homeobox 1
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]