PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG011330t2
Common Name TCM_011330
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 721 aa    MW: 79072.6 Da    pI: 5.9526
Description HD-ZIP family protein
Gene Model
Gene Model ID      Type    Source  Coding Sequence
Thecc1EG011330t2   genome  CGD     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  62.4   6.7e-20  58     113  1          56
                       TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       +++ +++t++q++e+e++F+++++p+ ++r+eL ++lgL+  qVk+WFqN+R+++k
  Thecc1EG011330t2  58 KKRYHRHTQHQIHEMEAFFKECPHPDDKQRKELGRELGLEPLQVKFWFQNKRTQMK 113
                       688999***********************************************999 PP

2    START     227.8  3.2e-71  241    461  1          206
                       HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEE CS
             START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kae 80 
                       ela +a++el+++a+ +ep+W++s     +++n++e++++f+++ +     ++ ea+r+++vv+m++ +lve+l+d+  qW++ +     ka+
  Thecc1EG011330t2 241 ELAVAAMEELIRMAQMGEPLWMTSLdgttSMLNEEEYIRTFPRGIGpkptgFKCEASRETAVVIMNHINLVEILMDVH-QWSTVFSgivsKAS 332
                       57899**************************************999********************************.************** PP

                       EEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEE CS
             START  81 tlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtw 166
                       tl+v+s+g      galq+m+ae+q++splvp R++++vRy++q+ +g+w++vdvS+d+ ++ p+    vR++++pSg+li++++ng+skvtw
  Thecc1EG011330t2 333 TLDVLSTGvagnynGALQVMTAEFQVPSPLVPtRESYYVRYCKQHAEGTWAVVDVSLDNLRPSPT----VRCRRRPSGCLIQEMPNGYSKVTW 421
                       **************************************************************996....************************ PP

                       EE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
             START 167 vehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                       vehv++++r +h+l+++lv+sg+a+gak+w atl+rqce+
  Thecc1EG011330t2 422 VEHVEVDDRGVHNLYKQLVSSGHAFGAKRWIATLDRQCER 461
                       **************************************97 PP
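
The coordinate rows in the alignments above can be checked mechanically. The following is a minimal sketch, assuming each coordinate row has the layout "name, start, aligned string, end" and that "-" and "." mark gaps (as in HMMER-style text output, which this page appears to use but does not name). The names ROW, parse_row, and is_consistent are hypothetical helpers introduced only for illustration.

```python
import re

# One query row copied verbatim from the Homeobox alignment above.
ROW = "  Thecc1EG011330t2  58 KKRYHRHTQHQIHEMEAFFKECPHPDDKQRKELGRELGLEPLQVKFWFQNKRTQMK 113"

# Assumed row layout: name, start coordinate, aligned string, end coordinate.
ROW_PATTERN = re.compile(r"^\s*(\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def parse_row(line):
    """Split a coordinate row into (name, start, aligned string, end)."""
    match = ROW_PATTERN.match(line)
    if match is None:
        raise ValueError("not a coordinate row")
    name, start, aligned, end = match.groups()
    return name, int(start), aligned, int(end)


def is_consistent(start, aligned, end):
    """Check that the non-gap characters account for start..end inclusive."""
    residues = sum(1 for ch in aligned if ch not in "-.")
    return residues == end - start + 1


name, start, aligned, end = parse_row(ROW)
print(name, start, end, is_consistent(start, aligned, end))
```

Rows without coordinates (the CS, match, and PP lines) do not fit the assumed pattern and raise ValueError, so a caller would skip them.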

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   9.3E-22    37     113  IPR009057    Homeodomain-like
SuperFamily      SSF46689           3.55E-19   44     115  IPR009057    Homeodomain-like
PROSITE profile  PS50071            16.584     55     115  IPR001356    Homeobox domain
SMART            SM00389            1.3E-19    56     119  IPR001356    Homeobox domain
CDD              cd00086            1.88E-18   58     116  No hit       No description
Pfam             PF00046            1.7E-17    58     113  IPR001356    Homeobox domain
PROSITE profile  PS50848            45.652     232    464  IPR002913    START domain
SuperFamily      SSF55961           3.02E-36   234    463  No hit       No description
CDD              cd08875            3.74E-132  236    460  No hit       No description
SMART            SM00234            3.0E-68    241    461  IPR002913    START domain
Pfam             PF01852            2.0E-60    242    461  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  4.5E-6     337    447  IPR023393    START-like domain
SuperFamily      SSF55961           5.22E-24   481    712  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0010090  Biological Process  trichome morphogenesis
GO:0048497  Biological Process  maintenance of floral organ identity
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 721 aa
MFQPNMMEGQ LHPLEMTQNT SESEIARMRD EEFDSTTKSG SENHEGASGD DQDPRPKKKR  60
YHRHTQHQIH EMEAFFKECP HPDDKQRKEL GRELGLEPLQ VKFWFQNKRT QMKTQHERQE  120
NTQLRTENEK LRADNMRFRE ALSTASCPNC GGPTAVGQMS FDEHHLRLEN ARLREEIDRI  180
SAIAAKYVGK PVVNYPLLSS PMPPRPLDFG AQPGTGEMYG AGDLLRSISA PSEADKPMII  240
ELAVAAMEEL IRMAQMGEPL WMTSLDGTTS MLNEEEYIRT FPRGIGPKPT GFKCEASRET  300
AVVIMNHINL VEILMDVHQW STVFSGIVSK ASTLDVLSTG VAGNYNGALQ VMTAEFQVPS  360
PLVPTRESYY VRYCKQHAEG TWAVVDVSLD NLRPSPTVRC RRRPSGCLIQ EMPNGYSKVT  420
WVEHVEVDDR GVHNLYKQLV SSGHAFGAKR WIATLDRQCE RLASVMATNI PTGDVGVITN  480
QDGRKSMLKL AERMVISFCA GVSASTAHTW TTLSGTGADD VRVMTRKSVD DPGRPPGIVL  540
SAATSFWLPV SPKRVFDFLR DENSRSEWDI LSNGGVVQEM AHIANGRDTG NCVSLLRVNS  600
ANSSQSNMLI LQESCADPTA SFVIYAPVDI VAMNVVLNGG DPDYVALLPS GFAILPDGTT  660
ASAGGIGDAG SAGSLLTVAF QILVDSVPTA KLSLGSVATV NNLIACTVER IKASLSCENA  720
*
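
The domain coordinates listed under Signature Domain are 1-based and inclusive, so a domain can be cut out of the sequence block directly. The following is a minimal sketch that uses only the first two lines of the block above (residues 1-120), which is enough to cover the Homeobox domain at 58-113. The names RAW_BLOCK, clean_sequence, and slice_domain are hypothetical helpers introduced for illustration.

```python
import re

# First two lines of the sequence block above, copied verbatim
# (residues 1-120); the full record continues beyond this excerpt.
RAW_BLOCK = """\
MFQPNMMEGQ LHPLEMTQNT SESEIARMRD EEFDSTTKSG SENHEGASGD DQDPRPKKKR  60
YHRHTQHQIH EMEAFFKECP HPDDKQRKEL GRELGLEPLQ VKFWFQNKRT QMKTQHERQE  120
"""


def clean_sequence(block):
    """Drop spaces, position counters, newlines and any '*' terminator."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_domain(sequence, start, end):
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


protein = clean_sequence(RAW_BLOCK)
homeobox = slice_domain(protein, 58, 113)
print(homeobox)
```

The extracted string should match the query row of the Homeobox alignment in the Signature Domain section, which gives a quick cross-check between the two parts of the record.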
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_007045612.1  0.0      PREDICTED: homeobox-leucine zipper protein HDG2 isoform X4
Swissprot  Q94C37          0.0      HDG2_ARATH; Homeobox-leucine zipper protein HDG2
TrEMBL     A0A061EGL0      0.0      A0A061EGL0_THECC; Homeodomain GLABROUS 2 isoform 2
STRING     EOY01443        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G05230.4  0.0      homeodomain GLABROUS 2
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]