PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG005412t2
Common Name TCM_005412
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 819 aa    MW: 88681.7 Da    pI: 6.279
Description HD-ZIP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG005412t2  genome  CGD     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  66.1   4.6e-21  113    168  1          56
                       TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       +++ +++t++q++eLe+lF+++++p++++r eL+k+l L++rqVk+WFqNrR+++k
  Thecc1EG005412t2 113 KKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMK 168
                       688999***********************************************999 PP

2    START     210.6  5.9e-66  322    546  1          206
                       HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEE CS
             START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kae 80 
                       ela++a++elvk+a+ +ep+W +s     e++n de+l++f++  +     + +ea+r++gvv+ ++  lve+l+d+  +W e+++    + +
  Thecc1EG005412t2 322 ELALAAMDELVKMAQTDEPLWIRSLeggrEILNHDEYLRTFTPCIGmkpggFVTEASRETGVVIINSLALVETLMDST-RWAEMFPcmiaRTS 413
                       5899**************************************9998999*****************************.************** PP

                       EEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEE CS
             START  81 tlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtw 166
                       t++vissg      galqlm aelq+lsplvp R++ f+R+++q+ +g+w++vdvS+d  ++ +  + +v +++lpSg+++++++ng+skvtw
  Thecc1EG005412t2 414 TTDVISSGmggtrnGALQLMHAELQVLSPLVPvREVNFLRFCKQHAEGVWAVVDVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNGYSKVTW 506
                       **************************************************************999**************************** PP

                       EE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
             START 167 vehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                       veh++++++++h+l+r+l++sg+ +ga++wvatlqrqce+
  Thecc1EG005412t2 507 VEHAEYEESQVHQLYRPLLSSGMGFGAQRWVATLQRQCEC 546
                       **************************************96 PP

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689          9.84E-21   95     170  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  6.4E-22    100    170  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.216     110    170  IPR001356    Homeobox domain
SMART            SM00389           9.9E-18    111    174  IPR001356    Homeobox domain
CDD              cd00086           1.32E-18   112    170  No hit       No description
Pfam             PF00046           1.4E-18    113    168  IPR001356    Homeobox domain
PROSITE pattern  PS00027           0          145    168  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           44.892     313    549  IPR002913    START domain
SuperFamily      SSF55961          3.3E-34    315    546  No hit       No description
CDD              cd08875           5.18E-130  317    545  No hit       No description
Pfam             PF01852           1.7E-57    322    546  IPR002913    START domain
SMART            SM00234           3.8E-51    322    546  IPR002913    START domain
SuperFamily      SSF55961          5.22E-24   574    811  No hit       No description
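Several member databases report the same domain with slightly different boundaries, so the hits above overlap heavily. As a minimal Python sketch, assuming the Start and End columns are 1-based inclusive residue positions, the hits can be collapsed into non-overlapping regions. The `FEATURES` list and the `merge_regions` helper are illustrative names, not part of PlantTFDB.

```python
# (database, start, end) copied from the Protein Features table.
FEATURES = [
    ("SuperFamily", 95, 170),
    ("Gene3D", 100, 170),
    ("PROSITE profile", 110, 170),
    ("SMART", 111, 174),
    ("CDD", 112, 170),
    ("Pfam", 113, 168),
    ("PROSITE pattern", 145, 168),
    ("PROSITE profile", 313, 549),
    ("SuperFamily", 315, 546),
    ("CDD", 317, 545),
    ("Pfam", 322, 546),
    ("SMART", 322, 546),
    ("SuperFamily", 574, 811),
]


def merge_regions(features):
    """Merge overlapping (start, end) intervals into sorted, disjoint regions."""
    regions = []
    for _, start, end in sorted(features, key=lambda f: (f[1], f[2])):
        if regions and start <= regions[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            regions[-1][1] = max(regions[-1][1], end)
        else:
            regions.append([start, end])
    return [tuple(r) for r in regions]


regions = merge_regions(FEATURES)
for start, end in regions:
    print(f"{start}-{end} ({end - start + 1} residues)")
```

Under these assumptions the thirteen hits reduce to three regions: 95-174 (homeodomain hits), 313-549 (START domain hits) and 574-811 (the second SuperFamily SSF55961 hit).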
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0009827  Biological Process  plant-type cell wall modification
GO:0042335  Biological Process  cuticle development
GO:0043481  Biological Process  anthocyanin accumulation in tissues in response to UV light
GO:0048765  Biological Process  root hair cell differentiation
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 819 aa
MSFGGFLDNS SGGGGARIVA DIPYSNNMPT GAIAQPRLVS PSLAKNMFNS PGLSLALQPN  60
IDNQGDGTRM GENFEGSVGR RSREEEHESR SGSDNMDGGS GDDQDAADNP PRKKRYHRHT  120
PQQIQELEAL FKECPHPDEK QRLELSKRLC LETRQVKFWF QNRRTQMKTQ LERHENSLLR  180
QENDKLRAEN MSIRDAMRNP ICTNCGGPAI IGDISLEEQH LRIENARLKD ELDRVCALAG  240
KFLGRPISAL ATSIAPPMPN SSLELGVGSN GFGGLSTVPT TLPLGPDFGG GITNALPVAP  300
PNRPTTGVTG LDRSVERSMF LELALAAMDE LVKMAQTDEP LWIRSLEGGR EILNHDEYLR  360
TFTPCIGMKP GGFVTEASRE TGVVIINSLA LVETLMDSTR WAEMFPCMIA RTSTTDVISS  420
GMGGTRNGAL QLMHAELQVL SPLVPVREVN FLRFCKQHAE GVWAVVDVSI DTIRETSGAP  480
TFVNCRRLPS GCVVQDMPNG YSKVTWVEHA EYEESQVHQL YRPLLSSGMG FGAQRWVATL  540
QRQCECLAIL MSSTVPTRDH TAITASGRRS MLKLAQRMTD NFCAGVCAST LHKWNKLNNA  600
GNVDEDVRVM TRKSVDDPGE PPGIVLSAAT SVWLPVSPQR LFDFLRDERL RSEWDILSNG  660
GPMQEMAHIA KGQDHGNCVS LLRASAMNAN QSSMLILQET CIDAAGSLVV YAPVDIPAMH  720
VVMNGGDSAY VALLPSGFAI VPDGPGSRGP TSNGHVNGNG GGGGGRSQRV GGSLLTVAFQ  780
ILVNSLPTAK LTVESVETVN NLISCTVQKI KAALQCES*
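To work with the sequence programmatically, the position counters and spacing have to be stripped first. The sketch below shows one way to do that in Python and then slice out the two signature domains. `SEQ_BLOCK`, `parse_numbered_sequence` and `slice_region` are illustrative names, and the coordinates are assumed to be 1-based and inclusive, as in the Signature Domain table.

```python
import re

# Sequence block as displayed above, including the running position counters.
SEQ_BLOCK = """
MSFGGFLDNS SGGGGARIVA DIPYSNNMPT GAIAQPRLVS PSLAKNMFNS PGLSLALQPN  60
IDNQGDGTRM GENFEGSVGR RSREEEHESR SGSDNMDGGS GDDQDAADNP PRKKRYHRHT  120
PQQIQELEAL FKECPHPDEK QRLELSKRLC LETRQVKFWF QNRRTQMKTQ LERHENSLLR  180
QENDKLRAEN MSIRDAMRNP ICTNCGGPAI IGDISLEEQH LRIENARLKD ELDRVCALAG  240
KFLGRPISAL ATSIAPPMPN SSLELGVGSN GFGGLSTVPT TLPLGPDFGG GITNALPVAP  300
PNRPTTGVTG LDRSVERSMF LELALAAMDE LVKMAQTDEP LWIRSLEGGR EILNHDEYLR  360
TFTPCIGMKP GGFVTEASRE TGVVIINSLA LVETLMDSTR WAEMFPCMIA RTSTTDVISS  420
GMGGTRNGAL QLMHAELQVL SPLVPVREVN FLRFCKQHAE GVWAVVDVSI DTIRETSGAP  480
TFVNCRRLPS GCVVQDMPNG YSKVTWVEHA EYEESQVHQL YRPLLSSGMG FGAQRWVATL  540
QRQCECLAIL MSSTVPTRDH TAITASGRRS MLKLAQRMTD NFCAGVCAST LHKWNKLNNA  600
GNVDEDVRVM TRKSVDDPGE PPGIVLSAAT SVWLPVSPQR LFDFLRDERL RSEWDILSNG  660
GPMQEMAHIA KGQDHGNCVS LLRASAMNAN QSSMLILQET CIDAAGSLVV YAPVDIPAMH  720
VVMNGGDSAY VALLPSGFAI VPDGPGSRGP TSNGHVNGNG GGGGGRSQRV GGSLLTVAFQ  780
ILVNSLPTAK LTVESVETVN NLISCTVQKI KAALQCES*
"""


def parse_numbered_sequence(block: str) -> str:
    """Drop position counters and whitespace, keeping residues and '*'."""
    return re.sub(r"[\s\d]", "", block)


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = parse_numbered_sequence(SEQ_BLOCK)
protein = seq.rstrip("*")  # '*' is the stop symbol, not a residue

homeobox = slice_region(protein, 113, 168)
start_domain = slice_region(protein, 322, 546)

print(len(seq), len(protein))
print(homeobox)
print(len(start_domain))
```

With this block, the Homeobox slice (113-168) reproduces the aligned query segment KKRYHRHT...NRRTQMK from the Signature Domain section. The cleaned string has 818 residues plus the terminal `*`, so the stated length of 819 appears to count the stop symbol.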
Functional Description
Source   Description
UniProt  Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Binding Motif
Motif ID  Method  Source                   Motif file
MP00421   DAP     Transfer from AT4G00730  Download
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_007051913.2  0.0      PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform X2
Swissprot  Q0WV12          0.0      ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL     A0A061DTK7      0.0      A0A061DTK7_THECC; HD domain class transcription factor isoform 2
STRING     EOX96069        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00730.1  0.0      HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]