PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG018470t1
Common Name TCM_018470
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 284aa    MW: 31999.1 Da    PI: 6.6047
Description HD-ZIP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG018470t1  genome  CGD     View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     56.4   4.9e-18  82     134  4          56
                       -SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   4 RttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       + +++ eq+++Le+ Fe  +++  e++ +LAk lgL+ rq+ +WFqNrRa++k
  Thecc1EG018470t1  82 KKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWK 134
                       4568889*********************************************9 PP

2    HD-ZIP_I/II  129    1.9e-41  80     170  1          91
       HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreel 91 
                       ekk+rl+ eqvk+LE+sFe  +kLeperK++la++Lglqprq+a+WFqnrRAR+ktkqlEkdyeaLk+++dalk++n++L++++++L +el
  Thecc1EG018470t1  80 EKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLEKDYEALKKQFDALKADNDALQAQNKKLSAEL 170
                       69*************************************************************************************9988 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689          2.31E-19  71     138  IPR009057    Homeodomain-like
PROSITE profile  PS50071           16.811    76     136  IPR001356    Homeobox domain
SMART            SM00389           1.5E-17   79     140  IPR001356    Homeobox domain
CDD              cd00086           3.92E-16  81     137  No hit       No description
Pfam             PF00046           2.7E-15   82     134  IPR001356    Homeobox domain
Gene3D           G3DSA:1.10.10.60  4.0E-19   84     144  IPR009057    Homeodomain-like
PRINTS           PR00031           9.9E-6    107    116  IPR000047    Helix-turn-helix motif
PROSITE pattern  PS00027           0         111    134  IPR017970    Homeobox, conserved site
PRINTS           PR00031           9.9E-6    116    132  IPR000047    Helix-turn-helix motif
Pfam             PF02183           2.5E-14   136    176  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0009733  Biological Process  response to auxin
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 284 aa
MIFSIDMAFP PHAFLFQSHE DNDHLPSPTS LSSLPSCPPQ LFHGGAPLMM KRSVSFSGVD  60
KSEEVHGDDE LSDDGSHIGE KKKRLNLEQV KALEKSFELG NKLEPERKMQ LAKALGLQPR  120
QIAIWFQNRR ARWKTKQLEK DYEALKKQFD ALKADNDALQ AQNKKLSAEL LALKTKDSNE  180
ISIKKENEGS WSNGSDNSCD VNLDISRKTV ITSPVSSQLS SKHFFPSSVR PASMTQLLQG  240
SSRPDLQCLK LDQVVQEESF CHMFNGVEEQ QGFWPWAEQQ NFH*
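
The Start and End columns of the Signature Domain table can be checked against the sequence block above. The following is a minimal sketch, assuming those coordinates are 1-based and inclusive (which is what the alignment lines suggest); the helper names `clean_sequence` and `region` are illustrative and not part of PlantTFDB. Note that the cleaned sequence holds 283 residues, so the reported length of 284 appears to count the trailing `*`.

```python
# Sequence block as shown in the record, including the position
# counters at line ends and the trailing '*' stop symbol.
RAW = """
MIFSIDMAFP PHAFLFQSHE DNDHLPSPTS LSSLPSCPPQ LFHGGAPLMM KRSVSFSGVD  60
KSEEVHGDDE LSDDGSHIGE KKKRLNLEQV KALEKSFELG NKLEPERKMQ LAKALGLQPR  120
QIAIWFQNRR ARWKTKQLEK DYEALKKQFD ALKADNDALQ AQNKKLSAEL LALKTKDSNE  180
ISIKKENEGS WSNGSDNSCD VNLDISRKTV ITSPVSSQLS SKHFFPSSVR PASMTQLLQG  240
SSRPDLQCLK LDQVVQEESF CHMFNGVEEQ QGFWPWAEQQ NFH*
"""


def clean_sequence(raw: str) -> str:
    """Keep letters only: drops spaces, position counters, and '*'."""
    return "".join(ch for ch in raw if ch.isalpha())


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW)

# Coordinates taken from the Signature Domain table.
homeobox = region(SEQ, 82, 134)   # Homeobox
hdzip = region(SEQ, 80, 170)      # HD-ZIP_I/II

print(len(SEQ))
print(homeobox)
print(hdzip)
```

Under that assumption, `region(SEQ, 82, 134)` should reproduce the Thecc1EG018470t1 line of the Homeobox alignment, and `region(SEQ, 80, 170)` the corresponding line of the HD-ZIP_I/II alignment.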
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    128    136  RRARWKTKQ
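
The NLS coordinates can be located in the protein sequence with a plain substring search. This is a sketch under stated assumptions: `SEQ_1_180` is simply the first 180 residues of the sequence block with spaces and counters removed, and the variable names are illustrative. The search reports both 0-based and 1-based positions so they can be compared with the table.

```python
# First 180 residues of the protein sequence (spaces and counters removed).
SEQ_1_180 = (
    "MIFSIDMAFPPHAFLFQSHEDNDHLPSPTSLSSLPSCPPQLFHGGAPLMMKRSVSFSGVD"
    "KSEEVHGDDELSDDGSHIGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPR"
    "QIAIWFQNRRARWKTKQLEKDYEALKKQFDALKADNDALQAQNKKLSAELLALKTKDSNE"
)

NLS = "RRARWKTKQ"

idx = SEQ_1_180.find(NLS)          # 0-based index of the first match, -1 if absent
if idx == -1:
    raise ValueError("NLS motif not found in sequence")

start_0, end_0 = idx, idx + len(NLS) - 1   # 0-based, inclusive
start_1, end_1 = idx + 1, idx + len(NLS)   # 1-based, inclusive

print("0-based:", start_0, end_0)
print("1-based:", start_1, end_1)
```

On this record the 0-based positions match the Start and End values listed for the NLS, while the domain tables line up with 1-based numbering. That difference is an inference from the data shown here, not documented behavior of the database.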
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Binding Motif
Motif ID  Method  Source
MP00322   DAP     Transfer from AT3G01220
Regulation -- Description
Source   Description
UniProt  INDUCTION: By auxin. {ECO:0000269|PubMed:12644682}.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_007032488.1  0.0      PREDICTED: homeobox-leucine zipper protein ATHB-20
Swissprot  Q8LAT0          7e-96    ATB20_ARATH; Homeobox-leucine zipper protein ATHB-20
TrEMBL     A0A061EM69      0.0      A0A061EM69_THECC; Homeobox protein 20
STRING     EOY03414        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM30952666
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G01220.1  8e-89    homeobox protein 20
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]