PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG001039t3
Common Name TCM_001039
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 318 aa    MW: 35828.3 Da    pI: 4.6335
Description HD-ZIP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG001039t3  genome  CGD     View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     59.5   5.5e-19  44     97   3          56
                      --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                      k+++++ +q+++Le+ Fe ++++  e++ +LA++lgL+ rqV vWFqNrRa++k
  Thecc1EG001039t3 44 KKRRLSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWK 97
                      566899***********************************************9 PP

2    HD-ZIP_I/II  128.8  2.3e-41  43     135  1          93
       HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreelke 93 
                       ekkrrls +qvk+LE++Fe e+kLeperKv+la+eLglqprqvavWFqnrRAR+ktkqlE+dy  Lk++y++lk + ++L++++e+L +e++e
  Thecc1EG001039t3  43 EKKRRLSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGLLKTSYETLKVNYDTLQHDNEALLKEIRE 135
                       69*************************************************************************************999986 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689          2.65E-19  30     101  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.346    39     99   IPR001356    Homeobox domain
SMART            SM00389           1.1E-17   42     103  IPR001356    Homeobox domain
Pfam             PF00046           3.6E-16   44     97   IPR001356    Homeobox domain
CDD              cd00086           1.21E-16  44     100  No hit       No description
Gene3D           G3DSA:1.10.10.60  9.5E-20   46     105  IPR009057    Homeodomain-like
PRINTS           PR00031           8.9E-6    70     79   IPR000047    Helix-turn-helix motif
PROSITE pattern  PS00027           0         74     97   IPR017970    Homeobox, conserved site
PRINTS           PR00031           8.9E-6    79     95   IPR000047    Helix-turn-helix motif
Pfam             PF02183           4.4E-16   99     140  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0009637  Biological Process  response to blue light
GO:0030308  Biological Process  negative regulation of cell growth
GO:0045893  Biological Process  positive regulation of transcription, DNA-templated
GO:0048510  Biological Process  regulation of timing of transition from vegetative to reproductive phase
GO:0048573  Biological Process  photoperiodism, flowering
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0008483  Molecular Function  transaminase activity
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 318 aa
MSICPTTDEH SPRNNHIYSR EFQSMLDGLD EEGCVEESGH VAEKKRRLSV DQVKALEKNF  60
EVENKLEPER KVKLAQELGL QPRQVAVWFQ NRRARWKTKQ LERDYGLLKT SYETLKVNYD  120
TLQHDNEALL KEIRELKAKL NGESTESNLS VKEEVIVHET DNKTLEQSEP PPVSSLVTSS  180
EPAELNYESF NNSIGSVGAT LFPDLKDGSS DSDSSAILNE DNNNCSPNNA AISSSGVLQS  240
QQHLLMSPTT TSSLNFNSSS SSPSSMNCFQ FSKSTYQPSH QYVKMEEHNF FSADEACNFF  300
SDEQAPSLHW YSPEQWN*
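
The domain coordinates listed under Signature Domain can be mapped back onto this sequence block. The following is a minimal Python sketch, not part of the database record: the helper names `parse_sequence_block` and `slice_region` are hypothetical, and it assumes the Start and End columns are 1-based and inclusive.

```python
# Sequence block as displayed in the record: groups of 10 residues, a running
# residue count at the end of each full line, and a trailing "*" stop symbol.
RAW_BLOCK = """
MSICPTTDEH SPRNNHIYSR EFQSMLDGLD EEGCVEESGH VAEKKRRLSV DQVKALEKNF  60
EVENKLEPER KVKLAQELGL QPRQVAVWFQ NRRARWKTKQ LERDYGLLKT SYETLKVNYD  120
TLQHDNEALL KEIRELKAKL NGESTESNLS VKEEVIVHET DNKTLEQSEP PPVSSLVTSS  180
EPAELNYESF NNSIGSVGAT LFPDLKDGSS DSDSSAILNE DNNNCSPNNA AISSSGVLQS  240
QQHLLMSPTT TSSLNFNSSS SSPSSMNCFQ FSKSTYQPSH QYVKMEEHNF FSADEACNFF  300
SDEQAPSLHW YSPEQWN*
"""


def parse_sequence_block(block: str) -> str:
    """Keep residue letters and the stop symbol; drop spaces, newlines and counts."""
    return "".join(ch for ch in block if ch.isalpha() or ch == "*")


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end (assumed 1-based, inclusive coordinates)."""
    return seq[start - 1:end]


sequence = parse_sequence_block(RAW_BLOCK)
homeobox = slice_region(sequence, 44, 97)     # Homeobox row: Start 44, End 97
hd_zip = slice_region(sequence, 43, 135)      # HD-ZIP_I/II row: Start 43, End 135

print(len(sequence))   # residue count, including the trailing "*"
print(homeobox)
print(hd_zip)
```

Under that coordinate assumption, the 44 to 97 slice reproduces the Thecc1EG001039t3 line of the Homeobox alignment, and the reported length of 318 matches the block only when the trailing `*` is counted.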
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    91     99   RRARWKTKQ
Binding Motif
Motif ID  Method  Source                   Motif file
MP00051   PBM     Transfer from AT4G40060  Download
Motif logo
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HQ527086  3e-40    HQ527086.1 Gossypium herbaceum clone NBRI_D2_391 simple sequence repeat marker, mRNA sequence.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007047858.2  0.0      PREDICTED: homeobox-leucine zipper protein ATHB-6
TrEMBL  A0A061DJ94      0.0      A0A061DJ94_THECC; Alanine--glyoxylate aminotransferase 2 isoform 1
STRING  EOX92016        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G40060.1  5e-70    homeobox protein 16
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]