PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG006767t2
Common Name TCM_006767
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family BBR-BPC
Protein Properties Length: 311aa    MW: 34690.6 Da    PI: 9.8892
Description BBR-BPC family protein
Gene Model
Gene Model ID       Type      Source    Coding Sequence
Thecc1EG006767t2    genome    CGD       View CDS
Signature Domain
No.    Domain       Score    E-value     Start    End    HMM Start    HMM End
1      GAGA_bind    355      1.6e-108    1        310    1            301
         GAGA_bind   1 mdddgsre..rnkg.yye................paaslkenlglqlmssiaerdakirernlalsekkaavaerdmaflqrdkalaernkal 74 
                       md  g++e  r k  yy+                ++++l   +++++ms++aerda+irern+a+sekk+a+a+rd+a++qrdkalaer+ al
  Thecc1EG006767t2   1 MDGAGQQEsgRYKLdYYKgahtpwnmmpqhhmkeQNNALV--MNKKIMSILAERDAAIRERNIAISEKKEALAARDEALQQRDKALAERDSAL 91 
                       7777877777877778889999999996555532224444..899************************************************ PP

         GAGA_bind  75 verdnkllalllvenslasalpvgvqvlsgtksidslqqlsepqledsavelreeeklealpieeaaeeakekkkkkkrqrakkpkekkakkk 167
                       ++rdn+l+ l+++en++  + p g ++ +g k ++   + + ++++++     e++ ++alp+++ a e+ +++ +k+++++k   +k a+k+
  Thecc1EG006767t2  92 MDRDNALAVLQYRENAM--NFPLGGGIQRGGKRMHP--TYHSTDVGETLNS--EMHVTDALPVSTIACEEGKSRPVKRTKENKAVSSKSARKV 178
                       ***************95..568899999****9994..4456677776655..9***********************99999999999999** PP

         GAGA_bind 168 kkksekskkkvkkesaderskaekksidlvlngvslDestlPvPvCsCtGalrqCYkWGnGGWqSaCCtttiSvyPLPvstkrrgaRiagrKm 260
                       kk  e+ +++  +e   ++ k+e++ +d++ln v++De+t+PvPvCsCtG++rqCYkWGnGGWqS+CCttt+S yPLP+++++r+aR++grKm
  Thecc1EG006767t2 179 KKVAEDLNRQAGTEV--KKCKSEWNGQDIGLNMVNFDETTMPVPVCSCTGVPRQCYKWGNGGWQSSCCTTTMSSYPLPQMPNKRHARVGGRKM 269
                       **9999999999998..57************************************************************************** PP

         GAGA_bind 261 SqgafkklLekLaaeGydlsnpvDLkdhWAkHGtnkfvtir 301
                       S+++f+klL++LaaeG dls p+DLk++WA+HGtn+++ti+
  Thecc1EG006767t2 270 SGSVFTKLLSRLAAEGQDLSIPLDLKNYWARHGTNRYITIK 310
                       ****************************************8 PP

Protein Features
3D Structure
Database    Entry ID    E-value     Start    End    InterPro ID    Description
SMART       SM01226     8.1E-157    1        310    IPR010409      GAGA-binding transcriptional activator
Pfam        PF06217     3.8E-100    13       310    IPR010409      GAGA-binding transcriptional activator
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0009723    Biological Process    response to ethylene
GO:0050793    Biological Process    regulation of developmental process
GO:0005634    Cellular Component    nucleus
GO:0003677    Molecular Function    DNA binding
GO:0003700    Molecular Function    transcription factor activity, sequence-specific DNA binding
GO:0042803    Molecular Function    protein homodimerization activity
Sequence
Protein Sequence    Length: 311 aa
MDGAGQQESG RYKLDYYKGA HTPWNMMPQH HMKEQNNALV MNKKIMSILA ERDAAIRERN  60
IAISEKKEAL AARDEALQQR DKALAERDSA LMDRDNALAV LQYRENAMNF PLGGGIQRGG  120
KRMHPTYHST DVGETLNSEM HVTDALPVST IACEEGKSRP VKRTKENKAV SSKSARKVKK  180
VAEDLNRQAG TEVKKCKSEW NGQDIGLNMV NFDETTMPVP VCSCTGVPRQ CYKWGNGGWQ  240
SSCCTTTMSS YPLPQMPNKR HARVGGRKMS GSVFTKLLSR LAAEGQDLSI PLDLKNYWAR  300
HGTNRYITIK *
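
The sequence block above is formatted for display: residues are grouped in tens, each full row ends with a running position count, and the final `*` marks the stop. A minimal Python sketch, assuming the block has been copied verbatim into a string, strips that formatting and counts the residues. The function name `clean_protein_block` is hypothetical and is not part of any PlantTFDB tool.

```python
import re

# Display-formatted block copied verbatim from the record above.
RAW_BLOCK = """
MDGAGQQESG RYKLDYYKGA HTPWNMMPQH HMKEQNNALV MNKKIMSILA ERDAAIRERN  60
IAISEKKEAL AARDEALQQR DKALAERDSA LMDRDNALAV LQYRENAMNF PLGGGIQRGG  120
KRMHPTYHST DVGETLNSEM HVTDALPVST IACEEGKSRP VKRTKENKAV SSKSARKVKK  180
VAEDLNRQAG TEVKKCKSEW NGQDIGLNMV NFDETTMPVP VCSCTGVPRQ CYKWGNGGWQ  240
SSCCTTTMSS YPLPQMPNKR HARVGGRKMS GSVFTKLLSR LAAEGQDLSI PLDLKNYWAR  300
HGTNRYITIK *
"""


def clean_protein_block(block: str) -> str:
    """Keep only residue letters: drops spaces, position counts, and '*'."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


sequence = clean_protein_block(RAW_BLOCK)
print(len(sequence))  # residue count, excluding the stop marker
```

For this record the cleaned string has 310 residues, which agrees with the alignment end coordinate of 310 in the Signature Domain section. The listed length of 311 aa presumably counts the terminal `*` as well, although the record does not say so.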
Functional Description
Source     Description
UniProt    Transcriptional regulator that specifically binds to GA-rich elements (GAGA-repeats) present in regulatory sequences of genes involved in developmental processes. {ECO:0000269|PubMed:14731261}.
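
To make "GA-rich elements (GAGA-repeats)" concrete, the following Python sketch scans a DNA string for runs of the GA dinucleotide on either strand. The minimum repeat count, the function name `find_ga_repeats`, and the example fragment are illustrative assumptions; none of them come from the cited study or from PlantTFDB, and real binding-site prediction would rely on an experimentally derived motif model rather than a plain repeat search.

```python
import re


def find_ga_repeats(dna: str, min_repeats: int = 3):
    """Return (start, end, strand) for runs of at least min_repeats GA units.

    Coordinates are 0-based and end-exclusive on the given (plus) strand.
    A TC run on the plus strand is reported as '-' because its reverse
    complement is a GA run on the minus strand.
    """
    dna = dna.upper()
    hits = []
    for unit, strand in (("GA", "+"), ("TC", "-")):
        pattern = re.compile(r"(?:%s){%d,}" % (unit, min_repeats))
        for match in pattern.finditer(dna):
            hits.append((match.start(), match.end(), strand))
    return sorted(hits)


# Hypothetical promoter fragment, made up for illustration only.
example = "ATGAGAGAGATTCCTCTCTCTCAA"
for start, end, strand in find_ga_repeats(example):
    print(start, end, strand, example[start:end])
```

Searching both strands matters because a factor that recognizes GAGA on one strand sees the same site as TCTC when the sequence is read from the other strand.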
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Protein
Source       Hit ID            E-value    Description
Refseq       XP_021286691.1    0.0        protein BASIC PENTACYSTEINE4-like
Refseq       XP_021286692.1    0.0        protein BASIC PENTACYSTEINE4-like
Swissprot    Q8S8C6            1e-142     BPC4_ARATH; Protein BASIC PENTACYSTEINE4
TrEMBL       A0A061DYQ6        0.0        A0A061DYQ6_THECC; GAGA-motif binding transcriptional activator isoform 1
STRING       EOX97843          0.0        (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G21240.2    1e-138     basic pentacysteine 4
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]