PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG034172t1
Common Name TCM_034172
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family bZIP
Protein Properties Length: 562aa    MW: 61297.6 Da    PI: 6.4629
Description bZIP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG034172t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    bZIP_1  37.1   6.7e-12  396    454  5          63
                       CHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH CS
            bZIP_1   5 krerrkqkNReAArrsRqRKkaeieeLeekvkeLeaeNkaLkkeleelkkevaklksev 63 
                       kr++r   NR +A rs +RK  +i+eLe kv++L++e ++L  +l +l+  +a l+s++
  Thecc1EG034172t1 396 KRAKRILANRQSAARSKERKMRYIAELEHKVQTLQTEATTLSAQLTMLQRDSAGLTSQN 454
                       9****************************************************999998 PP
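The alignment above pairs the bZIP_1 profile consensus (HMM positions 5 to 63) with residues 396 to 454 of Thecc1EG034172t1. As a rough illustration, the sketch below counts the positions where the consensus and the protein carry the same residue. The two strings are copied from the alignment; the function name `count_identical` is hypothetical, and treating the consensus case-insensitively is an assumption made for this comparison, not part of the database output.

```python
# Consensus and target rows copied from the alignment block of this record.
CONSENSUS = "krerrkqkNReAArrsRqRKkaeieeLeekvkeLeaeNkaLkkeleelkkevaklksev"
TARGET = "KRAKRILANRQSAARSKERKMRYIAELEHKVQTLQTEATTLSAQLTMLQRDSAGLTSQN"


def count_identical(consensus: str, target: str) -> int:
    """Count aligned columns where both rows hold the same residue.

    Assumes an ungapped alignment of equal length and ignores letter case
    in the consensus row.
    """
    if len(consensus) != len(target):
        raise ValueError("rows must have the same length")
    return sum(c.upper() == t.upper() for c, t in zip(consensus, target))


aligned_length = len(TARGET)
identical = count_identical(CONSENSUS, TARGET)
print(f"{identical} of {aligned_length} aligned positions are identical")
```

The aligned length agrees with both coordinate ranges in the domain table (454 - 396 + 1 and 63 - 5 + 1).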

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SMART            SM00338           1.4E-18   392    456  IPR004827    Basic-leucine zipper domain
PROSITE profile  PS50217           10.909    394    457  IPR004827    Basic-leucine zipper domain
Gene3D           G3DSA:1.20.5.170  1.6E-9    396    453  No hit       No description
SuperFamily      SSF57959          4.58E-11  396    451  No hit       No description
Pfam             PF00170           8.7E-10   396    454  IPR004827    Basic-leucine zipper domain
CDD              cd14703           2.49E-23  397    448  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 562 aa
MEGISESRSN MQKLQSPPSN SIPKPQSNLD IPIFNASQMA PSPHTRLSPE NNNNKRPGIP  60
PSHPNYPAAT SPYSQIIGSR SNSQQGAPSH SRSLSQPTFF SLDSLPPWSP PPYREPSVAS  120
LSDPASNDVS MEERVVNSNV RSSLPSPVAR GVNEFRVGES SSLPPRKGHR RSSSDVPLGF  180
SAMIQSSPQL LPIGSRGVLD RSVSGRESSS GVEKPIQLVK RESEWSKDGS SNVEGMSERK  240
SEGDVADDLF NAYMNLDSLE TLNSSGTEDK DLDSRASGTK TYGGESSDNE VESRVNGHPI  300
SMQGMSAGAS NEKGVKRSAG GDIAPTARHH RSVSMDSYMG SLQFDDESSK IPPGSSVDAN  360
SGKFNLELGS SEFSEAEMKK IMENEKLAEI ASVDPKRAKR ILANRQSAAR SKERKMRYIA  420
ELEHKVQTLQ TEATTLSAQL TMLQRDSAGL TSQNNELKFR LQAMEQQAQL KDALNEALAA  480
EVQRLKVTAA ELSGEAHLSS CMAQQLSLNH PMFQLQPQQP QQVNVYQMQQ QQQHQQPQHS  540
QHNQLQTQQQ QNDDPTANES K*
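The numbered sequence block can be cross-checked against the Signature Domain coordinates. The sketch below is a minimal example, assuming the block is pasted exactly as shown in this record: it strips the position counters, spaces and the terminal `*`, then slices residues 396 to 454 using 1-based inclusive coordinates. The helper names `clean_sequence` and `subsequence` are hypothetical and not part of any PlantTFDB tooling.

```python
import re

# Sequence block copied from this record, with position counters and trailing "*".
RAW = """
MEGISESRSN MQKLQSPPSN SIPKPQSNLD IPIFNASQMA PSPHTRLSPE NNNNKRPGIP  60
PSHPNYPAAT SPYSQIIGSR SNSQQGAPSH SRSLSQPTFF SLDSLPPWSP PPYREPSVAS  120
LSDPASNDVS MEERVVNSNV RSSLPSPVAR GVNEFRVGES SSLPPRKGHR RSSSDVPLGF  180
SAMIQSSPQL LPIGSRGVLD RSVSGRESSS GVEKPIQLVK RESEWSKDGS SNVEGMSERK  240
SEGDVADDLF NAYMNLDSLE TLNSSGTEDK DLDSRASGTK TYGGESSDNE VESRVNGHPI  300
SMQGMSAGAS NEKGVKRSAG GDIAPTARHH RSVSMDSYMG SLQFDDESSK IPPGSSVDAN  360
SGKFNLELGS SEFSEAEMKK IMENEKLAEI ASVDPKRAKR ILANRQSAAR SKERKMRYIA  420
ELEHKVQTLQ TEATTLSAQL TMLQRDSAGL TSQNNELKFR LQAMEQQAQL KDALNEALAA  480
EVQRLKVTAA ELSGEAHLSS CMAQQLSLNH PMFQLQPQQP QQVNVYQMQQ QQQHQQPQHS  540
QHNQLQTQQQ QNDDPTANES K*
"""


def clean_sequence(raw: str) -> str:
    """Keep only letters: drops counters, whitespace and the terminal '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = clean_sequence(RAW)
bzip_region = subsequence(protein, 396, 454)
print(bzip_region)
```

The slice should reproduce the Thecc1EG034172t1 row of the bZIP_1 alignment in the Signature Domain section, which is a quick way to confirm that the table coordinates are 1-based and inclusive.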
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  KF048044  0.0      KF048044.1 Firmiana danxiaensis microsatellite Fir_SSR16 sequence.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007017723.1  0.0      PREDICTED: uncharacterized protein LOC18591502
TrEMBL  A0A061FDF4      0.0      A0A061FDF4_THECC; Basic-leucine zipper transcription factor family protein, putative isoform 1
STRING  EOY14948        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM4224              27           57
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G38900.3  1e-122   bZIP family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]