PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG006766t1
Common NameTCM_006766
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family bZIP
Protein Properties Length: 592aa    MW: 64100.7 Da    PI: 6.662
Description bZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG006766t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1bZIP_131.73.4e-10433490562
                       CHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH CS
            bZIP_1   5 krerrkqkNReAArrsRqRKkaeieeLeekvkeLeaeNkaLkkeleelkkevaklkse 62 
                       kr++r   NR +A rs +RK  +i eLe kv++L++e ++L  +l  l+  +  l+++
  Thecc1EG006766t1 433 KRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQ 490
                       9********************************************9998877666665 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM003389.3E-18429493IPR004827Basic-leucine zipper domain
PROSITE profilePS5021710.737431494IPR004827Basic-leucine zipper domain
PfamPF001709.1E-9433487IPR004827Basic-leucine zipper domain
SuperFamilySSF579593.31E-11433484No hitNo description
Gene3DG3DSA:1.20.5.1701.4E-10433489No hitNo description
CDDcd147036.32E-25434483No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0005829Cellular Componentcytosol
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 592 aa     Download sequence    Send to blast
MGDSEEGNTD VMQRIQSSFG TSSSSIPKQP LSMNQLEIPQ LNPNQIRAPR HFSHFGQNFN  60
GGVGDAANKR VGIPPSHPNQ IPPISPYSQI PVSRQMNQQM GSQSFSPGPT HSRSLSQPSS  120
FFSLDSLPPL SPSPFRDCSS VAVPDQICTD VSMEDRDAAS HSLLPPSPFS RGNSPRVGES  180
LPPRKSHRRS NSDIPFGFNT IMQSSPPLIP LRGSGGLERS VSGKENSGVP KPAQLVKKET  240
SWERGADGNA EGMGERKSEG EVVDDLFSAY MNLDNIDALN SSGTDDKNNG TENHEDLDSR  300
ASGTKTNGGD SSDNEAESSV NESGNSALRG GMNSTDKREG IKRSAGGDIA PTGRHYRSVS  360
MDSFMGKLNF GDESPKLPPS PGTRPGQLSP SNSIDGNSAA FSLEFGNGEF SGAELKKIMA  420
NEKLAEIAMS DPKRAKRILA NRQSAARSKE RKMRYISELE HKVQTLQTEA TTLSAQLTLL  480
QRDSVGLTNQ NNELKFRLQA MEQQAQLRDA LNEALTAEVR RLKLATQELG GDSDPSKGMV  540
SQQLSVNHQM FQLHQQQSSQ LNIPHQFQQQ QLPPQPQQQN GNTTAKTESN Q*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017971734.10.0PREDICTED: transcription factor RF2a
TrEMBLA0A061DZL60.0A0A061DZL6_THECC; Basic-leucine zipper transcription factor family protein isoform 1
STRINGEOX978370.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM42242757
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G38900.31e-158bZIP family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]