PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cucsa.322110.1
Common Name Csa_3G006770, LOC101206627
Organism Cucumis sativus
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family bZIP
Protein Properties Length: 406aa    MW: 43332.7 Da    PI: 6.901
Description bZIP family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
Cucsa.322110.1  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    bZIP_1  68.8   8.8e-22  298    360  1          63
                     XXXXCHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH CS
          bZIP_1   1 ekelkrerrkqkNReAArrsRqRKkaeieeLeekvkeLeaeNkaLkkeleelkkevaklksev 63 
                     e+elkr+rrkq+NRe+ArrsR+RK+ae++eL+ ++++L++eN +L++e+++++ e+++l se+
  Cucsa.322110.1 298 ERELKRQRRKQSNRESARRSRLRKQAECDELAHRAEALQEENASLRSEVNRIRSEYEQLLSEN 360
                     89**********************************************************998 PP
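
The identity between the bZIP_1 profile consensus and the query can be read off the alignment above. The sketch below is a minimal illustration, assuming the usual HMMER convention that a letter on the middle line marks a residue identical to the consensus and `+` marks a conservative substitution. The variable and function names are illustrative and are not part of PlantTFDB.

```python
# Alignment rows copied from the bZIP_1 hit above (63 columns, no gaps).
CONSENSUS = "ekelkrerrkqkNReAArrsRqRKkaeieeLeekvkeLeaeNkaLkkeleelkkevaklksev"
MATCH     = "e+elkr+rrkq+NRe+ArrsR+RK+ae++eL+ ++++L++eN +L++e+++++ e+++l se+"
QUERY     = "ERELKRQRRKQSNRESARRSRLRKQAECDELAHRAEALQEENASLRSEVNRIRSEYEQLLSEN"


def count_identities(consensus: str, query: str) -> int:
    """Count columns where the query residue equals the consensus residue."""
    return sum(c.upper() == q.upper() for c, q in zip(consensus, query))


def summarize_match_line(match: str) -> dict:
    """Tally identical (letter), similar ('+') and remaining (' ') columns."""
    return {
        "identical": sum(ch.isalpha() for ch in match),
        "similar": match.count("+"),
        "other": match.count(" "),
    }


identities = count_identities(CONSENSUS, QUERY)
summary = summarize_match_line(MATCH)
percent_identity = 100.0 * identities / len(QUERY)
print(identities, summary, round(percent_identity, 1))
```

Counting identities directly from the consensus and query rows, and separately from the match line, gives a simple cross-check that the three rows were copied consistently.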

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Pfam             PF07777           2.2E-29   4      97   IPR012900    G-box binding protein, multifunctional mosaic region
Pfam             PF16596           2.7E-48   136    262  No hit       No description
Gene3D           G3DSA:1.20.5.170  1.5E-17   291    357  No hit       No description
Pfam             PF00170           1.4E-20   298    360  IPR004827    Basic-leucine zipper domain
SMART            SM00338           1.2E-22   298    362  IPR004827    Basic-leucine zipper domain
PROSITE profile  PS50217           13.38     300    363  IPR004827    Basic-leucine zipper domain
SuperFamily      SSF57959          1.75E-11  301    357  No hit       No description
CDD              cd14702           3.46E-23  303    353  No hit       No description
PROSITE pattern  PS00036           0         305    320  IPR004827    Basic-leucine zipper domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0045893  Biological Process  positive regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 406 aa
MSGSEMEKPP KDRETKTPPP TTTQEQTTTT SAGTVNPDWS GFQAYSPIPP HGFLASSPQA  60
HPYMWGVQHI MPPYGTPPHP YVAMYPPGGI YAHPSMPPGS YPFSPFAMPS PNGVTEASGN  120
TAGSLEGDVK PPEVKEKLPI KRSKGSLGSL NMITGKNNEL GKTSGTSANG AYSKSAESGS  180
EGTSEGSDAN SQNESQPKLG SRQDSLEVEV SQNGNSVHGT QNGGSNTQAM AVIPLATAGA  240
PGVVPGPTTN LNIGMDYWGA SSAIPAMRGK VQSTPVAGGL VTTGSRDSIQ SQLWLQDERE  300
LKRQRRKQSN RESARRSRLR KQAECDELAH RAEALQEENA SLRSEVNRIR SEYEQLLSEN  360
ASLKERLGEV SGNEELRTSR NGQRTNNETT TKTTESEVVQ VGNKN*
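
As a quick consistency check, the sequence listing above can be parsed and compared with the coordinates given under Signature Domain. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that domain coordinates are 1-based and inclusive and that the trailing `*` is a stop symbol counted in the stated length of 406.

```python
import re

# Sequence listing copied from this record, including the running position numbers.
RAW = """
MSGSEMEKPP KDRETKTPPP TTTQEQTTTT SAGTVNPDWS GFQAYSPIPP HGFLASSPQA  60
HPYMWGVQHI MPPYGTPPHP YVAMYPPGGI YAHPSMPPGS YPFSPFAMPS PNGVTEASGN  120
TAGSLEGDVK PPEVKEKLPI KRSKGSLGSL NMITGKNNEL GKTSGTSANG AYSKSAESGS  180
EGTSEGSDAN SQNESQPKLG SRQDSLEVEV SQNGNSVHGT QNGGSNTQAM AVIPLATAGA  240
PGVVPGPTTN LNIGMDYWGA SSAIPAMRGK VQSTPVAGGL VTTGSRDSIQ SQLWLQDERE  300
LKRQRRKQSN RESARRSRLR KQAECDELAH RAEALQEENA SLRSEVNRIR SEYEQLLSEN  360
ASLKERLGEV SGNEELRTSR NGQRTNNETT TKTTESEVVQ VGNKN*
"""


def parse_listing(raw: str) -> str:
    """Strip whitespace and position numbers, keeping residues and '*'."""
    return re.sub(r"[\s\d]", "", raw)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


listing = parse_listing(RAW)    # residues plus the terminal '*'
protein = listing.rstrip("*")   # residues only
bzip_domain = region(protein, 298, 360)

print(len(listing), len(protein))
print(bzip_domain)
```

Under these assumptions the extracted region should reproduce the query row of the bZIP_1 alignment, which is a convenient way to confirm that the listing and the domain table refer to the same coordinate system.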
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    314    320  RRSRLRK
2    314    321  RRSRLRKQ
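
Comparing the NLS rows with the protein sequence suggests that these Start and End values are 0-based, whereas the Signature Domain coordinates are 1-based. This is an inference from the data on this page, not a documented PlantTFDB convention. A minimal sketch of the check, using residues 301 to 360 of the protein sequence and hypothetical helper names:

```python
from typing import Tuple

# Residues 301..360 (1-based) of Cucsa.322110.1, copied from the sequence listing.
FRAGMENT = "LKRQRRKQSNRESARRSRLRKQAECDELAHRAEALQEENASLRSEVNRIRSEYEQLLSEN"
FRAGMENT_START = 301  # 1-based position of FRAGMENT[0] in the full protein


def locate(motif: str) -> Tuple[int, int]:
    """Return the 1-based inclusive (start, end) of the first match of motif."""
    offset = FRAGMENT.find(motif)
    if offset < 0:
        raise ValueError(f"{motif} not found in fragment")
    start = FRAGMENT_START + offset
    return start, start + len(motif) - 1


def to_zero_based(span: Tuple[int, int]) -> Tuple[int, int]:
    """Convert a 1-based inclusive span to a 0-based inclusive span."""
    start, end = span
    return start - 1, end - 1


for motif in ("RRSRLRK", "RRSRLRKQ"):
    one_based = locate(motif)
    print(motif, one_based, to_zero_based(one_based))
```

If the inference holds, the 0-based spans printed here match the Start and End columns of the NLS table, while the 1-based spans are the ones comparable with the domain coordinates.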
Functional Description
Source Description
UniProt  Transcriptional activator that binds to the G-box motif (5'-CACGTG-3') and other cis-acting elements with 5'-ACGT-3' core, such as Hex, C-box and as-1 motifs. Possesses high binding affinity to G-box, much lower affinity to Hex and C-box, and little affinity to as-1 element (PubMed:18315949). G-box and G-box-like motifs are cis-acting elements defined in promoters of certain plant genes which are regulated by such diverse stimuli as light-induction or hormone control (Probable). Binds to the G-box motif 5'-CACGTG-3' of LHCB2.4 (At3g27690) promoter. May act as transcriptional repressor in light-regulated expression of LHCB2.4. Binds DNA as monomer. DNA-binding activity is redox-dependent (PubMed:22718771). {ECO:0000269|PubMed:18315949, ECO:0000269|PubMed:22718771, ECO:0000305|PubMed:18315949}.
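
To make the motif definitions above concrete, the sketch below scans a DNA string for the G-box (5'-CACGTG-3') and for the 5'-ACGT-3' core. The promoter string and the function name are hypothetical and serve only as an illustration; the code reports motif positions and does not model binding affinity. Both motifs are palindromic, so scanning a single strand is enough.

```python
import re

G_BOX = "CACGTG"
ACGT_CORE = "ACGT"


def find_sites(dna: str, motif: str) -> list:
    """Return 0-based start positions of every match, including overlapping ones."""
    pattern = re.compile(f"(?={re.escape(motif)})")  # zero-width lookahead
    return [m.start() for m in pattern.finditer(dna.upper())]


# Hypothetical promoter fragment, invented for this example.
promoter = "TTGACACGTGGCATAACGTCAAT"

g_boxes = find_sites(promoter, G_BOX)
cores = find_sites(promoter, ACGT_CORE)

# An ACGT core lies inside a G-box when it starts one base after the G-box start.
cores_outside_g_box = [p for p in cores if (p - 1) not in g_boxes]

print("G-box starts:", g_boxes)
print("ACGT core starts:", cores)
print("ACGT cores outside a G-box:", cores_outside_g_box)
```

Cores that fall outside a full G-box are candidates for the other ACGT-containing elements mentioned above (Hex, C-box, as-1); telling those apart would need their flanking bases, which this sketch does not attempt.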
Binding Motif
Motif ID  Method  Source                   Motif file
MP00291   DAP     Transfer from AT2G35530  Download
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  LN681860  1e-125   LN681860.1 Cucumis melo genomic scaffold, anchoredscaffold00021.
GenBank  LN713260  1e-125   LN713260.1 Cucumis melo genomic chromosome, chr_6.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_004146103.1  0.0      PREDICTED: transcription factor HBP-1a
Swissprot  Q501B2          1e-169   BZP16_ARATH; bZIP transcription factor 16
TrEMBL     A0A0A0L529      0.0      A0A0A0L529_CUCSA; Uncharacterized protein
STRING     XP_004146103.1  0.0      (Cucumis sativus)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF4017              32           56
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G35530.1  1e-158   basic region/leucine zipper transcription factor 16
Publications
  1. Ren Y, et al.
    An integrated genetic and cytogenetic map of the cucumber genome.
    PLoS ONE, 2009. 4(6): p. e5795
    [PMID:19495411]
  2. Guo S, et al.
    Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types.
    BMC Genomics, 2010. 11: p. 384
    [PMID:20565788]
  3. Li Z, et al.
    RNA-Seq improves annotation of protein-coding genes in the cucumber genome.
    BMC Genomics, 2011. 12: p. 540
    [PMID:22047402]
  4. Ezer D, et al.
    The G-Box Transcriptional Regulatory Code in Arabidopsis.
    Plant Physiol., 2017. 175(2): p. 628-640
    [PMID:28864470]