PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG85980.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family bZIP
Protein Properties Length: 1682aa    MW: 170770 Da    PI: 5.0049
Description bZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG85980.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1bZIP_134.35.1e-11825882259
                 XXXCHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH CS
      bZIP_1   2 kelkrerrkqkNReAArrsRqRKkaeieeLeekvkeLeaeNkaLkkeleelkkevakl 59 
                 +++k++ r+ +NRe+A+ sRqRKk +++ Le++  ++ a   +L  ++ +l+    +l
  GBG85980.1 825 EADKKQARLLRNRESAQLSRQRKKVYVDDLENRLRTMAATIAELNATIAHLTADNVAL 882
                 689************************************9999999999998766555 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1682 aa     Download sequence    
MEEVCLEKQE GEGDAFRFCA AELSQTDGDR DRDGDARESI GAAPSAAEAV AVGVPADDLL  60
SFAETMISCP EEDGDLLSMG FADGFSEPPA AAAAAAEVVA STVDISRQFE LMEKGEGDEG  120
DDLWGIMEAV DMSPEGDAYG MDDEWLCLLA DSASAGVEDE QRSSLVPLTN PLDSGTEDKP  180
MDIEGEGGDL RSEGFVGSHL VNNASLLTAA SGSSLCSSLP SRDSCLGICI PVDPAPSRVI  240
DGDVLYDGSG ARTVSPAGIA PQPVRVVGGG GDAAPLNPPP DVSSAPVKFP PSPSLVPHAT  300
VDGLRFDGDP KGATATAAGK GGTSAVRSST AEAQNCKLST DSAEARNCKI LTDCVERPVP  360
VISPPEQPLY GSDSSSRHER DCSEAEENAL SGRAESPREC SQIARKTNAD DALVDERMGD  420
AAAADGSKNS SLEAVHLVAS DKSAPDPRSS GSDFLSTAAL ASDLTCAQSS METAAQSSMV  480
IATSRHLLAP SDGGTSQSPA VEQGRVAAFQ GGLGAHSFSP CMLPDSAAAA PRWSMTRAAA  540
ACTGELVDTA SPRSGRSVAC ALPPAAGASP QSPPAQDTSG KDAMVRRGDG AAGGGEKTTA  600
QGLPGKFPPS ETSAAGSGDG FVNDTRIRAP LATKATSAKS GRGGGAQGAG YGMAKRSNDL  660
SAAAAGGGER RTSPASSERN ASSPETCRSG GGSAVGATSS LCSRDSGAGD DSGNDEQQFS  720
AHGDERHAGP WQERECSRGI GLDRDQQQLE QQQQQQQEQQ QQQHQQQRED RRSSSSSSTS  780
VLNPSSPEGG EGVGQGEIAT AMVDYNDGGG GGENSDEGVK SANGEADKKQ ARLLRNRESA  840
QLSRQRKKVY VDDLENRLRT MAATIAELNA TIAHLTADNV ALRRQLGYFY PAPPGGGAPP  900
AGIAAAGGPI IAGGGVGSMC GAAQVAHMYG GSGAPMMLPR HPGVMPPLGV ASMFGHGGQA  960
PPIPIPRWPS HRAAAAASAA ATGSKKQKGK RSASREAEKE ASTSASAATA TAAAPTSDRE  1020
KKRTKRDSRA VLTLSLFCFV LVFVPFGGLW DWSPLSPFAA LPESRGGTYI PPGGRGGGGV  1080
GVGSEHDGGR ILVDSGGGGG GRPPGGRVLM GLGEESGPGD PPGDRREEGE GRQDEGGEMW  1140
RRGNATILQP PVPLERAMFL SGGHSPTVKP SSSSARDAAE KESLLILNRT VIGRNTEIVP  1200
ASLIVPRGHE IVRVYGNLII QSVMAGDKAA WGHREGNGEA KGGGGREGGG MPGNAMSDTD  1260
RHQLKAMASL SINGGGDGGA VAMNYPTSGN ALPASASERS GQGQSSAGGR PVAPGQLGSS  1320
SGSGLKGGEE RRGQASSRRH HHHLDHYPSH HHHYENTWIF GPGGLSGPVL TSGVCTELFQ  1380
FETSSSSIVP DVVVGTSTEE EEAGQGGAGA GGSGTISWKK AMEGRVAAMA ATEAARRAAA  1440
VGSGGSVRMP TTGTIEKAEA GNGDDLNLTE AMGQTQGKHG YYDYGRKGGR CVKDKRVKPL  1500
PSASAGVNAT AVAMEKVANG GRRANGSQYE DRCTREDALA DVRVGARSRG GGEVGVNGSD  1560
RRSMEGEEDP TGRSFGYGGS RTDKSLPSTM VVSVLLPSMQ PGEDSAAGGG RGGGGSSTGK  1620
NGGSGSGPGK LTQIFVVVLV NSVKYVTYSC LLPTLSATAA AMAGPTAAAG AAAAAAAGAG  1680
VL
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
110731079GGRGGGG
212421248GGRGGGG
312431249GGRGGGG
412441250GGRGGGG
516081614GGRGGGG
616091615GGRGGGG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G40950.11e-17bZIP family protein