PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG74331.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family bZIP
Protein Properties Length: 1076aa    MW: 108867 Da    PI: 7.5756
Description bZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG74331.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1bZIP_138.72.1e-12849895147
                 XXXXCHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH CS
      bZIP_1   1 ekelkrerrkqkNReAArrsRqRKkaeieeLeekvkeLeaeNkaLkk 47 
                 e+ lkr++r+++NRe+A  sRqRKk +++eLe k++ Le+    L+ 
  GBG74331.1 849 EEVLKRQKRLIRNRESACQSRQRKKSYQKELEAKCQMLEGAVAHLRH 895
                 5679***********************************98888875 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1076 aa     Download sequence    
MIECGSLEPS PASSTVAAAT AALQNYQLCC GSPIAMVGAE WGFQMLMCEE SEAANDACLA  60
RIAAAGAPAG PPASSTAPSL CQSAAAAALV SADGNNLFPS CAAAAGAGAD QQRIVGPQQS  120
QLEHRSGGGR DRDHGGRTVG SGFEAELRYC RSGWGEGEAV FGSSSRGGGG GGGGGGGGGG  180
RGMRSEAGKA RSSAAQPEVK PAMRAEERSP CSRSSSIHET GDNFIFMDTM RTDGELREQS  240
AEQGPNLLTD LDFEEFPPLE PLPRLSIAEA GGWGFPDFAG SVGGGLGGAF LQAPMAEQQQ  300
QQLTAAAAAN EWRGESAGAG GGTRSRSRCA GNPGIDEGRR TNLSAAAGAS AAACHAAKAQ  360
QQQREAAHVG SGGGGGGDIA CGGRGQQRNA DGSPSVSSCI SVDGRRAEAP GGALRRRAGA  420
AGAGAAGGRA AGGRGSREYH SRGIGGGGGG RGVSEGRSFI GEGREGTPES THRESAAVSS  480
EHRGGGGVRV MSPLGASQVA PSSSANENAS SQAVNSTVSA AMYYNSPTAL RSLDDVAAAG  540
ADTSTGKVVP VGQQQQREAA MERGEENGGS MTRLLGYGDD SGKQAQGRRG GGEDEEESGA  600
RGVAQADLTQ AQQQQQQLQI PHVHVHKGTS AMDWEMYMNL SGSSAVAMQG VRGVGEGGGG  660
GGEGGGEGGG GPSPCADVAA YRRGIVDLSV TSGMRAVIPE AGCWLGGQMA MNGREVGGGA  720
SAVAVQIGAA ACATPGLSAG LGAVATGIPA PAVGGGAAGA AQGGGGGSVL GRRKSRSWLD  780
HKQKHAQAAQ AAQAAALASA AASVAAIGAA DSGLGGVGSD PDGAGGNKSS GTSGIDNADA  840
SAPRSSFEEE VLKRQKRLIR NRESACQSRQ RKKSYQKELE AKCQMLEGAV AHLRHAMAIT  900
AMENSVLRDE LARFKQAGGD GAGGAKPAAL FSDSLPLESQ SHRTCLSRVQ PVVSRSRRLQ  960
LVAYLCQEFL LVLHVVLLGA LVLQVGLPSP RQFRCPLMDL WLIKPKVPAA IEEEQWKAMR  1020
KMTLRRDWHR WWSAIPAATG SALLSPYMGA SFRAVHTVVP NRQKGHEGTL MIPVCC
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1166181RGGGGGGGGGGGGGGR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G56660.14e-09bZIP family protein