PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG87806.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family GATA
Protein Properties Length: 1529aa    MW: 158452 Da    PI: 8.0313
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG87806.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA60.32.4e-1910831117135
        GATA    1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35  
                  C+ Cgt+kTplWR+gp+g+k+LCnaCG++y+k g+
  GBG87806.1 1083 CVECGTSKTPLWRNGPRGPKSLCNACGIRYKKAGK 1117
                  ********************************886 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1529 aa     Download sequence    
MMCSSRRRLE DNGKEEEEEE EEGEEGEGKL HSHGGNALHT QSKALVSISI AYKRVILSLR  60
APVRRYARVR REEWVIAKKR RRRCEEEGRQ GGGGGGGGGG GEERREGWLV AISNDVSLNI  120
HRHRAAACLS GSRALSEMDR ELRTVGPSSN GGAPRGTFLN DADDNPIAET IHQGGEGGRG  180
TEALSAESRF MTFGKALNCV LASWIAAQRR IAAGTYQLAN MDRKKVIMGV QKESGAEGGI  240
GVGGVGGVGV GGVGGGQEQP SLGGCSGVSK VSATMATVGE SWTLTSTGCH VPTRRLSDRN  300
TKAGAAVFPS LLLEGGPVDV NGKPSEGNIV DGGEQQWKVC APCTSSGSVA AAAGAAAAAA  360
TAMVATVTAA GAGAAGVGVG VGVAGGSRAS LKLEFSGSHH RSHPLYDQSH QSHTIFRHPA  420
KGPSSRRSTP GSLCHHHHHH NYLPHLPLRA TATAPPPPPP PPLSPPLPSP SVPSAPGASA  480
VPPPPPHSYH SFSLHQPHIQ PHCLLPSTSA PACRSLHASV PSPSLAATME TTTSTATAKR  540
KTPGDSLLPA AEQAPFVAVV VPSPIMAQGN SKLTMVGTIP NSIPLSENCP NGTNSRDRDQ  600
SDSEATIPMV VSCSEDDDED TEDDEEVQKV QAERQKQVLS ANNVSHCAPE AAALQGWVTC  660
YCMPNAPNAE AGAKVPAAAG EMSSDAAAQM SAAKSKGRGS GAFKAHNDID LAALTAARLK  720
ERGGGRAIKG NGERLRRRRA PPIKEEEEEE DDDNDDDEDD DDDDDDDDDE EMDKEKTECK  780
KGAYYARRRL VKRGNNKIAE TTATGLRMTV RGGAGTGRTA RVYGQPSVTK RHNKSAPMSG  840
GAPHLGNLHG PPGGSAMGTV AATNQQNVTR SATVMAAMSA AKDPPSRQGP EAAQGMRRGE  900
GADRGAEAGL VRGSFLSDSD ATGGGGLPRK VGGSRSSVTG TRRSPNVVLN GPGAMGSCAG  960
RTGVGGGRRG VGGETCHPSR SASSSWSLQG CNISAERSPA SDRNADREGN TGEPMVLDDD  1020
GERRGTTPRD RRHNQDVDDD RGTSACHGGD LKRGCNAAGV MGGGSEAGSP PNNKPGCGGP  1080
RVCVECGTSK TPLWRNGPRG PKSLCNACGI RYKKAGKLRA LEGKGMDSNE PIIIPPGGRA  1140
NRTNKKPVGG KKCKPSDNQE VTPASATLAT PGPVHGSEET AAVAVAIGKP QDGPVVAGMR  1200
KRTSAASVEC PQQSQAMRKG NSQNASALEG VAKPRGILWA GSTQIKKGLF LADPPGCDGS  1260
RGGGGGGLPG GGGGAGGERG AGGAGGRGAE GGQSAATGDQ RRDVREDVPP AENVSPSKNG  1320
LAKPNCMPKN DCPSRNDFLS KKDLQPKTDC PSKNDSSLKS GLLSRNGLSW TERDEEDERV  1380
DDDEGRIDRT GSSADSCVTL QGSPSTASWP LKTKWKKFAR SEAMAAEALF PRVSSPPVPT  1440
PADLRSRCME PPGNWGGKKR ARVVDRVGES APGIAPQHAL GSDCEDQKAK PRMEFDFLQE  1500
TDSEGHGMTD EVQGAILLMA LFKGCPCPA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
17883KKRRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G18380.16e-14GATA family protein