PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG83770.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family GATA
Protein Properties Length: 994 aa    MW: 103218 Da    pI: 8.3142
Description GATA family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG83770.1       genome    NCBI      View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    54.7   1.3e-17  655    687  1          33
        GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkk 33 
                 C+ Cgt kTplWR+gp+g+k+LCnaCG++++k 
  GBG83770.1 655 CVGCGTMKTPLWRSGPSGPKSLCNACGIRHKKA 687
                 999****************************96 PP
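
GATA-type zinc fingers are commonly described by the spacing C-X2-C-X17-20-C-X2-C, in which four cysteines coordinate the zinc ion. As an illustration, the Python sketch below checks the aligned segment above against that pattern. The regular expression and the names `GATA_ZNF` and `loop_length` are assumptions made for this example, not a PlantTFDB definition.

```python
import re
from typing import Optional

# Residues 655-687 of GBG83770.1, taken from the alignment above.
DOMAIN = "CVGCGTMKTPLWRSGPSGPKSLCNACGIRHKKA"

# Assumed consensus spacing for a GATA-type zinc finger:
# C-X2-C-X(17..20)-C-X2-C, where X is any residue.
GATA_ZNF = re.compile(r"C.{2}C(.{17,20})C.{2}C")

def loop_length(seq: str) -> Optional[int]:
    """Return the length of the loop between the two cysteine pairs, or None."""
    match = GATA_ZNF.search(seq)
    return len(match.group(1)) if match else None

print(loop_length(DOMAIN))
```

For this segment the pattern matches with an 18-residue loop between the two cysteine pairs.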

Sequence
Protein Sequence    Length: 994 aa
MVEGECKGKA PDNLPRGSSR VVTSMDKEID TAEDSRLTVC HAGMHGIGDR DRCNGDLEEE  60
EEEEEGGNAD SKKKVSCDGS VSKSENATPS CGAAEGTYEG YGSPFKDRSH VLKGVCNTGK  120
EVGMNPAAGA VSMTGVMRGL CNPMKGPSSR RTGSRPRQTI AAPGLGERLV LLNRTTREMG  180
AQAIAGTGAT TGTGSRTGTL TGTASGISSG TKTGTGTSSG TGTASGPGTR TGTVAGNESG  240
NGMMVYEVRA VRRNLESCAS QQARQSGQAG AALATRGATS ILTGSESASD TAASENRGAG  300
RDLASGASHQ VKQRVGNVEG GCDVKEERTL GDDVDGGGAV DSGADVAVQR VAAEPTQQRV  360
LGLESNASVL ARSIDCGTVA RLGESGECTA VEGVWSKGEC SGVAKLGEDH LQMSRVAVMR  420
GVGNGGVKVN GHPHPFAAGM KLGADVAREA EGVGRCERGS GADAEMTRGL PQFAWMESFG  480
AGVSVQKGSE AGYKGGMAVA GAPLPVDGAQ GKVPPVVACA VKGRVQLSWR DSPCLMGSGR  540
VGGKAGKGGG SAFVDIASRR CGGESRSSSE RVSERGPSCR LGSWSKWRAD AEAQGCARRE  600
VADAGCEMQD HELELQRDRE LEMRRMSNGN QSGSSNESTV GEFEESKPGA ASRACVGCGT  660
MKTPLWRSGP SGPKSLCNAC GIRHKKASKL RASMSGEDNT EPMIRITLAA KSIKSGPRKR  720
KASAREQSVN TVELPVEHHV SGSVEVCASV ATDSTAPVEG CCPSSSPQSQ SDHFPLKNKM  780
WMGMSARVMK KGRTVVVDPE RDASRAMAGD ERCRSSSCSE AGEGDHTRSS ADSCVTLQGS  840
PPSMVSQLMK GAGCNAAAAA AYMKTSPSST AVMGLFPRVS SGQGRPDTHC VEPPIGWGGR  900
RARAGERMSH KDWARKRKSS LEGPEIAEDT GVRTKLGGRG PSDEVEGAIL LMTLFKGCPV  960
RTTRQMPRKR DFPAKMDKRV EEQTTWYGAD LVWS
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    542    550  GGKAGKGGG
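
The Start and End columns in the Signature Domain and NLS tables appear to be 1-based, inclusive positions in the protein sequence. A minimal Python sketch, assuming that convention, recovers both segments from the formatted sequence block. The helper names `clean_block` and `slice_1based` are illustrative and not part of PlantTFDB. Only residues 541-720 are pasted here, so an offset of 540 is passed.

```python
import re

# Residues 541-720 of GBG83770.1, copied from the sequence block above
# (ten-residue groups followed by a running position count).
BLOCK = """
VGGKAGKGGG SAFVDIASRR CGGESRSSSE RVSERGPSCR LGSWSKWRAD AEAQGCARRE  600
VADAGCEMQD HELELQRDRE LEMRRMSNGN QSGSSNESTV GEFEESKPGA ASRACVGCGT  660
MKTPLWRSGP SGPKSLCNAC GIRHKKASKL RASMSGEDNT EPMIRITLAA KSIKSGPRKR  720
"""

def clean_block(text: str) -> str:
    """Drop whitespace and position counts, leaving only residue letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()

def slice_1based(seq: str, start: int, end: int, offset: int = 0) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the number of residues that precede `seq` in the full protein.
    """
    return seq[start - 1 - offset : end - offset]

fragment = clean_block(BLOCK)  # 180 residues covering positions 541-720
gata = slice_1based(fragment, 655, 687, offset=540)  # signature domain
nls = slice_1based(fragment, 542, 550, offset=540)   # predicted NLS
print(gata)
print(nls)
```

Under this assumption the two slices reproduce the domain segment shown in the alignment and the NLS sequence listed in the table, which supports reading the coordinates as 1-based and inclusive.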
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G06740.1  9e-14    GATA family protein