PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bobra.4_1s0081.2.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Elliptochloris clade; Botryococcus
Family GATA
Protein Properties Length: 1046aa    MW: 112162 Da    PI: 8.571
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bobra.4_1s0081.2.pgenomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA42.87e-14173206134
                GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkg 34 
                         C +Cg  ++p WRrgp  +++LCnaCG+++  kg
  Bobra.4_1s0081.2.p 173 CDHCGRRNSPAWRRGPADKPHLCNACGVRFLGKG 206
                         9***************99999********97766 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1046 aa     Download sequence    
MAAQSTLQNR KSSQMLIRVI SSSPCCLQRL MVHFIKQQNE QGPLGLREVA TGKSKHTDLT  60
CQVWRSPRSP VAVAQGTNNG FGQLELGFIS ATKGTAKTLS SPASIAGDDA KRSLGRTTGE  120
TERATPAHSD PTDEHCTSGQ AGRGSQLNRK RRRKETPICI LHSTGGASAG GACDHCGRRN  180
SPAWRRGPAD KPHLCNACGV RFLGKGTLEG YMPGMKQRED FKLPDDNLCG ALDEEGGDDT  240
DSAKTTMPQL SGKTRCRSQR TPRRNTTPRS DMVSGPPAVA DAESDDGRAA SCYSCWADCV  300
RMALSHHPGN ARKVVSFLEN ACGLAMAEPL SSPHFLAELN RVRDLYWMDK NRAMDALKHL  360
LEDEAALTLS GAEALLTLSG ASPMPDDRKR DALCSTEDVN SRDSSWAVCE ECGRARETVC  420
WLPVWWRYLC GDQDYSKNPE LTCKDVGDFV APRGLPRSEV SPVIRRGIRG QGPQNRFLPA  480
SVGRSNARRT SGVADMSTEG FKVVDNPVYD GYGSPKSPRH KPFTGLRSSF TDSLSASDFE  540
FLRWLPQGWR MDFGWQRLGE PGRVSPAGLN YFAPDGSGPY QSQEDVLARI AAQPSFRPPQ  600
PGATAATFTP GPAFHRSPSG SVLVSMCGGA VSTTGEAPPG AIHKDTPMVQ AEMQSYILQA  660
ETPITEAPHL ARSEVSLTHL PSAAGEPSSE PSPLSAKEPA KGGPRSKKAR TKRSGPSHPR  720
QRQAKTFEVD GDFCFGVREL VPNTFELGGW PRDLQQSINV VYKPAKAAID GTVPDAALRG  780
QGKMTPPQGL LQKEGMRELE PAQAHAGSND KGRATQTTLV KHKSEPVGKL HGQALTELHG  840
PPSLLPPPRL LPPARPALGM TDELVFCPVE GPPLIVATPE GGSVANPRGL LRAVSDPSSA  900
AKAYAANALE QAGASPEPHD LKEVDWKGVL KVSVGGTEPD EICALVAKLP ASDAEQLPST  960
LLISSFKPRA EIFMANVGSK PVLLRLATCA EASKKIIRKM EEDKLAAIAH LKAFDFALIP  1020
HTNQKNGLHI VGLKLLSKEF SGSTS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1149153RKRRR
2149155RKRRRKE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G17570.32e-11GATA family protein