PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID TRIDC5BG020600.3
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family GATA
Protein Properties Length: 485 aa    MW: 52564.8 Da    pI: 9.8552
Description GATA family protein
Gene Model
Gene Model ID       Type      Source           Coding Sequence
TRIDC5BG020600.3    genome    EnsemblPlants    View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    35.4   1.5e-11  149    183  1          35
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +C++ +Tp+ R+gp g  tLCnaCG+ y k+g+
  TRIDC5BG020600.3 149 CLHCKAVETPQRRSGPMGRGTLCNACGVWYSKNGT 183
                       99******************************997 PP

2    GATA    32.5   1.2e-10  239    273  1          35
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +Cg+++ plW +g+ g +++C aCG++y+k +l
  TRIDC5BG020600.3 239 CLHCGSSEPPLWIEGSMGRREVCTACGMRYKKGRL 273
                       99******************************986 PP

3    GATA    55.1   1e-17    343    377  1          35
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C++Cg+++Tp+WR+gp g  tLCnaCG++yr  +l
  TRIDC5BG020600.3 343 CQHCGSSETPQWREGPKGRATLCNACGVRYRQGRL 377
                       *******************************9886 PP
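
The three domain hits above can be checked against the four-cysteine zinc-finger spacing commonly described for GATA domains (C-X2-C-X17-20-C-X2-C). The following is a minimal sketch, not part of PlantTFDB: the regular expression and the helper name `loop_length` are assumptions introduced here for illustration, and the sequences are copied from the query rows of the alignments above.

```python
import re

# Domain sequences copied from the query rows of the three alignments above.
DOMAINS = {
    1: "CLHCKAVETPQRRSGPMGRGTLCNACGVWYSKNGT",  # residues 149-183
    2: "CLHCGSSEPPLWIEGSMGRREVCTACGMRYKKGRL",  # residues 239-273
    3: "CQHCGSSETPQWREGPKGRATLCNACGVRYRQGRL",  # residues 343-377
}

# Assumed pattern: C-X2-C-X(17..20)-C-X2-C, matched from the domain start.
GATA_FINGER = re.compile(r"C.{2}C(.{17,20})C.{2}C")

def loop_length(seq):
    """Return the number of residues between the two cysteine pairs, or None."""
    m = GATA_FINGER.match(seq)
    return len(m.group(1)) if m else None

for no, seq in DOMAINS.items():
    print(no, len(seq), loop_length(seq))
```

Under these assumptions, each of the three domains starts with a C-X2-C pair followed by an 18-residue loop and a second C-X2-C pair.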

Sequence
Protein Sequence    Length: 485 aa
VAAAVHGDGG GRRGRRAEQP PGLGRGRCPR FRRSEATRDM AGGGGAGGKE DGGTEPLLLS  60
VLALPATALP AVVSRLEAAV PRKARSYLPR NVPSAWWSLR IPFIQPLPPA GDPANEEEGR  120
RFPRPQRVQV APSLDPGTAD KPPKRLKRCL HCKAVETPQR RSGPMGRGTL CNACGVWYSK  180
NGTLPEHLPV SSPIVDSPLE NPIWEPEVPG AIYLVRKSAT ERMPPRTEAA PAPRPGTSCL  240
HCGSSEPPLW IEGSMGRREV CTACGMRYKK GRLLPECRPA ECSVTDSRQE SPVINSPPES  300
PIWEPEAPPS VHLPRKPSKK KKRRRSRSEA PSAPWPANKG KRCQHCGSSE TPQWREGPKG  360
RATLCNACGV RYRQGRLLPE YRPMASPTFV PTKHANSHRK VLQLHRTRQS NDEHPSPLPA  420
DSVTNLPPIR DELPTTSTAG LASEDPTDAP GYTDNPINVP SSLDSLLLDG PSAPLIVESE  480
DFAIS
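
The start and end coordinates in the Signature Domain table are 1-based and inclusive, so they can be cross-checked against the sequence block above. The sketch below is one way to do that, offered as an illustration only: the names `RAW`, `protein`, and `region` are hypothetical, and the sequence text is copied verbatim from this record, position counters included.

```python
# Sequence block copied verbatim from the record, with spacing and counters.
RAW = """
VAAAVHGDGG GRRGRRAEQP PGLGRGRCPR FRRSEATRDM AGGGGAGGKE DGGTEPLLLS  60
VLALPATALP AVVSRLEAAV PRKARSYLPR NVPSAWWSLR IPFIQPLPPA GDPANEEEGR  120
RFPRPQRVQV APSLDPGTAD KPPKRLKRCL HCKAVETPQR RSGPMGRGTL CNACGVWYSK  180
NGTLPEHLPV SSPIVDSPLE NPIWEPEVPG AIYLVRKSAT ERMPPRTEAA PAPRPGTSCL  240
HCGSSEPPLW IEGSMGRREV CTACGMRYKK GRLLPECRPA ECSVTDSRQE SPVINSPPES  300
PIWEPEAPPS VHLPRKPSKK KKRRRSRSEA PSAPWPANKG KRCQHCGSSE TPQWREGPKG  360
RATLCNACGV RYRQGRLLPE YRPMASPTFV PTKHANSHRK VLQLHRTRQS NDEHPSPLPA  420
DSVTNLPPIR DELPTTSTAG LASEDPTDAP GYTDNPINVP SSLDSLLLDG PSAPLIVESE  480
DFAIS
"""

# Keep letters only: this drops spaces, newlines, and the position counters.
protein = "".join(ch for ch in RAW if ch.isalpha())

def region(start, end):
    """Return residues start..end (1-based, inclusive) of the protein."""
    return protein[start - 1:end]

print(len(protein))
for start, end in [(149, 183), (239, 273), (343, 377)]:
    print(start, end, region(start, end))
```

The three slices should reproduce the query rows of the GATA domain alignments, and the cleaned sequence length should agree with the stated 485 aa.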
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    319    324  KKKKRR
2    319    325  KKKKRRR
3    320    326  KKKKRRR
4    320    325  KKKRRR
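
The four NLS predictions listed above overlap one another. As a minimal sketch (a plain interval merge, with `merge_spans` being a hypothetical helper and not a PlantTFDB function), they can be collapsed into a single basic stretch:

```python
# (start, end) pairs copied from the NLS table, 1-based and inclusive.
NLS_HITS = [(319, 324), (319, 325), (320, 326), (320, 325)]

def merge_spans(spans):
    """Merge overlapping inclusive intervals into a sorted list of spans."""
    merged = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous span: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

print(merge_spans(NLS_HITS))
```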
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G32890.1  3e-30    GATA family protein