PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC5BG020600.4
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family GATA
Protein Properties Length: 495aa    MW: 53474.8 Da    PI: 9.2528
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC5BG020600.4genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA35.41.5e-11159193135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +C++ +Tp+ R+gp g  tLCnaCG+ y k+g+
  TRIDC5BG020600.4 159 CLHCKAVETPQRRSGPMGRGTLCNACGVWYSKNGT 193
                       99******************************997 PP

2GATA32.51.2e-10249283135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +Cg+++ plW +g+ g +++C aCG++y+k +l
  TRIDC5BG020600.4 249 CLHCGSSEPPLWIEGSMGRREVCTACGMRYKKGRL 283
                       99******************************986 PP

3GATA551.1e-17353387135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C++Cg+++Tp+WR+gp g  tLCnaCG++yr  +l
  TRIDC5BG020600.4 353 CQHCGSSETPQWREGPKGRATLCNACGVRYRQGRL 387
                       *******************************9886 PP

Sequence ? help Back to Top
Protein Sequence    Length: 495 aa     Download sequence    
LLSVILSSAA AARSQPPSMA TAEEGEGGEP PGLGRGRCPR FRRSEATRDM AGGGGAGGKE  60
DGGTEPLLLS VLALPATALP AVVSRLEAAV PRKARSYLPR NVPSAWWSLR IPFIQPLPPA  120
GDPANEEEGR RFPRPQRVQV APSLDPGTAD KPPKRLKRCL HCKAVETPQR RSGPMGRGTL  180
CNACGVWYSK NGTLPEHLPV SSPIVDSPLE NPIWEPEVPG AIYLVRKSAT ERMPPRTEAA  240
PAPRPGTSCL HCGSSEPPLW IEGSMGRREV CTACGMRYKK GRLLPECRPA ECSVTDSRQE  300
SPVINSPPES PIWEPEAPPS VHLPRKPSKK KKRRRSRSEA PSAPWPANKG KRCQHCGSSE  360
TPQWREGPKG RATLCNACGV RYRQGRLLPE YRPMASPTFV PTKHANSHRK VLQLHRTRQS  420
NDEHPSPLPA DSVTNLPPIR DELPTTSTAG LASEDPTDAP GYTDNPINVP SSLDSLLLDG  480
PSAPLIVESE DFAIS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1329334KKKKRR
2329335KKKKRRR
3330336KKKKRRR
4330335KKKRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G32890.12e-30GATA family protein