PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC5BG020600.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family GATA
Protein Properties Length: 406aa    MW: 44695.9 Da    PI: 9.5056
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC5BG020600.1genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA35.71.2e-1170104135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +C++ +Tp+ R+gp g  tLCnaCG+ y k+g+
  TRIDC5BG020600.1  70 CLHCKAVETPQRRSGPMGRGTLCNACGVWYSKNGT 104
                       99******************************997 PP

2GATA32.89.6e-11160194135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +Cg+++ plW +g+ g +++C aCG++y+k +l
  TRIDC5BG020600.1 160 CLHCGSSEPPLWIEGSMGRREVCTACGMRYKKGRL 194
                       99******************************986 PP

3GATA55.48.4e-18264298135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C++Cg+++Tp+WR+gp g  tLCnaCG++yr  +l
  TRIDC5BG020600.1 264 CQHCGSSETPQWREGPKGRATLCNACGVRYRQGRL 298
                       *******************************9886 PP

Sequence ? help Back to Top
Protein Sequence    Length: 406 aa     Download sequence    
VPRKARSYLP RNVPSAWWSL RIPFIQPLPP AGDPANEEEG RRFPRPQRVQ VAPSLDPGTA  60
DKPPKRLKRC LHCKAVETPQ RRSGPMGRGT LCNACGVWYS KNGTLPEHLP VSSPIVDSPL  120
ENPIWEPEVP GAIYLVRKSA TERMPPRTEA APAPRPGTSC LHCGSSEPPL WIEGSMGRRE  180
VCTACGMRYK KGRLLPECRP AECSVTDSRQ ESPVINSPPE SPIWEPEAPP SVHLPRKPSK  240
KKKRRRSRSE APSAPWPANK GKRCQHCGSS ETPQWREGPK GRATLCNACG VRYRQGRLLP  300
EYRPMASPTF VPTKHANSHR KVLQLHRTRQ SNDEHPSPLP ADSVTNLPPI RDELPTTSTA  360
GLASEDPTDA PGYTDNPINV PSSLDSLLLD GPSAPLIVES EDFAIS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1240245KKKKRR
2240246KKKKRRR
3241247KKKKRRR
4241246KKKRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G32890.19e-31GATA family protein