PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID TRIDC5AG020080.5
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family GATA
Protein Properties Length: 390 aa    MW: 42958.8 Da    pI: 9.3943
Description GATA family protein
Gene Model
Gene Model ID     Type    Source         Coding Sequence
TRIDC5AG020080.5  genome  EnsemblPlants  View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    35.8   1.1e-11  54     88   1          35
              GATA  1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35
                      C +C++ +Tp+ R+gp g  tLCnaCG+ y k+g+
  TRIDC5AG020080.5 54 CLHCKAVETPQRRSGPMGRGTLCNACGVWYSKNGT 88
                      99******************************997 PP

2    GATA    32.4   1.3e-10  144    178  1          35
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +Cg+++ plW +g+ g +++C aCG++y+k +l
  TRIDC5AG020080.5 144 CLHCGSSDPPLWIEGSMGRREVCTACGMRYKKGRL 178
                       99******************************986 PP

3    GATA    55.5   7.9e-18  248    282  1          35
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C++Cg+++Tp+WR+gp g  tLCnaCG++yr  +l
  TRIDC5AG020080.5 248 CQHCGSSETPQWREGPKGRATLCNACGVRYRQGRL 282
                       *******************************9886 PP
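
The three query segments above can be checked for the cysteine spacing usually associated with GATA-type zinc fingers. The sketch below is illustrative only: the C-X2-C-X18-C-X2-C pattern is an assumption brought in for the example (the record itself does not state a spacing), and the helper names are hypothetical.

```python
import re

# Query segments copied from the three domain alignments above,
# keyed by their (Start, End) coordinates on the protein.
SEGMENTS = {
    (54, 88): "CLHCKAVETPQRRSGPMGRGTLCNACGVWYSKNGT",
    (144, 178): "CLHCGSSDPPLWIEGSMGRREVCTACGMRYKKGRL",
    (248, 282): "CQHCGSSETPQWREGPKGRATLCNACGVRYRQGRL",
}

# Assumed zinc-finger spacing (not stated in the record): C-X2-C-X18-C-X2-C.
ZINC_FINGER = re.compile(r"C.{2}C.{18}C.{2}C")


def has_zinc_finger(segment):
    """Return True if the assumed cysteine spacing occurs in the segment."""
    return ZINC_FINGER.search(segment) is not None


def segment_length_ok(coords, segment):
    """Return True if End - Start + 1 equals the segment length."""
    start, end = coords
    return end - start + 1 == len(segment)


if __name__ == "__main__":
    for coords, seg in SEGMENTS.items():
        print(coords, segment_length_ok(coords, seg), has_zinc_finger(seg))
```

Each segment spans 35 residues, matching HMM positions 1 to 35, so no gaps are expected in these alignments.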

Sequence
Protein Sequence    Length: 390 aa
WWSLRIPFIQ PLPPAGDPAN EEEGRRFPRT QRVQVAPSLD PGTADKPPKR LKRCLHCKAV  60
ETPQRRSGPM GRGTLCNACG VWYSKNGTLP EHRPVASPIV DSPLESPIWE PEVPGAIYLV  120
RKSATERMPP RTEAAPAPRP GTSCLHCGSS DPPLWIEGSM GRREVCTACG MRYKKGRLLP  180
ECRPAECSVT DSQQESPVIN SPPESPIWEP EAPPSVHLPR KPSKKKKRRR SRSEAPSAPW  240
PANKGKRCQH CGSSETPQWR EGPKGRATLC NACGVRYRQG RLLPEYRPVA SPTFVPTKHA  300
NTHRKVLQLH RTRQSNDEHP SPLPADSVTN LPPIRNELPT TSTAGLASED PTDAPGYTDN  360
PINVPSSLDS LLLDGPSAPL IVENTDAWKF
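
As a consistency check, the listed sequence can be joined into one string and compared with the stated length and with the domain coordinates from the Signature Domain table. This is a minimal sketch that assumes the Start and End columns are 1-based and inclusive; the function name is hypothetical.

```python
# Sequence copied from the listing above, with spaces and position counters removed.
SEQ = (
    "WWSLRIPFIQPLPPAGDPANEEEGRRFPRTQRVQVAPSLDPGTADKPPKRLKRCLHCKAV"
    "ETPQRRSGPMGRGTLCNACGVWYSKNGTLPEHRPVASPIVDSPLESPIWEPEVPGAIYLV"
    "RKSATERMPPRTEAAPAPRPGTSCLHCGSSDPPLWIEGSMGRREVCTACGMRYKKGRLLP"
    "ECRPAECSVTDSQQESPVINSPPESPIWEPEAPPSVHLPRKPSKKKKRRRSRSEAPSAPW"
    "PANKGKRCQHCGSSETPQWREGPKGRATLCNACGVRYRQGRLLPEYRPVASPTFVPTKHA"
    "NTHRKVLQLHRTRQSNDEHPSPLPADSVTNLPPIRNELPTTSTAGLASEDPTDAPGYTDN"
    "PINVPSSLDSLLLDGPSAPLIVENTDAWKF"
)

# (Start, End) pairs from the Signature Domain table.
DOMAINS = [(54, 88), (144, 178), (248, 282)]


def subsequence(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


if __name__ == "__main__":
    print("length:", len(SEQ))
    for start, end in DOMAINS:
        print(start, end, subsequence(SEQ, start, end))
```

Under that coordinate assumption, each extracted segment should equal the query line of the corresponding alignment.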
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    224    229  KKKKRR
2    224    230  KKKKRRR
3    225    231  KKKKRRR
4    225    230  KKKRRR
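
The NLS coordinates can be compared with the protein sequence in the same way. The sketch below uses only residues 221 to 240 from the sequence listing and assumes 1-based inclusive coordinates; the names `FRAGMENT`, `residues`, and `compare_rows` are hypothetical.

```python
# Residues 221-240 of the protein, copied from the sequence listing.
FRAGMENT = "KPSKKKKRRRSRSEAPSAPW"
FRAGMENT_START = 221  # 1-based position of the first residue in FRAGMENT

# (Start, End, Sequence) rows as listed in the NLS table.
NLS_ROWS = [
    (224, 229, "KKKKRR"),
    (224, 230, "KKKKRRR"),
    (225, 231, "KKKKRRR"),
    (225, 230, "KKKRRR"),
]


def residues(start, end):
    """Return residues start..end (assumed 1-based, inclusive) from FRAGMENT."""
    lo = start - FRAGMENT_START
    hi = end - FRAGMENT_START + 1
    return FRAGMENT[lo:hi]


def compare_rows(rows):
    """Pair each listed NLS string with the residues found at its coordinates."""
    return [(start, end, listed, residues(start, end)) for start, end, listed in rows]


if __name__ == "__main__":
    for start, end, listed, found in compare_rows(NLS_ROWS):
        print(start, end, listed, found, listed == found)
```

Under this assumption, rows 1, 2, and 4 reproduce the listed strings, while the residues at 225 to 231 read KKKRRRS rather than the listed KKKKRRR, which may reflect a different coordinate convention for that row in the source.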
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G32890.1  1e-30    GATA family protein