PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC5AG020080.6
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family GATA
Protein Properties Length: 514aa    MW: 55538.2 Da    PI: 10.1979
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC5AG020080.6genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA35.31.6e-11156190135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +C++ +Tp+ R+gp g  tLCnaCG+ y k+g+
  TRIDC5AG020080.6 156 CLHCKAVETPQRRSGPMGRGTLCNACGVWYSKNGT 190
                       99******************************997 PP

2GATA31.91.9e-10246280135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +Cg+++ plW +g+ g +++C aCG++y+k +l
  TRIDC5AG020080.6 246 CLHCGSSDPPLWIEGSMGRREVCTACGMRYKKGRL 280
                       99******************************986 PP

3GATA551.1e-17350384135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C++Cg+++Tp+WR+gp g  tLCnaCG++yr  +l
  TRIDC5AG020080.6 350 CQHCGSSETPQWREGPKGRATLCNACGVRYRQGRL 384
                       *******************************9886 PP

Sequence ? help Back to Top
Protein Sequence    Length: 514 aa     Download sequence    
VHGDGGGRRG RRGTSIGLSF FTFGGWPPGL GRGRCPRFRR SEATRGMAGG GGVGGKEDGG  60
TEPLLLSVLA LPATALPAVV SRLEAAVPRK ARSYLPRNVP SAWWSLRIPF IQPLPPAGDP  120
ANEEEGRRFP RTQRVQVAPS LDPGTADKPP KRLKRCLHCK AVETPQRRSG PMGRGTLCNA  180
CGVWYSKNGT LPEHRPVASP IVDSPLESPI WEPEVPGAIY LVRKSATERM PPRTEAAPAP  240
RPGTSCLHCG SSDPPLWIEG SMGRREVCTA CGMRYKKGRL LPECRPAECS VTDSQQESPV  300
INSPPESPIW EPEAPPSVHL PRKPSKKKKR RRSRSEAPSA PWPANKGKRC QHCGSSETPQ  360
WREGPKGRAT LCNACGVRYR QGRLLPEYRP VASPTFVPTK HANTHRKVLQ LHRTRQSNDE  420
HPSPLPADSV TNLPPIRNEL PTTSTAGLAS EDPTDAPGYT DNPINVPSSL DSLLLDGPSA  480
PLIILMLGNF KLSITEGHCT GPSSDQTSSV NSTI
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1326331KKKKRR
2326332KKKKRRR
3327333KKKKRRR
4327332KKKRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G32890.15e-30GATA family protein