PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC5AG020080.2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family GATA
Protein Properties Length: 407aa    MW: 44709.8 Da    PI: 9.6338
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC5AG020080.2genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA35.71.2e-1171105135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +C++ +Tp+ R+gp g  tLCnaCG+ y k+g+
  TRIDC5AG020080.2  71 CLHCKAVETPQRRSGPMGRGTLCNACGVWYSKNGT 105
                       99******************************997 PP

2GATA32.31.4e-10161195135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +Cg+++ plW +g+ g +++C aCG++y+k +l
  TRIDC5AG020080.2 161 CLHCGSSDPPLWIEGSMGRREVCTACGMRYKKGRL 195
                       99******************************986 PP

3GATA55.48.4e-18265299135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C++Cg+++Tp+WR+gp g  tLCnaCG++yr  +l
  TRIDC5AG020080.2 265 CQHCGSSETPQWREGPKGRATLCNACGVRYRQGRL 299
                       *******************************9886 PP

Sequence ? help Back to Top
Protein Sequence    Length: 407 aa     Download sequence    
AVPRKARSYL PRNVPSAWWS LRIPFIQPLP PAGDPANEEE GRRFPRTQRV QVAPSLDPGT  60
ADKPPKRLKR CLHCKAVETP QRRSGPMGRG TLCNACGVWY SKNGTLPEHR PVASPIVDSP  120
LESPIWEPEV PGAIYLVRKS ATERMPPRTE AAPAPRPGTS CLHCGSSDPP LWIEGSMGRR  180
EVCTACGMRY KKGRLLPECR PAECSVTDSQ QESPVINSPP ESPIWEPEAP PSVHLPRKPS  240
KKKKRRRSRS EAPSAPWPAN KGKRCQHCGS SETPQWREGP KGRATLCNAC GVRYRQGRLL  300
PEYRPVASPT FVPTKHANTH RKVLQLHRTR QSNDEHPSPL PADSVTNLPP IRNELPTTST  360
AGLASEDPTD APGYTDNPIN VPSSLDSLLL DGPSAPLIVE SEDFAIS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1241246KKKKRR
2241247KKKKRRR
3242248KKKKRRR
4242247KKKRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G32890.11e-30GATA family protein