PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Pd.00g160550.m01
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family GATA
Protein Properties    Length: 629 aa    MW: 68565.7 Da    pI: 4.7749
Description GATA family protein
Gene Model
Gene Model ID       Type      Source    Coding Sequence
Pd.00g160550.m01    genome    GDR       View CDS
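The MW value in the Protein Properties line is the kind of figure usually derived from the sequence by summing average residue masses and adding one water for the free termini. The sketch below illustrates that calculation; the mass table holds standard average residue masses, the function name `molecular_weight` is hypothetical, and nothing here is confirmed to be the method PlantTFDB uses, so small differences from the reported 68565.7 Da would not be surprising.

```python
# Average residue masses in Da (amino acid minus one water).
# Assumption: standard textbook values, not taken from this record.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added back for the N- and C-termini


def molecular_weight(sequence: str) -> float:
    """Estimate the average molecular weight (Da) of an unmodified peptide.

    Raises KeyError for letters outside the 20 standard amino acids.
    """
    residues = sequence.upper()
    if not residues:
        raise ValueError("empty sequence")
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in residues) + WATER_MASS


if __name__ == "__main__":
    # First ten residues of Pd.00g160550.m01, used only as a short demo input.
    print(round(molecular_weight("MYGHSEPMTM"), 1))
```

Passing the full 629 aa sequence from the Sequence section (with spaces and position counters removed) gives an estimate that can be compared against the reported MW.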
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      GATA      48.1     1.6e-15    207      243    1            35
              GATA   1 CsnCgttk..TplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +Cg ++  Tp++Rrgp+g+++LCnaCGl+++ +g+
  Pd.00g160550.m01 207 CKHCGISSksTPMMRRGPSGPRSLCNACGLFWANRGT 243
                       *****99999***********************9997 PP

2      GATA      48.7     1e-15      483      519    1            35
              GATA   1 CsnCgttk..TplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C++Cg+++  Tp +Rrgp g++tLCnaCGl+++ kg+
  Pd.00g160550.m01 483 CQHCGVSEnnTPAMRRGPAGPRTLCNACGLMWANKGT 519
                       *******99*************************997 PP
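Both aligned segments above contain four conserved cysteines, which is what a GATA-type zinc finger looks like at the sequence level. As a rough cross-check, the sketch below scans the two domain segments with the commonly cited consensus C-X2-C-X17-20-C-X2-C. That consensus is general background knowledge rather than something stated in this record, and `find_gata_fingers` is a hypothetical helper name.

```python
import re

# Assumption: general GATA-type zinc finger consensus C-X2-C-X17-20-C-X2-C.
GATA_FINGER = re.compile(r"C.{2}C.{17,20}C.{2}C")


def find_gata_fingers(segment, first_residue=1):
    """Return (start, end, loop_length) for each consensus match.

    start/end are 1-based inclusive protein coordinates; first_residue is
    the protein position of segment[0]. Lowercase letters (HMM insert
    states in the alignment) are treated as ordinary residues.
    """
    hits = []
    for match in GATA_FINGER.finditer(segment.upper()):
        # Match = 4 Cys + two 2-residue spacers + the central loop.
        loop_length = len(match.group(0)) - 8
        start = first_residue + match.start()
        end = first_residue + match.end() - 1
        hits.append((start, end, loop_length))
    return hits


# Query segments copied from the two alignment blocks above.
DOMAIN_1 = "CKHCGISSksTPMMRRGPSGPRSLCNACGLFWANRGT"  # residues 207-243
DOMAIN_2 = "CQHCGVSEnnTPAMRRGPAGPRTLCNACGLMWANKGT"  # residues 483-519

if __name__ == "__main__":
    print(find_gata_fingers(DOMAIN_1, first_residue=207))
    print(find_gata_fingers(DOMAIN_2, first_residue=483))
```

A regular expression is a much cruder test than the profile HMM behind the scores in the table, so it is best read as a sanity check on the coordinates, not as a replacement for the domain search.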

Sequence
Protein Sequence    Length: 629 aa
MYGHSEPMTM PNSIPAGGDD DAAGPGVDSI ENAHIHYEPH TLEDGGGVVA VVEDVSSDPV  60
YDVGSSEMRA QPYDGSSQLT LSFRGQVFVF DAVTPEKVQA VLLLLGGSEL SSGPQGAELA  120
SQNQRGTEDF PIRCSQPHRA ASLSRFRQKR KERCFDKKVR YSVRQEVALR MQRNKGQFSS  180
SKKSDGDYSW GNGQESGQDD SHAETSCKHC GISSKSTPMM RRGPSGPRSL CNACGLFWAN  240
RGTLRELSKR TQDHSVTPAE QGEADTKDLN SVTAIDAHNS LVPFSNDDDW LSGGTGLGFE  300
LGEGGGGEAG DLVWREYLMA AVNPQPLQAR PFEEHGRGPI PIEDDEAEYE DGGDDGEVYV  360
FPAVTPEKVQ AVLLLLGGRD VPTGVPTVEV SYDQNTRGVA DTPKRSNLSR RIASLVRFRE  420
KRKERCFDKK IRYTVRKEVA QRMLRKNGQF ASLKQNSGDS GWDSAQSGLQ DGTSRPETVL  480
RRCQHCGVSE NNTPAMRRGP AGPRTLCNAC GLMWANKGTL RDLSKGGRNL TMDHIEPGTP  540
IEVKPLLVEG EFSGNQDEHE TLEGPSKTVI ERSNDASVNL DEQDLHETAE DLTNSLPMGI  600
VSSANDEQEP LVELTNPSDT DLDIPTNFD
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      401      406    DTPKRS
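The Start and End columns are 1-based, inclusive positions in the protein sequence. The sketch below shows one way to confirm an entry against the grouped sequence listing: strip the spaces and the trailing position counter, then slice by coordinate. The helper names `parse_grouped_sequence` and `slice_residues` are hypothetical, and the example uses only the listing line that covers residues 361 to 420.

```python
def parse_grouped_sequence(block):
    """Keep only letters, dropping spaces, newlines and position counters."""
    return "".join(ch for ch in block if ch.isalpha())


def slice_residues(sequence, start, end, first_residue=1):
    """Return residues start..end (1-based, inclusive).

    first_residue is the protein position of sequence[0], which lets a
    partial listing be sliced with full-length protein coordinates.
    """
    lo = start - first_residue
    hi = end - first_residue + 1
    if lo < 0 or hi > len(sequence) or lo >= hi:
        raise ValueError("coordinates fall outside the supplied sequence")
    return sequence[lo:hi]


# Listing line for residues 361-420, copied from the Sequence section.
LINE_361_420 = (
    "FPAVTPEKVQ AVLLLLGGRD VPTGVPTVEV SYDQNTRGVA DTPKRSNLSR RIASLVRFRE  420"
)

if __name__ == "__main__":
    fragment = parse_grouped_sequence(LINE_361_420)
    print(slice_residues(fragment, 401, 406, first_residue=361))
```

The same two helpers work on the full listing with `first_residue` left at 1, for example to pull out the GATA domain spans 207-243 and 483-519.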
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT4G24470.2    3e-79      GATA family protein