PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Rmu_sc0006586.1_g000008.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family GATA
Protein Properties Length: 1018 aa    MW: 111676 Da    pI: 8.7912
Description GATA family protein
Gene Model
Gene Model ID    Type    Source
Rmu_sc0006586.1_g000008.1    genome    GDR
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    GATA    28.7    1.8e-09    106    136    1    34
                       GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkg 34 
                                C+ Cg+t+Tp+W    +g+  +CnaCG  y++ +
  Rmu_sc0006586.1_g000008.1 106 CAYCGVTQTPQWP---EGPLGICNACGFSYKSGR 136
                                99**********7...666779*********987 PP

2    GATA    50.5    2.8e-16    176    209    1    34
                       GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkg 34 
                                C++Cg t+Tp+WR gp g+ tLCnaCG++y++ +
  Rmu_sc0006586.1_g000008.1 176 CTHCGITQTPQWRLGPLGPQTLCNACGVRYKSGR 209
                                *******************************987 PP

3    GATA    34.1    3.7e-11    483    511    1    29
                       GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGly 29 
                                C+nCgt ++p+W +gp g+kt+C +CGl 
  Rmu_sc0006586.1_g000008.1 483 CQNCGTEQSPQWWEGPLGPKTICLSCGLA 511
                                ***************************96 PP

4    GATA    27.6    4e-09    738    766    1    29
                       GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGly 29 
                                C+nCg+ +Tp W  +p g+k LC aCGl 
  Rmu_sc0006586.1_g000008.1 738 CQNCGVEQTPSWLGSPLGPKPLCHACGLA 766
                                ***************************95 PP

5    GATA    24.3    4.3e-08    872    901    1    31
                       GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyr 31 
                                C +Cg ++T  WR gp g++tLCnaCG +y+
  Rmu_sc0006586.1_g000008.1 872 CMQCG-KQTNRWRMGPLGPTTLCNACGNKYQ 901
                                77888.579******************9985 PP

6    GATA    54.1    2.1e-17    958    989    1    32
                       GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrk 32 
                                C++Cg+ kTplWR+gp g+ktLCnaCGl++ k
  Rmu_sc0006586.1_g000008.1 958 CQHCGAEKTPLWRSGPLGSKTLCNACGLRWYK 989
                                ******************************65 PP
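
The Start and End columns can be cross-checked against the alignment rows: once gap characters are removed, the aligned query string should contain exactly End - Start + 1 residues. The following is a minimal Python sketch of that check for hits 1 and 5, with the values copied from the table and alignments above. The helper names `ungapped_length` and `is_consistent` are illustrative only and are not part of PlantTFDB.

```python
# Hit number, Start, End, and the aligned query row, copied from the record above.
hits = [
    (1, 106, 136, "CAYCGVTQTPQWP---EGPLGICNACGFSYKSGR"),
    (5, 872, 901, "CMQCG-KQTNRWRMGPLGPTTLCNACGNKYQ"),
]


def ungapped_length(aligned):
    """Count residues in an aligned row, ignoring '-' gap characters."""
    return len(aligned.replace("-", ""))


def is_consistent(start, end, aligned):
    """True if the residue count matches the 1-based inclusive span."""
    return ungapped_length(aligned) == end - start + 1


results = {no: is_consistent(start, end, aligned)
           for no, start, end, aligned in hits}
print(results)
```

A mismatch here would usually point to a mis-split table row rather than a problem with the alignment itself.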

Sequence
Protein Sequence    Length: 1018 aa
MVSLDPDDFL AGLADDPDYP SRPFPATENL AEEEELEWLS NKDVFPAIET CMLTEEPPKK  60
AMANAEQGNP VSVSVVKASI GGGGALLLPS RKTKDESVKV KSMRVCAYCG VTQTPQWPEG  120
PLGICNACGF SYKSGRLCTE DSPKVQSIKS DKVKSIKSDK VNGIKSDKVK TIKRKCTHCG  180
ITQTPQWRLG PLGPQTLCNA CGVRYKSGRL CPEYRPANCP TFSQELHSNM HRKVMEIRKQ  240
IYGNEGTVVK PVDKCTNSAA TTALEKMKKD MFPTVATVEQ PRRSVIAEQQ SPVSVLENST  300
NSSITLKPHQ PPSKFLREQQ LFCNQPNNKH NQEDTASVEI KRNIILMSSC GTLELPYKAP  360
SEVLGQQQLF SCQTTSKRRK KNIVEMKNEG NNATLTSSCG NLEPPHQAPS QFLEQHQPNN  420
KCNKKHTAKV EMTMKPVDIC STTLMKSKKD SLPAIETLNI LEHSRGIFIA EQQSPVVGIA  480
VNCQNCGTEQ SPQWWEGPLG PKTICLSCGL AKYTGRRKRS MTLVSNKKRK KEDIAKVEMA  540
VKPGDKCTNS TTALMNSKKD PLPAVGTSNI SEQPSVIVIA EQQSPASVFE NSISSSMMLM  600
SSSGTVNLHN QAPSEFPPQN QFCDQANNKP KVKDTTTVEF EGNSSTLMRS SGALEPLHQA  660
PNEFLDQQQQ AKNKPNKKDT AKVESDGSST TLMSSCGTNK LPHQAHSEFL GQQQLFCNQA  720
NNGTKKENTA KGESEGKCQN CGVEQTPSWL GSPLGPKPLC HACGLAKCTA GQQRSIAVEV  780
GKLACPSPAT GEFTNSTCSF PVLEEFHLSQ KPSDIANAEQ QKPFSVLESS TNNSIALMSS  840
CRIKIPHRAR SKVLRRRRST IPGQQVGVGK NCMQCGKQTN RWRMGPLGPT TLCNACGNKY  900
QPLWGRSGVT VDIAKNCQHC GSEETTQLLL CPLGPKTLCT VCYHWFKASA EVEIERKCQH  960
CGAEKTPLWR SGPLGSKTLC NACGLRWYKF RDLCRDRSAS SPTLLRELHS NCYRNVLI
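
The sequence is printed in blocks of ten residues with a running position counter at the end of each line, so it has to be cleaned before domain coordinates can be applied. The following is a minimal Python sketch, assuming only the layout shown above: it strips spaces and counters from two lines of the record (residues 61 to 180) and extracts the first GATA domain using the 1-based, inclusive coordinates 106 to 136. The function names `clean_sequence` and `slice_1based` are hypothetical helpers, not a PlantTFDB API.

```python
import re

# Two lines copied from the formatted sequence above (residues 61-180).
block = """\
AMANAEQGNP VSVSVVKASI GGGGALLLPS RKTKDESVKV KSMRVCAYCG VTQTPQWPEG  120
PLGICNACGF SYKSGRLCTE DSPKVQSIKS DKVKSIKSDK VNGIKSDKVK TIKRKCTHCG  180
"""


def clean_sequence(text):
    """Remove spaces, newlines, and position counters, keeping letters only."""
    return re.sub(r"[^A-Za-z]", "", text)


def slice_1based(seq, start, end, offset=1):
    """Return residues start..end (1-based, inclusive).

    offset is the sequence position of seq[0]; it is 61 for this fragment.
    """
    return seq[start - offset:end - offset + 1]


fragment = clean_sequence(block)
domain1 = slice_1based(fragment, 106, 136, offset=61)
print(len(fragment), domain1)
```

With the full 1018-residue sequence, the same slice would be taken with the default `offset=1`.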
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1    516    531    RRKRSMTLVSNKKRKK
2    517    532    RRKRSMTLVSNKKRKK
3    527    531    KKRKK
Best hit in Arabidopsis thaliana
Hit ID    E-value    Description
AT4G34680.2    4e-34    GATA family protein