PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID RcHm_v2.0_Chr4g0396691
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family GATA
Protein Properties Length: 1021 aa    MW: 112074 Da    pI: 8.8845
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr4g0396691 genome GDR View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 GATA 28.7 1.8e-09 109 139 1 34
                    GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkg 34 
                             C+ Cg+t+Tp+W    +g+  +CnaCG  y++ +
  RcHm_v2.0_Chr4g0396691 109 CAYCGVTQTPQWP---EGPLGICNACGFSYKSGR 139
                             99**********7...666779*********987 PP

2 GATA 50.5 2.8e-16 179 212 1 34
                    GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkg 34 
                             C++Cg t+Tp+WR gp g+ tLCnaCG++y++ +
  RcHm_v2.0_Chr4g0396691 179 CTHCGITQTPQWRLGPLGPQTLCNACGVRYKSGR 212
                             *******************************987 PP

3 GATA 32.9 8.6e-11 486 514 1 29
                    GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGly 29 
                             C+nCg+ ++p+W +gp g+kt+C +CGl 
  RcHm_v2.0_Chr4g0396691 486 CQNCGAEQSPQWWEGPLGPKTICLSCGLA 514
                             ***************************96 PP

4 GATA 27.6 4e-09 741 769 1 29
                    GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGly 29 
                             C+nCg+ +Tp W  +p g+k LC aCGl 
  RcHm_v2.0_Chr4g0396691 741 CQNCGVQQTPSWLGSPLGPKPLCHACGLA 769
                             ***************************95 PP

5 GATA 24.3 4.3e-08 875 904 1 31
                    GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyr 31 
                             C +Cg ++T  WR gp g++tLCnaCG +y+
  RcHm_v2.0_Chr4g0396691 875 CMQCG-KQTNRWRMGPLGPTTLCNACGNKYQ 904
                             77888.579******************9985 PP

6 GATA 54.1 2.1e-17 961 992 1 32
                    GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrk 32 
                             C++Cg+ kTplWR+gp g+ktLCnaCGl++ k
  RcHm_v2.0_Chr4g0396691 961 CQHCGAEKTPLWRSGPLGSKTLCNACGLRWYK 992
                             ******************************65 PP

Sequence
Protein Sequence    Length: 1021 aa
MVSLDPDDFL AGLADDRDYP SRPFPATENL AEEEELEWLS NKDVFPAIET CMLTEEPPKK  60
AVANAEQGNP VSVSVVKASI GGGGALLLPL MPSRKTKDES VKVKSMRVCA YCGVTQTPQW  120
PEGPLGICNA CGFSYKSGRL CTEDSPKVQS IKSDKVKSIK SDKVNGIKSD KVKTIKRKCT  180
HCGITQTPQW RLGPLGPQTL CNACGVRYKS GRLCPEYRPA NSPTFSQELH SNKHRKVMEI  240
RNQIYGNEGT VVRPVDKCTN SASTTAPEKM KKDMFPTVAT VEQPRRSVIA EQQSPVSVLE  300
NSTNSSITLK PHQPPSKFLR EQQLFCNQPN NKHNQEDTAS VEIKRNIILM SSCGTLELPY  360
KAPSEVLGQQ QLFSCQTTSK RRKKNIVEMK NEGNNATLTS SCGNLELPHQ APSQFLEQHQ  420
PNNKCNKKHT AKVEMTMKPV DICSTTLMKS KKDSLPAIET LNILEHSRGI FIAEQQSPVV  480
GIAVNCQNCG AEQSPQWWEG PLGPKTICLS CGLAKYTGRR KRSMSLVSNK KRKKEDIAKV  540
EMAVKPGDKC TNSTTALMNS KKDPLPAVGT SNISEQPSVI VISEQQSPAS VFENSISSSM  600
MLMSSSGTVN LHNQAPSEFP PQNQFCDQAN NKPKVKDTTT VEFEGNSSTL MRSSGALEPL  660
HQAPNEFLDQ QQQAKNKPNK KDTAKVESDG SSTTLMSSCG TNKLPHQAQS EFLVQQQLFC  720
NQANNGTKKE NTAKGESEGK CQNCGVQQTP SWLGSPLGPK PLCHACGLAK CTAGQQRSIA  780
VEVGKLACPS PATGEFTNST CTFPVLEEFH LSQKPSDIAN AEQQKPFSVL ESSTNNSIAL  840
MSSCRIKIPH RARSKVLRRR RSTIPGQQVG VGKNCMQCGK QTNRWRMGPL GPTTLCNACG  900
NKYQPLWGRS GVTVDIAKNC QHCGSEETTQ LLLCPLGPKT LCTVCYHWFK ASAEVEIERK  960
CQHCGAEKTP LWRSGPLGSK TLCNACGLRW YKFRDLCRDR SASSPTLLRE LHSNCYRNVL  1020
I
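
The Start and End columns of the Signature Domain table are read here as 1-based, inclusive positions in the protein sequence above. The following is a minimal Python sketch, not part of PlantTFDB, for checking that reading against the listing. The helper names parse_blocked_sequence and slice_1based are hypothetical, and only the first 240 residues (the first four lines of the listing) are embedded, which is enough to cover domains 1 and 2.

```python
def parse_blocked_sequence(lines):
    """Join a blocked sequence listing into one residue string.

    Each line holds space-separated blocks of residues, optionally
    followed by a running position counter (e.g. "... CMLTEEPPKK  60").
    """
    residues = []
    for line in lines:
        # Keep purely alphabetic tokens; this drops the trailing counter.
        residues.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(residues)


def slice_1based(seq, start, end):
    """Return residues start..end, both 1-based and inclusive."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"range {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


# First four lines of the listing (residues 1-240), copied verbatim.
FIRST_240 = [
    "MVSLDPDDFL AGLADDRDYP SRPFPATENL AEEEELEWLS NKDVFPAIET CMLTEEPPKK  60",
    "AVANAEQGNP VSVSVVKASI GGGGALLLPL MPSRKTKDES VKVKSMRVCA YCGVTQTPQW  120",
    "PEGPLGICNA CGFSYKSGRL CTEDSPKVQS IKSDKVKSIK SDKVNGIKSD KVKTIKRKCT  180",
    "HCGITQTPQW RLGPLGPQTL CNACGVRYKS GRLCPEYRPA NSPTFSQELH SNKHRKVMEI  240",
]

seq = parse_blocked_sequence(FIRST_240)

# Domain 1 (109-139) and domain 2 (179-212) from the Signature Domain table.
domain_1 = slice_1based(seq, 109, 139)
domain_2 = slice_1based(seq, 179, 212)

print(len(seq))   # 240
print(domain_1)   # CAYCGVTQTPQWPEGPLGICNACGFSYKSGR
print(domain_2)   # CTHCGITQTPQWRLGPLGPQTLCNACGVRYKSGR
```

Both slices match the query rows of the corresponding alignments once the gap characters are removed, which supports the 1-based, inclusive reading of the coordinates.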
Nuclear Localization Signal
NLS
No. Start End Sequence
1 519 534 RRKRSMSLVSNKKRKK
2 520 535 RRKRSMSLVSNKKRKK
3 530 534 KKRKK
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT4G34680.2 2e-33 GATA family protein