PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr1g0350001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family WRKY
Protein Properties Length: 756aa    MW: 85116.8 Da    PI: 7.7947
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr1g0350001genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1WRKY49.49.3e-16481539358
                             -SS-EEEEEEE--TT-SS-EEE.EEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS CS
                    WRKY   3 DgynWrKYGqKevkgsefprsY.YrCtsa...gCpvkkkversaedpkvveitYegeHnh 58 
                             Dg++W KY q ++ g+e+pr Y Y Ct +   gC++ k+v++s+ d+k+++itY+g+H++
  RcHm_v2.0_Chr1g0350001 481 DGFTWGKYDQTDIPGAEYPRVYyYVCTPQnvkGCMAIKEVKHSS-DKKILKITYRGKHTC 539
                             9*******************86257*9888889**********9.*************98 PP

Sequence ? help Back to Top
Protein Sequence    Length: 756 aa     Download sequence    
MAAPLSSWPS SIEDRCTVVV HLDLSRCNLM ELSDAIAHFS SLKALTLSGN DNLESLPAIM  60
NELASLEKLE LDGCKRLRSI PELSSTISYI NAHDCTALES VSTPQSPYDI GRCFIFSNCS  120
QLVQKDVFRD IVETHLPPQG NPSRPLYVSF PGSEIPQWFT HQCRGSSVTA KLPPNWFDKK  180
FLGFTICAFT NKPQEIVSSI ISSPFPYFKL TARCFCTFKG DHCEYRFSFY LFDIHSVDFG  240
RPWVESNHRC WLESNHMLVG YVPWSESGMI KREEVNERRY TEAKFEIQLH YESSKRSDQC  300
IEKCGVRFVF DDNEEDSGES MVEVDNSERS FQGCSAEPNG SDITGDTSDE DEQYLKLLSE  360
VFKGANRPPS YIKIEDNDLS GKRKTLERWT KKVRVTSGLM QGSVDDGFNW SWNSSGIDGQ  420
AGYCSERVQP RCWAKAIKPT HDDPTIFQMT YVGRHTCWKF QPMLTHQVGV TPDMRIEGPF  480
DGFTWGKYDQ TDIPGAEYPR VYYYVCTPQN VKGCMAIKEV KHSSDKKILK ITYRGKHTCI  540
EASDSIAGWF ERLNSKGSVE SSFHEQVPLS PSSPGANSQD MSNRKEVTAK KSKGEEVAEL  600
EVEKTEKKKK KKKKDKDNGV LGSSDGEKSV KVKRHKGKEE AGSPQTEKSA KMKKKKNKEA  660
QDHGADAATD NGKGDGEADK SGKKKKEKKK RKQEEITDSP VSVPAKKSKG EEAAELKVEK  720
TEKKKKKKDK DNGVLDCSDG EKSVNVKKHK DKEEAG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1607613KKKKKKK
2607633KKKKKKKKDKDNGVLGSSDGEKSVKVK
3608614KKKKKKK
4608634KKKKKKKKDKDNGVLGSSDGEKSVKVK
5609635KKKKKKKKDKDNGVLGSSDGEKSVKVK
6610636KKKKKKKKDKDNGVLGSSDGEKSVKVK
7611637KKKKKKKKDKDNGVLGSSDGEKSVKVK
8653692KKKKNKEAQDHGADAATDNGKGDGEADKSGKKKKEKKKRK
9683709KKKKKKKKDKDNGVLGSSDGEKSVKVK
10685724KKKKNKEAQDHGADAATDNGKGDGEADKSGKKKKEKKKRK
11688727KKKKNKEAQDHGADAATDNGKGDGEADKSGKKKKEKKKRK
12689728KKKKNKEAQDHGADAATDNGKGDGEADKSGKKKKEKKKRK
13723752KKKKKKDKDNGVLDCSDGEKSVNVKKHKDK
14723749KKKKKKKKDKDNGVLGSSDGEKSVKVK
15724753KKKKKKDKDNGVLDCSDGEKSVNVKKHKDK
16724750KKKKKKKKDKDNGVLGSSDGEKSVKVK
17725754KKKKKKDKDNGVLDCSDGEKSVNVKKHKDK
18725751KKKKKKKKDKDNGVLGSSDGEKSVKVK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G11070.22e-16WRKY family protein