PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Rmu_sc0003674.1_g000007.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family WRKY
Protein Properties Length: 1743aa    MW: 196969 Da    PI: 8.026
Description WRKY family protein
Gene Model
Gene Model ID                Type     Source
Rmu_sc0003674.1_g000007.1    genome   GDR
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    WRKY    47.4   3.8e-15  1297   1355  3          58
                                 -SS-EEEEEEE--TT-SS-EEE.EEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS CS
                       WRKY    3 DgynWrKYGqKevkgsefprsY.YrCtsa...gCpvkkkversaedpkvveitYegeHnh 58  
                                 Dg++W  Y q ++ gse+pr Y Y Ct +   gC++ k+v++s+ d+k+++itY+g+H++
  Rmu_sc0003674.1_g000007.1 1297 DGFTWGEYDQTDIPGSEYPRVYyYVCTPQnvkGCMAIKEVQHSS-DQKILKITYRGRHTC 1355
                                 9*******************86257*9888889***********.*************98 PP

Sequence
Protein Sequence    Length: 1743 aa
MDPASSSSSS SSSSSSSLTY DVFLSFRGED TRQSFVCHLH KALEKALKVY IDKEDLKKGD  60
RLSDLLKAIA ESKLSIVVFS ENYGNSTWCL KELVQIMEYE EKHKQIVIPV FYKVDPSHIR  120
KQTGRFGKAY DEHERKCKKD KKLLKEVQRW RPALSKAADL SGWDSNSKDF GDDAKLIERI  180
VNDVLDKVNR ISPHIEDGLV AMDSHLRQVG RLLNDAPDDA VYVGIWGMGG LGKTTIARAV  240
YNRISHQFDY RCFLKNVRER FKSKGEVEMQ LQFLSEIFKG IVNNFDGDHM MWLERLRKKK  300
ILVVLDDVLD SSQIDTLLGE KRSFGGGSII IVTTRDEHVV RGFEIYKPEP LSDEDALKLF  360
SQKAFDTCKP PGEYEHLSKC IAEYARGLPL ALIVLGKFLR KRSVDHWEGQ LKKLKECPHA  420
DIENVLKISY DGLEPCQQVI FLDIACFFKG MEKDYVTKVL ANCDCISPGL DLDVLVERAL  480
VTVSYSNQLE MHDLLQEMGR QIEKDGKRSR FWGEEAERVL TENTATENAG RLMIDLPKKE  540
GGSFNTSMRI FTYGQRYSGD DGQNHLTGEF NFLSRELKVL VWHGYPQKCL PSNFDPKNLV  600
QLDMPYSNIE QQLWEGSKPA KKLAIIDLSG SSCLKRIPDL TLVTNLKELH FDRCSSLKEV  660
HPSISSLKNL VLFSLAGCDE LKSLPGSIDQ MKSLKTLDLH HCLNFEIFPE ISEVMEALSK  720
LDLLQTAIKE LPSSIERLRG LKSLDMTFCF SLIRLPDSIC NLAELRSLSL RDCSKLCKLP  780
ENIGNLESLL EFQVHGTGLE QLPISILRLK VGELYFSCCR KMAAPLSSWP SSIEDCCTAV  840
LHLDLRYCNL LELSDAIAHF SSLKALTLSR NDNLESLPAI MNRLASLEKL ELDGCKSLRS  900
IPELSSTISY INAHDCRALE SVSTPQSPYD VGRCFIFSNC SQLVQKDVFR DIVKTHLPPQ  960
GNRSRPFYVS LPGSEIPEWF THQCRGSSVT AKLPPNWFDN KFLGFTVCAV INKTQDEIIA  1020
SIASRPDSFL RLTARCFCTF KGDHREYRFS FYLFDLRCFD SDGCWLESNH MLLGYVPWSE  1080
SGMIKREEVI KREEVNERRY TEAKFEIQLH DGSSHRYTLS DRFKPCIERC GVRFLFANNE  1140
EDFGEPMAEV DNSERSFEGC SAEPSGTDIT NEDEQYLKLS QVFKGANRAW SFNKFEDNDL  1200
SGKRKTLERW TQKVRVTSGL MQGPVDDGFK WKSDGGGIDG VSYYCGYCSA RAIKPTQDDP  1260
TICQITYVGR HTCWKFQPML THQVGVTPGM RIEGPFDGFT WGEYDQTDIP GSEYPRVYYY  1320
VCTPQNVKGC MAIKEVQHSS DQKILKITYR GRHTCIEASD SIAGWFERLK SKGSVESSFH  1380
EQVPLSPSSP GANSQDMSNR KEVTAKKSKG EEVAELEVEK TEKKKKKKKK KDKDNGVLDS  1440
SDGEKSVKVK RHEGKEEAGS PQTEKSEKKK KKKNKEAQDH GADAATDNGK GDGEADKSGK  1500
KKEKKKRKQE EIIDSPVSVP AKKSKGEEAA ELKVEKSEKK KKKKKDKDNG VLDSSDGEKS  1560
VNVEKHKDKE EAGSPQTEKS EKKKKKKNKE AQDHGADVAT DNGKGDGEAD KSEKKKEKKK  1620
RKQEEITNSP VTVPAKKGKG EEVAGLEVEK TEKKKKKKKD KDNGVLDSSY GEKSVKVKKH  1680
TDKEEAGSPQ TEKSEKKKNK EAEDHSVDAA TDNSKGDGEA DKSEKKKKKK KKKKKDKSDA  1740
DEE
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1423   1429  KKKKKKK
2    1423   1431  KKKKKKKKK
3    1423   1450  KKKKKKKKKDKDNGVLDSSDGEKSVKVK
4    1424   1430  KKKKKKK
5    1424   1451  KKKKKKKKKDKDNGVLDSSDGEKSVKVK
6    1425   1431  KKKKKKK
7    1468   1508  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
8    1469   1509  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
9    1470   1510  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
10   1501   1541  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
11   1504   1544  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
12   1505   1545  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
13   1539   1545  KKKKKKK
14   1539   1566  KKKKKKKKKDKDNGVLDSSDGEKSVKVK
15   1540   1567  KKKKKKKKKDKDNGVLDSSDGEKSVKVK
16   1541   1568  KKKKKKKKKDKDNGVLDSSDGEKSVKVK
17   1542   1569  KKKKKKKKKDKDNGVLDSSDGEKSVKVK
18   1582   1622  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
19   1583   1623  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
20   1584   1624  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
21   1615   1655  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
22   1618   1658  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
23   1619   1659  KKKKKKNKEAQDHGADAATDNGKGDGEADKSGKKKEKKKRK
24   1653   1659  KKKKKKK
25   1654   1683  KKKKKKDKDNGVLDSSYGEKSVKVKKHTDK
26   1655   1684  KKKKKKDKDNGVLDSSYGEKSVKVKKHTDK
27   1656   1685  KKKKKKDKDNGVLDSSYGEKSVKVKKHTDK
28   1725   1731  KKKKKKK
29   1725   1733  KKKKKKKKK
30   1725   1735  KKKKKKKKKKK
31   1726   1732  KKKKKKK
32   1726   1734  KKKKKKKKK
33   1726   1736  KKKKKKKKKKK
34   1727   1733  KKKKKKK
35   1727   1735  KKKKKKKKK
36   1728   1734  KKKKKKK
37   1729   1735  KKKKKKK
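The Start and End columns in the NLS table appear to be 1-based, inclusive residue positions in the protein sequence. The sketch below illustrates that reading on the 1381-1440 sequence line copied from this record. The helper name `slice_residues` and the variable `fragment` are hypothetical names chosen for illustration and are not part of PlantTFDB.

```python
def slice_residues(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `first_pos` is the residue number of seq[0]; it lets us work with a
    fragment of the protein instead of the full 1743 aa sequence.
    """
    if start < first_pos or end < start:
        raise ValueError("coordinates fall outside the supplied fragment")
    return seq[start - first_pos : end - first_pos + 1]


# Residues 1381-1440, copied from the sequence block of this record
# (spaces between the 10-residue groups removed).
fragment = "EQVPLSPSSPGANSQDMSNRKEVTAKKSKGEEVAELEVEKTEKKKKKKKKKDKDNGVLDS"

# Rows 1 and 2 of the NLS table: (Start, End, Sequence).
nls_rows = [
    (1423, 1429, "KKKKKKK"),
    (1423, 1431, "KKKKKKKKK"),
]

for start, end, expected in nls_rows:
    found = slice_residues(fragment, start, end, first_pos=1381)
    print(start, end, found, found == expected)
```

Under this assumption, both rows reproduce the listed Sequence value, and the length of each hit is End - Start + 1.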
Best hit in Arabidopsis thaliana
Hit ID        E-value  Description
AT5G45050.1   9e-88    WRKY family protein