PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_022769626.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family WRKY
Protein Properties Length: 1822aa    MW: 208343 Da    PI: 7.1289
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022769626.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1WRKY79.34.3e-2512641323158
                      ---SS-EEEEEEE..--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS CS
            WRKY    1 ldDgynWrKYGqK..evkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnh 58  
                      ldDgy+WrKYG+K  +v+g+++pr YY+C++ gCp+kk+ er+ +d++++++tYeg Hnh
  XP_022769626.1 1264 LDDGYRWRKYGKKkkSVQGNPHPRCYYKCSTMGCPAKKRFERDYQDTSFLITTYEGVHNH 1323
                      59*********85227******************************************** PP

2WRKY86.32.7e-2714661526259
                      --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
            WRKY    2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59  
                      dDg++WrK GqKe+ gs++pr+YYrCt++   +C+++k+v+rs++dp+++eitY+g+H+++
  XP_022769626.1 1466 DDGFSWRKCGQKEILGSKYPRAYYRCTHRnvqDCMATKQVQRSDDDPTIFEITYHGRHTCT 1526
                      8***************************98999**************************96 PP

3WRKY81.21.1e-2516331693259
                      --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
            WRKY    2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59  
                      dDg++WrK GqKev g+++pr+YYrCt++   +C+++k+v+rs++dp+++eitY g+H+++
  XP_022769626.1 1633 DDGFSWRKCGQKEVLGTKYPRAYYRCTHRnvqNCWATKQVKRSDDDPTIFEITYCGRHTCT 1693
                      8***************************99999**************************96 PP

4WRKY25.72.3e-0818001822224
                      --SS-EEEEEEE--TT-SS-EEE CS
            WRKY    2 dDgynWrKYGqKevkgsefprsY 24  
                      dDg++WrKY qKe+ gs+++rsY
  XP_022769626.1 1800 DDGFSWRKYEQKEILGSKYTRSY 1822
                      8*********************9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1822 aa     Download sequence    
MQQSFPKASG SRLPELRKKE EWAAKEIHLA DDKHKVSELP KSPNYSSLIA LYLQGNYERT  60
AVPPLFFRRM ALLQVLDLSH TSIKSLPKSL PKLVSLKKLS LRGCELFMEL SPQVGKLKNL  120
EELDLDETQI IDLPSEIGRL VKLSHLRVSF YHICGKKKSK SNFVIHPEAI SVLSQLAELS  180
IDVNPADKRW DDSVEAVVKE ACNSKTLKTL SLYLPKFQLL DNISLIYPSL PHFKFTVGHH  240
KRRIISRVPH EVEAEFRNWD KCLKFVNGES IPIEIEAVLK YSSSFFLDNH ATAMNLSEFG  300
IENMKRLKFC LLAECNKMET LIDGQMHDER NEDDQSESDT GSAEHVLESL EYLSIYYMEN  360
LWSLWRGPNR CGCMSRLKFL ALHTCPQLRH IFSRTLFENF VNLEEIIVED CPQVTSLVSR  420
ASVKPMMSNK FFPSLKRLLL LYLPRLVSIS NGLLITPKLE SIGFYNCPKL KSISKVELSS  480
KTLKIIKGER QWWEDLNWNE TEWGNRPDYL MHMFSPISNE KDVMTQLTED RDLLEATIQN  540
VGQQQEYFVR QDLLDLTESV HHNSAKMQQS VPMASGSGLP KLRKEEEWAA KEIHLTDDKH  600
NVFELPKSPY CSSLIALYLQ GNYKLTAIPP QFFRRMALLQ VLDLSHTSIK SLPKSLPKLV  660
SLKKLWLRGC ELFMELSPQV GKLKNLEELY LDETQIMDLP SEIGKLVKLS HLRVSFYHSC  720
GKKKSKSNFV IHPETISVLS QLAELSIEVN PADKRWDDLV EAVVKEVCNS KTLKTLSLHL  780
PKFQLLDNLS LIYPSLSHFS FTVGHRKNRI VSRVPYEVEA EFRNWDKCLK FVNGENIPIE  840
IEAVLKYSSS FFLDNHATAM NLSEFGIKNM KGLKFCLLAE CNKMETLIDG EINDERNEDD  900
QSKSDLGSAE HLLESLEYLS IYYMENLWSI WRGRNRYGCM SKLKFLALHT CPQLRNIFSH  960
TLLENFVNLE EIIVEDCPQV TSLVSHASVK PMMSNKFLPS LKRLLLLYLP GLISISNGLL  1020
IAPKLESIGF YNCPKLKSLS KMELSSKTLK IIKGECQWWE DLNWNETERG TRPDYLMRIF  1080
TPIRNEKDVM TQLTEDRDLL DATIQNEGQQ QGNCGSLLLD YKEERIPGTD VTKSPSSCIL  1140
PSNPLTGTNV TKCPSACILP SNSWTGTDLT NSSSSCILPF NPLRTFDAPK QALSFFSSEK  1200
NKRLEDCYFD QAAEICEVDV DEDEPKAKRS NCTENENKGV IGPVSKTTRG HRVAVRTRSN  1260
SVVLDDGYRW RKYGKKKKSV QGNPHPRCYY KCSTMGCPAK KRFERDYQDT SFLITTYEGV  1320
HNHGCYNMRL YNLHTRLCND HRANYMEDAA DSAKTISPTK GYEDVFEAPI QDIGVQPEYE  1380
GIPEALIRDE SQQSADPQPS ELAIRMSESP PSPIGSTPWS EVYDCDFKEQ ELKDDSDKRL  1440
LCCRRNLSRW KELIRVPSTG LEVPPDDGFS WRKCGQKEIL GSKYPRAYYR CTHRNVQDCM  1500
ATKQVQRSDD DPTIFEITYH GRHTCTLASH VVPSPGPLEN QDHGTSSVRS TYWKVFSMLN  1560
CNTSLVAEPQ PSEQAIRMSD SQPPRIGSTP RSEVHDCDFE EQELKGDSKK RKTPSRWTEL  1620
IRVPSTGLEV PPDDGFSWRK CGQKEVLGTK YPRAYYRCTH RNVQNCWATK QVKRSDDDPT  1680
IFEITYCGRH TCTLASHVVP SPGPLKNQDQ GTCSVLSTYC KAFSMLNCNT SLVAEAQPSE  1740
QAIRMSESQP PRIGSTPQSE VHDCDFEEQE LEDDSKKRKT LSRWTELIRV PSTGLEVPPD  1800
DGFSWRKYEQ KEILGSKYTR SY
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
116041612LKGDSKKRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G11070.14e-38WRKY family protein