PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_D12G1827
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family WRKY
Protein Properties Length: 1490aa    MW: 168199 Da    PI: 6.9854
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_D12G1827genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1WRKY85.93.6e-27668726159
                  ---SS-EEEEEEE--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS- CS
         WRKY   1 ldDgynWrKYGqKevkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnhe 59 
                  ldDgy+WrK+GqK+vkg+++pr YYrC s+ C +kk ver+++d++++++tY+g Hnh+
  Gh_D12G1827 668 LDDGYRWRKWGQKMVKGNPYPRLYYRCLSTCCLAKKYVERDSQDTSFFVTTYHGLHNHD 726
                  59********************************************************7 PP

2WRKY81.11.1e-25785845259
                  --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
         WRKY   2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59 
                  dDg++WrKYGqK++ g+++pr YYrC+     gC +kk+v+rs++dp+++eitY g+H+++
  Gh_D12G1827 785 DDGFCWRKYGQKDILGAKYPRRYYRCANGpsqGCFAKKRVKRSSDDPTIFEITYCGKHTCN 845
                  8*************************976666***************************97 PP

3WRKY45.41.6e-1412611317260
                   --SS-EEEEEEE--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS-- CS
         WRKY    2 dDgynWrKYGqKevkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnhek 60  
                    D  +W+ YG K   g++  +s+YrC+++gC++kk vers  d+k +++  + +Hnh+k
  Gh_D12G1827 1261 ADASTWKCYGTKGLIGNR-RKSFYRCAHPGCQAKKSVERSL-DGKSFIVLSRASHNHPK 1317
                   58889***********97.58********************.***************85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF520581.65E-2933183IPR032675Leucine-rich repeat domain, L domain-like
Gene3DG3DSA:3.80.10.101.1E-1835177IPR032675Leucine-rich repeat domain, L domain-like
PfamPF138551.5E-946105IPR001611Leucine-rich repeat
SMARTSM00369306992IPR003591Leucine-rich repeat, typical subtype
SMARTSM003694.9116139IPR003591Leucine-rich repeat, typical subtype
Gene3DG3DSA:3.80.10.103.2E-6178209IPR032675Leucine-rich repeat domain, L domain-like
Gene3DG3DSA:3.80.10.103.2E-6294478IPR032675Leucine-rich repeat domain, L domain-like
SuperFamilySSF520581.65E-29363474IPR032675Leucine-rich repeat domain, L domain-like
Gene3DG3DSA:2.20.25.808.5E-26661726IPR003657WRKY domain
PROSITE profilePS5081128.151663728IPR003657WRKY domain
SuperFamilySSF1182907.98E-24664726IPR003657WRKY domain
SMARTSM007741.9E-30668727IPR003657WRKY domain
PfamPF031061.3E-20669726IPR003657WRKY domain
SuperFamilySSF1182906.02E-23783846IPR003657WRKY domain
Gene3DG3DSA:2.20.25.803.4E-25783846IPR003657WRKY domain
SMARTSM007741.0E-33784846IPR003657WRKY domain
PROSITE profilePS5081123.112785847IPR003657WRKY domain
PfamPF031061.9E-22785844IPR003657WRKY domain
PfamPF078873.7E-489591100IPR012416CALMODULIN-BINDING PROTEIN60
PROSITE profilePS508111312551318IPR003657WRKY domain
Gene3DG3DSA:2.20.25.802.4E-1212561317IPR003657WRKY domain
SuperFamilySSF1182903.66E-1112591316IPR003657WRKY domain
SMARTSM007745.8E-812601317IPR003657WRKY domain
PfamPF031069.2E-1012621316IPR003657WRKY domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0006950Biological Processresponse to stress
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0005516Molecular Functioncalmodulin binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1490 aa     Download sequence    Send to blast
MLSSVTKARG PGLAEVQQVE WGAKEIHLAD DKHIVSELPR CPNCSSLIAL YLQGNYELTA  60
IPPLFFQRMQ LLQILDLSRT SIKSLPKSLR KLFALKKLLL QGCDLFMELS PQVGKLKNLE  120
ELHLDETQIM GLPKETGKLL KLQLLKVSFY HLCGKKTLKS DILIHPETIS NLSQLAELSI  180
DVNPADKRWD DSVEAVLKEV CNSKTLRTLS LYLPTFQLLD YASLIYPSLS RFRFTVGHHK  240
RRIISRVPHE VEAEFRKWDK CLRFVNGETI PTQIKGVLKY STSFFLDHHT TAMNLSEFGI  300
ENMKGLKFCL LAECNKMETL IDGEMHYERN EDDQSESDPG SVQHMLESLE YLSIYYMEDL  360
QCIWRGADSF VCMSKLKFLA LHACPQLSKI FSLTLLENFI NLEEIIVEDC PRVTSLVSHA  420
SVKPIMSDKI FLPSLKRLLV LYLPELVSIS NGLLIAPKLE SIGCYNCPKL KSISKMELSS  480
KTLKIIKGEC EWWEGMNWNE TEWGDGPGYL MHIFSPIDNQ KDVMTQMVEG RDPHEATIQN  540
EDQQLGDQKP LEVSTQDHRG QCLDYTEERM MGTDVKEPPS GCVFPSNPLC MTYHAPEQAR  600
SFTSGNNRSL EDDECFLVPN IVEVDVDEDE PKAKRWNHTE NENKGVIGSA SKTTRGNRAA  660
NQIRSKVLDD GYRWRKWGQK MVKGNPYPRL YYRCLSTCCL AKKYVERDSQ DTSFFVTTYH  720
GLHNHDEWNS RGLSKLHSQP CIDHRANNVK DAAEAAKSIC PTKEYGDVFQ ASIQDEGAHP  780
DLAPDDGFCW RKYGQKDILG AKYPRRYYRC ANGPSQGCFA KKRVKRSSDD PTIFEITYCG  840
KHTCNVMPPT APSEYQEQGT RIEPQQQHNQ LTEENQKQQS QDLLVLPSTP GQCVEQSFNQ  900
KSNSGNDQLT ITHEDNNSTI VCQESSPSPS DSSSMGSQLS AAALSTEQLQ AQRFDEPAKQ  960
NQLYKKHYPP MLGDEVWRLD RIDKNGIIHK RLASEGINTV QDFLKMWVVN PGELRRILGP  1020
IMSERKLDHA INHARTCVMG NKYYVFRGSN YRILLNPICQ LMGAEVNGSI YPTHSLSNID  1080
TVYLEKLVRQ AYVNWSSLEE IEGISNEIIG PLTQDIMAQR TAANVINTIP SNLPAMPPSG  1140
PWLPELPDHP VLMDNSNVLS SPTTGECVVQ SLNQKNNSGN DQQTISQEDD NSIIVYHAPP  1200
PSHSNSSAMI FELSATTLPI AQVQAQSFEE LAERNQSKGN LQFQACCYEQ DSDLTKSGKA  1260
ADASTWKCYG TKGLIGNRRK SFYRCAHPGC QAKKSVERSL DGKSFIVLSR ASHNHPKSLP  1320
TRTSLSAFSH IRASNHLTLK IPDKSSVTYE GGQMDMDGLV FFVRSVEDET GLQNLMDKKS  1380
DQPSDRHKVV VGLGVRKAKT SFRKRAKTTV FKLSSDTNFR EMVQSFTGKH TTEVQEEKVT  1440
RGIPRKRAWD ANLKEISGND IHCVRENVAD GTSQPDVIQA DHPIIRCIPR
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1wj2_A6e-216677261675Probable WRKY transcription factor 4
2lex_A6e-216677261675Probable WRKY transcription factor 4
Search in ModeBase
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Ghi.255840.0ovule
Expression -- Microarray ? help Back to Top
Source ID E-value
GEO738639900.0
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC2431140.0AC243114.1 Gossypium raimondii clone GR__Ba0041F12-jfm, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016736437.10.0PREDICTED: uncharacterized protein LOC107946563 isoform X1
TrEMBLA0A1U8NBP10.0A0A1U8NBP1_GOSHI; uncharacterized protein LOC107946563 isoform X1
STRINGGorai.008G200800.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM1975946
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G07100.25e-27WRKY DNA-binding protein 26