PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG004951t1
Common NameTCM_004951
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family WRKY
Protein Properties Length: 512aa    MW: 55234.8 Da    PI: 8.7675
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG004951t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1WRKY98.15.8e-31269327259
                       --SS-EEEEEEE--TT-SS-EEEEEE-ST.T---EEEEEE-SSSTTEEEEEEES--SS- CS
              WRKY   2 dDgynWrKYGqKevkgsefprsYYrCtsa.gCpvkkkversaedpkvveitYegeHnhe 59 
                       +Dg++WrKYGqK+ kg+++pr+YYrCt+a gCpv+k+v+r+aed++++++tYeg+Hnh+
  Thecc1EG004951t1 269 SDGCQWRKYGQKMAKGNPCPRAYYRCTMAvGCPVRKQVQRCAEDKSILITTYEGNHNHP 327
                       7*********************************************************7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.20.25.801.8E-33253329IPR003657WRKY domain
SuperFamilySSF1182905.89E-28261329IPR003657WRKY domain
PROSITE profilePS5081129.964263329IPR003657WRKY domain
SMARTSM007742.4E-37268328IPR003657WRKY domain
PfamPF031066.7E-27270327IPR003657WRKY domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 512 aa     Download sequence    Send to blast
MEKQHRRELK FLRSGDFLRP NSGVPDRTLD DSSDHVKPTI KEMDFFSTNS QPHDPLQESK  60
INNGSSSLFD SGVNTGLNLL SSSPRVSRTT NEETPNSEMR ALRIELQRLH EENRRLRSML  120
DQITKNYNEL QGQLFMAVQK QAHGNQGEQK GAVNGMSSLT ESVQQFMDPR PSTALDVNAP  180
SASDDKTQEL SVSPVNTTEV VSKERDHQMT RIPGKHVSVE DGTDRTSQSW GSPKSPKVEQ  240
SKNEEQVSEV PFRKARVSVR ARSEAPLISD GCQWRKYGQK MAKGNPCPRA YYRCTMAVGC  300
PVRKQVQRCA EDKSILITTY EGNHNHPLPP AATAMANTTS AAAAMLLSGS TTSKDGLSSS  360
GYFPSLPYGS TMATLSASAP FPTITLDLTQ GPNAVPFFRP PPSTATFPLP LQGYPQLLGH  420
SMFSPTKLSA LPVTQLGQRP ASMVDTVTAA IASDPNFTAA LAAAISTIMG APPSNSGNTI  480
NNGGNSNASN RVPGLPGSPQ LPQSCTTFST N*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
2ayd_A2e-21255331176WRKY transcription factor 1
Search in ModeBase
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00115ampDAPTransfer from AT4G01720Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017969824.10.0PREDICTED: probable WRKY transcription factor 47
TrEMBLA0A061DRV70.0A0A061DRV7_THECC; WRKY family transcription factor, putative
STRINGEOX954650.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM71562743
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G01720.11e-103WRKY family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]