PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG014419t1
Common NameTCM_014419
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family WRKY
Protein Properties Length: 1079aa    MW: 122039 Da    PI: 6.0659
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG014419t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1WRKY78.38.7e-25653710259
                       --SS-EEEEEEE--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS- CS
              WRKY   2 dDgynWrKYGqKevkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnhe 59 
                       +Dgy+WrKY  K +kg++++r+YY+C ++gCpvkk v r+++d+++++ tYeg Hnhe
  Thecc1EG014419t1 653 EDGYRWRKYRTKIIKGNPHSRNYYKCLTRGCPVKKMVARDSQDTSFLVLTYEGIHNHE 710
                       8********************************************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF520581.6E-2912191IPR032675Leucine-rich repeat domain, L domain-like
Gene3DG3DSA:3.80.10.105.4E-2235191IPR032675Leucine-rich repeat domain, L domain-like
PfamPF138556.7E-851111IPR001611Leucine-rich repeat
Gene3DG3DSA:3.80.10.103.8E-8251485IPR032675Leucine-rich repeat domain, L domain-like
SuperFamilySSF520581.6E-29343488IPR032675Leucine-rich repeat domain, L domain-like
Gene3DG3DSA:2.20.25.802.0E-26640712IPR003657WRKY domain
PROSITE profilePS5081127.944647712IPR003657WRKY domain
SuperFamilySSF1182901.7E-22647712IPR003657WRKY domain
SMARTSM007743.9E-26652711IPR003657WRKY domain
PfamPF031063.7E-20653710IPR003657WRKY domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0005515Molecular Functionprotein binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1079 aa     Download sequence    Send to blast
MHTSTRMQQS VLNESGSGLT QLQKEEVWAG KRVHFMNDNK VSELPPSPNC PSLIELYLQW  60
NYELTAIPPL FFQRIALLQV LDLSHTSIKC LPKSLPKLVA LKKLLLRCCQ LFMELSPLVG  120
KLSNLEELDL DETQIMDMPR EIGKLLKLRH LRVSFYQICG KKKSKLNIVI HPETISNLSQ  180
LTLLSIDVNP TDKRWDDSVE AVVKEVCNSK TLMTLSLYLP KFQLLDCISS LYPSLSGFRF  240
IVGHHKRRII SRVPREVEAE FRNWDKCLKF VNGENIPIEI KGVLKYSTSF FLDQHATALN  300
LSEFGIENMK RLKFCLLVDC NKMETIIDGE RHYDGNEDDP SESDPSPVEN VLESLEYLSI  360
YYMENLGSIW RGTSHYGCMS KLKFLALHTC PRLINIFSHT LLGNFVNLEE FILEDCPLVT  420
SLVSHASVKP MMADKFLPSL KRLLLLYLPE LVSISYGLLI APKLETIGFY NCPKLKSISK  480
MELSSKTLKI IKGELQWWED MKWNEAEWGN RPDYLVHIFS PIDKEKDVMT QLAEDGDLFE  540
ATMQNEGQQL GNCGSLLSDY MEETVTGTDV TESISSAPKQ AWSFSSEKNK RLEDDYFDLA  600
PETGDVDDDE DGPKDKRWNC TENENKGVIG FASKIVSGDR TEFRKKTNME ILEDGYRWRK  660
YRTKIIKGNP HSRNYYKCLT RGCPVKKMVA RDSQDTSFLV LTYEGIHNHE QPFFKVLNNV  720
EDAAEAAKTI SPTKGYRDVL EASIQDKGLH SGIPEASVRD ESQQSASRNS RMVSELSRMR  780
LQDGYVSYPW ERRMRDVLSV PNSSCFLSIL LLPKASDRVA SQYNDLEDTF TRANAWLNAS  840
QASGVPIVFM NIQTESLLTK FSGETASSAV NAGSLSDLSN LANASLYGFE DYHGVDIGVV  900
RAVRLWYAPL AGEIPIEIKL KEDDTKLGFA ISRTEEGFIY ISSVMDDDEN VPSTRSGLSN  960
LYKESVSASR LLVVSRLSNQ KVLPWMVSST GAVRCFDTVS LSQKLSLLRH VHMPILMHVF  1020
LWDQSVVSRG FGSARLRIPS PSVLPLPPKV RLAHQPNDDN QILPLPPEEP NESIVTGE*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
2ayd_A3e-21640712274WRKY transcription factor 1
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021285514.10.0disease resistance protein At4g27190-like
TrEMBLA0A061FXH20.0A0A061FXH2_THECC; Uncharacterized protein
STRINGEOY222000.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM1975946
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G26170.12e-25WRKY DNA-binding protein 50
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]