PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG037635t1
Common Name TCM_037635
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HSF
Protein Properties Length: 471 aa    MW: 53039 Da    pI: 4.4436
Description HSF family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG037635t1  genome  CGD     View CDS
Signature Domain
No.  Domain        Score  E-value  Start  End  HMM Start  HMM End
1    HSF_DNA-bind  111.5  5.9e-35  131    222  2          102
                       HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXXXXXXXX CS
      HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkkgkkell 94 
                       Fl+k++e++ed+e++ ++sws n nsf+v+d+++f++++LpkyFkh+nf+SF+RQLn+YgF+k+++++         weF++k F+ gkk+ll
  Thecc1EG037635t1 131 FLRKTFEMVEDPETDPIVSWSVNRNSFIVWDSHKFSENLLPKYFKHKNFSSFIRQLNTYGFRKIDSDR---------WEFANKGFQGGKKHLL 214
                       9********************999*****************************************999.........**************** PP

                       XXXXXXXX CS
      HSF_DNA-bind  95 ekikrkks 102
                       ++i+r+++
  Thecc1EG037635t1 215 KNIERRSR 222
                       *****986 PP
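
The Start and End values in the Signature Domain table can be cross-checked against the alignment itself. The following is a minimal Python sketch, assuming the two query rows above are transcribed by hand as (start, aligned text, end) tuples; `ROWS` and `ungapped` are illustrative names and not part of PlantTFDB or HMMER.

```python
# Query rows of the HSF_DNA-bind alignment, transcribed from this record
# as (start, aligned text, end). Dashes are alignment gaps.
ROWS = [
    (131,
     "FLRKTFEMVEDPETDPIVSWSVNRNSFIVWDSHKFSENLLPKYFKHKNFS"
     "SFIRQLNTYGFRKIDSDR---------WEFANKGFQGGKKHLL",
     214),
    (215, "KNIERRSR", 222),
]


def ungapped(rows):
    """Join the query rows after removing gaps, checking each row's span."""
    parts = []
    for start, aligned, end in rows:
        segment = aligned.replace("-", "")
        expected = end - start + 1
        if len(segment) != expected:
            raise ValueError(
                f"row {start}-{end}: {len(segment)} residues, expected {expected}"
            )
        parts.append(segment)
    return "".join(parts)


domain = ungapped(ROWS)
print(len(domain), domain[:10], domain[-8:])
```

With the rows as transcribed, the gap-free segment is 92 residues long, which is consistent with a domain spanning positions 131 to 222.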

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.10  6.8E-37   124    215  IPR011991    Winged helix-turn-helix DNA-binding domain
SuperFamily      SSF46785          1.77E-32  126    220  IPR011991    Winged helix-turn-helix DNA-binding domain
SMART            SM00415           4.0E-55   127    220  IPR000232    Heat shock factor (HSF)-type, DNA-binding
PRINTS           PR00056           2.2E-18   131    154  IPR000232    Heat shock factor (HSF)-type, DNA-binding
Pfam             PF00447           1.2E-30   131    220  IPR000232    Heat shock factor (HSF)-type, DNA-binding
PRINTS           PR00056           2.2E-18   169    181  IPR000232    Heat shock factor (HSF)-type, DNA-binding
PROSITE pattern  PS00434           0         170    194  IPR000232    Heat shock factor (HSF)-type, DNA-binding
PRINTS           PR00056           2.2E-18   182    194  IPR000232    Heat shock factor (HSF)-type, DNA-binding
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 471 aa
MVVPDSGGGE EGFGLSCTTS MLKESKKMED EPDNENQSES NIVVKEEPVA APAAVTESSA  60
ATCGGEDQVA LLKSVKEEHG EEDDEEMGMV DDMMNGGDCN KISNNGSSSS SSSDVSPNPI  120
EGLHESGPPP FLRKTFEMVE DPETDPIVSW SVNRNSFIVW DSHKFSENLL PKYFKHKNFS  180
SFIRQLNTYG FRKIDSDRWE FANKGFQGGK KHLLKNIERR SRYNKQQQGG VICANSSTSF  240
GLETELEILK KDQSALQLEV LKLRQQQEES NHQLSVFEER IRFSECRQQQ MCNFFVKIAK  300
FPNFIQQLIQ KRKQQKKELD EGEFSKKRKL LETQVTKSLP EAMGTDQSVK CSNQVDQERL  360
ESMQPDEFSK YLPDGMENNN QMENEFSASM EDGLCCSLQD QKSSVPEMSS VYHVMSENLL  420
GESSIVDNVT NEELSVNDSK IYLELEDLIN WKPCSWGGFA SELVEQTGCV *
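
The sequence above is printed in blocks of ten residues with a running position counter at the end of each row and a trailing `*` for the stop. A minimal Python sketch for turning that layout into a plain string is given below; `RAW` and `parse_blocked_sequence` are illustrative names, and the regular expression assumes residues are the only alphabetic characters in the block.

```python
import re

# Sequence block copied from this record. The number closing each row is a
# running residue count; the final "*" marks the stop.
RAW = """\
MVVPDSGGGE EGFGLSCTTS MLKESKKMED EPDNENQSES NIVVKEEPVA APAAVTESSA  60
ATCGGEDQVA LLKSVKEEHG EEDDEEMGMV DDMMNGGDCN KISNNGSSSS SSSDVSPNPI  120
EGLHESGPPP FLRKTFEMVE DPETDPIVSW SVNRNSFIVW DSHKFSENLL PKYFKHKNFS  180
SFIRQLNTYG FRKIDSDRWE FANKGFQGGK KHLLKNIERR SRYNKQQQGG VICANSSTSF  240
GLETELEILK KDQSALQLEV LKLRQQQEES NHQLSVFEER IRFSECRQQQ MCNFFVKIAK  300
FPNFIQQLIQ KRKQQKKELD EGEFSKKRKL LETQVTKSLP EAMGTDQSVK CSNQVDQERL  360
ESMQPDEFSK YLPDGMENNN QMENEFSASM EDGLCCSLQD QKSSVPEMSS VYHVMSENLL  420
GESSIVDNVT NEELSVNDSK IYLELEDLIN WKPCSWGGFA SELVEQTGCV *
"""


def parse_blocked_sequence(text: str) -> str:
    """Keep residue letters and '*'; drop spaces, newlines and counters."""
    return "".join(re.findall(r"[A-Za-z*]+", text))


seq_with_stop = parse_blocked_sequence(RAW)
protein = seq_with_stop.rstrip("*")  # residues only, stop symbol removed

print(len(seq_with_stop), len(protein))
```

As printed here, the block holds 470 residues plus the stop symbol, which would account for the stated length of 471 if the `*` is counted.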
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5d5u_B  1e-22    122          220        17         129      Heat shock factor protein 1
5d5v_B  1e-22    122          220        17         129      Heat shock factor protein 1
5d5v_D  1e-22    122          220        17         129      Heat shock factor protein 1
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    310    328  KRKQQKKELDEGEFSKKRK
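
The NLS coordinates can be checked against the protein sequence in this record. The sketch below is a minimal Python example that searches only the sequence row covering residues 301 to 360; `ROW`, `ROW_OFFSET` and `locate` are illustrative names, and the offset of 300 is taken from the running counter printed beside the sequence.

```python
# Sequence row for residues 301-360, copied from this record without spaces.
ROW = "FPNFIQQLIQKRKQQKKELDEGEFSKKRKLLETQVTKSLPEAMGTDQSVKCSNQVDQERL"
ROW_OFFSET = 300  # number of residues that precede this row
NLS = "KRKQQKKELDEGEFSKKRK"


def locate(motif: str, fragment: str, offset: int) -> tuple[int, int]:
    """Return the 0-based inclusive (start, end) of motif in the full protein."""
    idx = fragment.find(motif)
    if idx < 0:
        raise ValueError("motif not found in fragment")
    start0 = offset + idx
    return start0, start0 + len(motif) - 1


start0, end0 = locate(NLS, ROW, ROW_OFFSET)
print("0-based:", start0, end0)
print("1-based:", start0 + 1, end0 + 1)
```

Against the sequence as printed here, the listed Start 310 and End 328 coincide with 0-based indexing, while 1-based counting would place the same motif at 311 to 329. Whether the database intends 0-based coordinates for this table is an assumption, not something the record states.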
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017982155.1  0.0      PREDICTED: heat stress transcription factor A-2
TrEMBL  A0A061GLI8      0.0      A0A061GLI8_THECC; Heat shock transcription factor A2, putative
STRING  EOY30411        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM74981940
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G26150.1  3e-79    heat shock transcription factor A2
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]