PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PHT55852.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family HSF
Protein Properties Length: 404aa    MW: 45693.7 Da    PI: 5.834
Description HSF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PHT55852.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HSF_DNA-bind1118.4e-35141072102
                   HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESX..XXXXXXXXXXX CS
  HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhks..Fkkgkkellek 96 
                   Fl+k+ye+++d++ + ++sws++++sf+v+++ +f++++Lp+yFkh+nf+SF+RQLn+YgF+k++ e+         weF+++   F +g+ +ll++
    PHT55852.1  14 FLTKTYEMVDDPSCDAIVSWSSSNKSFIVWNPPNFSRDLLPRYFKHNNFSSFIRQLNTYGFRKIDAEK---------WEFANEDnnFIRGQPHLLKN 101
                   9****************************************************************999.........***998778*********** PP

                   XXXXXX CS
  HSF_DNA-bind  97 ikrkks 102
                   i+r+k 
    PHT55852.1 102 IHRRKP 107
                   ***985 PP

Sequence ? help Back to Top
Protein Sequence    Length: 404 aa     Download sequence    
MDEGACSTNA LPPFLTKTYE MVDDPSCDAI VSWSSSNKSF IVWNPPNFSR DLLPRYFKHN  60
NFSSFIRQLN TYGFRKIDAE KWEFANEDNN FIRGQPHLLK NIHRRKPVHS HSAQNLHGLS  120
SPLTESERQG YKEDIQKLKH ENGSLHLDLQ RHEQDHHGLE SQLQVLTERV QHVEHRQKTM  180
LSALARTLGN SVADLSHMPQ LQVNDRKRRL PGNSCLYTES DLEDTRGFSS KALTRKNMNP  240
SSLLTMNTER LDQLESSLTF WEDVLHNVDQ AGIQQNCSLE LDESRSCADS PAISYTQLDI  300
DVGPKASGID MNSEPNANPT PDASEPDDKA AAGTATIVPT GVNDVFWEQF LTENPVSEVQ  360
SERKDIASKK NENKPVDSAK YWWNMKSVNS LAEQLGHLTP AEKT
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1205210DRKRRL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G18880.11e-106HSF family protein