PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cse_sc010503.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family HSF
Protein Properties Length: 468 aa    MW: 52318 Da    pI: 6.3495
Description HSF family protein
Gene Model
Gene Model ID            Type     Source   Coding Sequence
Cse_sc010503.1_g020.1    genome   Kazusa   View CDS
Signature Domain
No.   Domain         Score   E-value   Start   End   HMM Start   HMM End
1     HSF_DNA-bind   116.4   1.8e-36   27      119   2           103
                            HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXXX CS
           HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkkg 89 
                            Fl k+y++++d+++++++sws+ +n+fvv+d+ efak++LpkyFkh+nf+SFvRQLn+YgF+kv+ ++         weF+++ F +g
  Cse_sc010503.1_g020.1  27 FLVKTYDMVDDPSTDKVVSWSAANNTFVVWDPPEFAKDLLPKYFKHNNFSSFVRQLNTYGFRKVDPDR---------WEFANEGFLRG 105
                            9********************999*****************************************999.........*********** PP

                            XXXXXXXXXXXXXX CS
           HSF_DNA-bind  90 kkellekikrkkse 103
                            +k+ll++i r+ks+
  Cse_sc010503.1_g020.1 106 QKHLLKTIVRRKSA 119
                            ***********985 PP
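
The coordinates printed beside the alignment can be cross-checked: for each query row, the end position should equal the start position plus the number of non-gap residues, minus one. The following is a minimal Python sketch of that check, assuming the two query rows are copied verbatim from the alignment above. The helper name `row_end` is hypothetical and is not part of PlantTFDB or any alignment tool.

```python
def row_end(start, aligned):
    """Return the 1-based end coordinate of an aligned row, skipping '-' gap columns."""
    residues = sum(1 for ch in aligned if ch != "-")
    return start + residues - 1


# Query rows copied from the HSF_DNA-bind alignment: (start coordinate, aligned text).
QUERY_ROWS = [
    (27, "FLVKTYDMVDDPSTDKVVSWSAANNTFVVWDPPEFAKDLLPKYFKHNNFSSFVRQLNTYGFRKVDPDR---------WEFANEGFLRG"),
    (106, "QKHLLKTIVRRKSA"),
]

# Compute the end coordinate implied by each row.
ends = [row_end(start, text) for start, text in QUERY_ROWS]
print(ends)
```

The computed ends can then be compared with the values printed at the right of each query row (105 and 119); the nine-column gap in the first row is why it spans 88 alignment columns but only 79 residues.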

Sequence
Protein Sequence    Length: 468 aa
MEPPRSSNDG AAIPPPPMPA ANAPPPFLVK TYDMVDDPST DKVVSWSAAN NTFVVWDPPE  60
FAKDLLPKYF KHNNFSSFVR QLNTYGFRKV DPDRWEFANE GFLRGQKHLL KTIVRRKSAS  120
GHTQPQQPPH QQTSSMGACV EVGKFGLEEE VERLKRDKNV LMQELVRLRQ QQQTTDNQMQ  180
SMVKRLQGME QRQQQMMSFL AKAVNSPGFL AHFVQQQNES TRLITEGNKK RRYKEDTVLS  240
TVDSPDGRIV KYQPMMNDAA QAMLKQIMKL DSPSSRLNTF SSGVTLQDVT PSPSSQSYLP  300
AVTGVLSAPL EADSEVVAAD HFPDAISLID GQDLPDVSDL SHLQEMAPDA SIESYMHPET  360
PNLPLDIGSF SPEVDVEWDS NLLTEMEKST NNIYLSKQQP PDCKNGIIIS DLSKPQPPPL  420
HRHHHRHHQR PAIRRCHSQM VCCGGDGGGL RWVSTGTIAL EESSSEIG
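
To work with the sequence programmatically, the spaced ten-residue blocks and the running position counters at the line ends have to be removed first. The sketch below shows one way to do that in Python and to confirm the stated length of 468 aa. The function name `parse_sequence` is an assumption made for illustration; the sequence text is copied from the block above.

```python
import re

# Sequence block copied from the record; full lines end with a running residue count.
RAW = """\
MEPPRSSNDG AAIPPPPMPA ANAPPPFLVK TYDMVDDPST DKVVSWSAAN NTFVVWDPPE  60
FAKDLLPKYF KHNNFSSFVR QLNTYGFRKV DPDRWEFANE GFLRGQKHLL KTIVRRKSAS  120
GHTQPQQPPH QQTSSMGACV EVGKFGLEEE VERLKRDKNV LMQELVRLRQ QQQTTDNQMQ  180
SMVKRLQGME QRQQQMMSFL AKAVNSPGFL AHFVQQQNES TRLITEGNKK RRYKEDTVLS  240
TVDSPDGRIV KYQPMMNDAA QAMLKQIMKL DSPSSRLNTF SSGVTLQDVT PSPSSQSYLP  300
AVTGVLSAPL EADSEVVAAD HFPDAISLID GQDLPDVSDL SHLQEMAPDA SIESYMHPET  360
PNLPLDIGSF SPEVDVEWDS NLLTEMEKST NNIYLSKQQP PDCKNGIIIS DLSKPQPPPL  420
HRHHHRHHQR PAIRRCHSQM VCCGGDGGGL RWVSTGTIAL EESSSEIG
"""


def parse_sequence(raw):
    """Join the uppercase residue runs, dropping spaces, newlines, and position counters."""
    return "".join(re.findall(r"[A-Z]+", raw))


seq = parse_sequence(RAW)
print(len(seq))

# Residues 27-119 (1-based, inclusive) are the HSF_DNA-bind region from the domain table.
domain = seq[27 - 1:119]
print(domain)
```

Python slices are 0-based and exclude the end index, so the 1-based inclusive range 27-119 becomes `seq[26:119]`.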
Nuclear Localization Signal
NLS
No.   Start   End   Sequence
1     229     234   KKRRYK
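
The NLS coordinates can be checked against the protein sequence without loading the whole record. The sketch below uses only residues 181-240 (the fourth line of the sequence block, with spaces removed) and a hypothetical helper, `slice_1based`, that converts 1-based inclusive coordinates into a slice of that segment.

```python
# Residues 181-240 of the protein, copied from the fourth line of the sequence block.
SEGMENT_START = 181
SEGMENT = "SMVKRLQGMEQRQQQMMSFLAKAVNSPGFLAHFVQQQNESTRLITEGNKKRRYKEDTVLS"


def slice_1based(segment, segment_start, start, end):
    """Return residues start..end (1-based, inclusive) from a segment
    whose first residue sits at position segment_start of the full protein."""
    lo = start - segment_start
    hi = end - segment_start + 1
    if lo < 0 or hi > len(segment) or lo >= hi:
        raise ValueError("requested range lies outside the segment")
    return segment[lo:hi]


# Start and end taken from the NLS table.
nls = slice_1based(SEGMENT, SEGMENT_START, 229, 234)
print(nls)
```

If the table and the sequence agree, the printed residues match the Sequence column of the NLS table.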
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT1G32330.1   1e-115    HSF family protein