PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0000108.1_g800.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family HSF
Protein Properties Length: 512aa    MW: 55169.1 Da    PI: 4.415
Description HSF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0000108.1_g800.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HSF_DNA-bind803.6e-25421262102
                                HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESX CS
               HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhks 85 
                                Fl+k+y++++d+++++++sws ++nsfvv+++ efa+++LpkyFkh+nf+SF       gF+kv+ ++         weF+++ 
  Pav_sc0000108.1_g800.1.mk  42 FLSKTYDMVDDPATDQVVSWSPTNNSFVVWNPPEFARDLLPKYFKHNNFSSF-------GFRKVDPDR---------WEFANEG 109
                                9********************999***************************9.......9*****999.........******* PP

                                XXXXXXXXXXXXXXXXX CS
               HSF_DNA-bind  86 Fkkgkkellekikrkks 102
                                F +g+k+ll++i+r+k 
  Pav_sc0000108.1_g800.1.mk 110 FLRGQKHLLKSINRRKP 126
                                **************986 PP

Sequence ? help Back to Top
Protein Sequence    Length: 512 aa     Download sequence    
MGGANNNGDE ASMAGGGGGG AQQAGLAPAP APLSNSNAPP PFLSKTYDMV DDPATDQVVS  60
WSPTNNSFVV WNPPEFARDL LPKYFKHNNF SSFGFRKVDP DRWEFANEGF LRGQKHLLKS  120
INRRKPAHGH SHQQPQPSQG QNSVAACVEV GKFGLEEEVE RLKRDKNVLM QELIKLRQQQ  180
QSTDNQLQAM VQRLQGMEQR QQQMMSFLAK AVQSPSFLTQ FVQQQNESNR RIIEVNKKRR  240
LKQDEGGHSG TPDGQIVKYQ PPVNEAAKAM LRQIMTTDTS SSQLESFNDS PDNFLIGNGS  300
SSSSSLIDSG SSSSRASGVT LQEVPLISGP GSSSAISEVQ SSLRAANSGT VTGAPFSDIN  360
ALVGAQEAQS IPISQAGVII PELSQIPEMV PESLVDFPEE NMAPDAGVGF IENMASDAGD  420
GFIGDILGLD GSMPIDIDSI PPDPDIEALL KNWDQFLQSP EPDEMDSTSA EGGVPMGNEE  480
QPSTGNGWNK TQHNMDNLTE KMGLLTSDTK GV
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1236243NKKRRLKQ
2237242KKRRLK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G32330.11e-119HSF family protein