PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0000983.1_g520.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family HSF
Protein Properties Length: 380aa    MW: 43001.9 Da    PI: 5.2441
Description HSF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0000983.1_g520.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HSF_DNA-bind99.33.6e-31441562102
                                HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTT........... CS
               HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekksks........... 74 
                                Fl+k++++++d+++++++sws+ g sfvv+d+++f  ++Lp+yFkh+nf+SFvRQLn+Y    + + ++   s           
  Pav_sc0000983.1_g520.1.mk  44 FLTKTFDMVDDPSTNRIVSWSRGGGSFVVWDPHTFVMNLLPRYFKHNNFSSFVRQLNTYVSVFTPKLKSLIGSgisddctgfrk 127
                                9**********************************************************8666666665544466788777665 PP

                                .XTTSEEEEESXXXXXXXXXXXXXXXXXX CS
               HSF_DNA-bind  75 .kekiweFkhksFkkgkkellekikrkks 102
                                 +++ weF+++ F +g+k+ll++ikr+k+
  Pav_sc0000983.1_g520.1.mk 128 vDPDRWEFANEGFVRGQKHLLKNIKRRKT 156
                                449************************96 PP

Sequence ? help Back to Top
Protein Sequence    Length: 380 aa     Download sequence    
MNYLYPVKEE FPGSSSSQSG PGDPVVMIPP QPMEGLNDTG PPPFLTKTFD MVDDPSTNRI  60
VSWSRGGGSF VVWDPHTFVM NLLPRYFKHN NFSSFVRQLN TYVSVFTPKL KSLIGSGISD  120
DCTGFRKVDP DRWEFANEGF VRGQKHLLKN IKRRKTPSQP LPAQQALGPC VEVGQFGLDG  180
EIDRLRRDKQ VLMMELVKLR QQQQNTRAYL QAMEQRLQGT EMKQQQMMAF LARAMQNPAF  240
MQQLVQQKDK RKELEEAMTK KRRRPIDQGP SGVGGGKSTL KGKGTNLIKC EPLEFGDCDY  300
EMSELEALAL EIQGFGKARK EQDEELEPLQ SGKELDEGFW EELFSERFEG DFSIPSAIEG  360
EDEDVIILAD RLGYLGSCPK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1148154LKNIKRR
2250264KRKELEEAMTKKRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22830.11e-123HSF family protein