PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PCP005520.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family HSF
Protein Properties Length: 551 aa    MW: 60726.8 Da    pI: 4.7133
Description HSF family protein
Gene Model
Gene Model ID  Type    Source
PCP005520.1    genome  GDR
Signature Domain
No.  Domain        Score  E-value  Start  End  HMM Start  HMM End
1    HSF_DNA-bind  98.3   7.3e-31  43     166  2          102
                   HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHT.T.EEE---SSBTTTT.....................X CS
  HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmY.g.Fkkvkdeekksks.....................k 75 
                   Fl k+y++++d++++ ++sw+ ++nsfvv+++ efa+++LpkyFkh+nf+SFvRQLn+Y g +++v+ + k+  +                     +
   PCP005520.1  43 FLCKTYDMVDDPATDPVVSWTPTNNSFVVWKPPEFARDLLPKYFKHNNFSSFVRQLNTYvGlWTMVTLRAKHWGDldsgyetllvpymskmvmqkvD 139
                   9********************999***********************************64356665555554449999999999999887776544 PP

                   TTSEEEEESXXXXXXXXXXXXXXXXXX CS
  HSF_DNA-bind  76 ekiweFkhksFkkgkkellekikrkks 102
                   ++ weF+h+ F +g+k+ll++i+r+k 
   PCP005520.1 140 PDRWEFAHEGFLRGQKDLLKSINRRKP 166
                   9***********************986 PP

Sequence
Protein Sequence    Length: 551 aa
MGGADNNNRD ETSVAAGGQQ AASGSSSAPA PAPLLKSNAP PPFLCKTYDM VDDPATDPVV  60
SWTPTNNSFV VWKPPEFARD LLPKYFKHNN FSSFVRQLNT YVGLWTMVTL RAKHWGDLDS  120
GYETLLVPYM SKMVMQKVDP DRWEFAHEGF LRGQKDLLKS INRRKPAHGH SHQQPQPSHG  180
QNSMGACVEV GKFGLEEEVE RLKRDKNVLM QELIRLRQQQ QYTDNQLQTM LQRLQGMEQR  240
QQQMVSFLAK AMHSPGLFTQ FVQQNESSRR ITEVNKKRRL KKEGIAESEH STTPDGQIVK  300
YQPPMNEAAK AMLRQIMNRD ISSRLESSNN NFDNFLIGDG SSLSSSAVNS GSSSSRTSGV  360
TLQEVPLASG HGLPSVISET HSSPRVTNSG TVMRSLFSDV NALVGAQESP SIPIPQTNVI  420
IPELSQIPDM PPEGLVDIPE ESMAGEDTGV GFIENMASEA GDGYIDPSPE VLNGSLAIDF  480
DNISSDIEAF LKDWDDIIQN PGADEMDSTC AEVGLRIENE EQQPTENGDK TPNMDNLTEK  540
MGLLTSDTKG V
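The listing above groups residues in tens and ends each full line with a running residue count, so it has to be cleaned before it can be used as a plain sequence string. The following is a minimal sketch of that cleanup; the function name `parse_sequence_block` is hypothetical and not part of any PlantTFDB tooling.

```python
import re

# Sequence block copied verbatim from the record: ten-residue groups,
# with a running residue count at the end of each full line.
RAW_BLOCK = """
MGGADNNNRD ETSVAAGGQQ AASGSSSAPA PAPLLKSNAP PPFLCKTYDM VDDPATDPVV  60
SWTPTNNSFV VWKPPEFARD LLPKYFKHNN FSSFVRQLNT YVGLWTMVTL RAKHWGDLDS  120
GYETLLVPYM SKMVMQKVDP DRWEFAHEGF LRGQKDLLKS INRRKPAHGH SHQQPQPSHG  180
QNSMGACVEV GKFGLEEEVE RLKRDKNVLM QELIRLRQQQ QYTDNQLQTM LQRLQGMEQR  240
QQQMVSFLAK AMHSPGLFTQ FVQQNESSRR ITEVNKKRRL KKEGIAESEH STTPDGQIVK  300
YQPPMNEAAK AMLRQIMNRD ISSRLESSNN NFDNFLIGDG SSLSSSAVNS GSSSSRTSGV  360
TLQEVPLASG HGLPSVISET HSSPRVTNSG TVMRSLFSDV NALVGAQESP SIPIPQTNVI  420
IPELSQIPDM PPEGLVDIPE ESMAGEDTGV GFIENMASEA GDGYIDPSPE VLNGSLAIDF  480
DNISSDIEAF LKDWDDIIQN PGADEMDSTC AEVGLRIENE EQQPTENGDK TPNMDNLTEK  540
MGLLTSDTKG V
"""


def parse_sequence_block(raw: str) -> str:
    """Return the residues as one uppercase string.

    Everything that is not a letter (spaces, newlines, position counters)
    is discarded.
    """
    return re.sub(r"[^A-Za-z]", "", raw).upper()


sequence = parse_sequence_block(RAW_BLOCK)
# The residue count should agree with the "Length: 551 aa" field of the record.
print(len(sequence))
```

Comparing the printed count with the stated length is a cheap way to detect a truncated or mis-pasted listing.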
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    275    282  NKKRRLKK
2    276    281  KKRRLK
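The Start and End values appear to be 1-based, inclusive positions in the protein sequence. The sketch below checks that reading against the sequence row covering residues 241 to 300. The helper `slice_1based` is a hypothetical name introduced for illustration, and the coordinate convention is an assumption tested here rather than something the record states.

```python
# Residues 241-300 of the protein, copied from the sequence listing
# (spaces between the ten-residue groups removed).
ROW = "QQQMVSFLAKAMHSPGLFTQFVQQNESSRRITEVNKKRRLKKEGIAESEHSTTPDGQIVK"
ROW_START = 241  # 1-based position of the first residue in ROW


def slice_1based(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive, protein coordinates)
    from a fragment whose first residue sits at position fragment_start."""
    lo = start - fragment_start        # convert to a 0-based offset
    hi = end - fragment_start + 1      # inclusive end -> exclusive slice bound
    if lo < 0 or hi > len(fragment) or lo >= hi:
        raise ValueError("requested range lies outside the fragment")
    return fragment[lo:hi]


# NLS entries as listed in the record: (start, end, sequence).
NLS_HITS = [(275, 282, "NKKRRLKK"), (276, 281, "KKRRLK")]

for start, end, listed in NLS_HITS:
    found = slice_1based(ROW, ROW_START, start, end)
    print(start, end, found, found == listed)
```

Both listed motifs match under this convention, and the second signal lies entirely inside the first.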
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G32330.1  1e-103   HSF family protein