PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr4g0387751
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family HSF
Protein Properties Length: 350aa    MW: 40432.6 Da    PI: 5.1567
Description HSF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr4g0387751genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HSF_DNA-bind117.77e-37271192103
                             HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXX CS
            HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkk 88 
                             Fl+k+yei+ed+++++++sws+ +nsfvv+d++ fa ++LpkyFkh nf+SFvRQLn+YgF+kv++++         weF+h+ F +
  RcHm_v2.0_Chr4g0387751  27 FLNKTYEIVEDSTTNHIVSWSKANNSFVVWDPQAFAISLLPKYFKHGNFSSFVRQLNTYGFRKVDTDK---------WEFAHEVFLR 104
                             9****************************************************************999.........********** PP

                             XXXXXXXXXXXXXXX CS
            HSF_DNA-bind  89 gkkellekikrkkse 103
                             g+k+ll++i+r++++
  RcHm_v2.0_Chr4g0387751 105 GQKHLLKNIRRRRTS 119
                             ***********9876 PP

Sequence ? help Back to Top
Protein Sequence    Length: 350 aa     Download sequence    
MNPQGQVKEA PLPVPLERLN DQGPPPFLNK TYEIVEDSTT NHIVSWSKAN NSFVVWDPQA  60
FAISLLPKYF KHGNFSSFVR QLNTYGFRKV DTDKWEFAHE VFLRGQKHLL KNIRRRRTSH  120
HNVHYGSKQD LDSCIEVGSF GSLDGEIDQL RRDKEVLMGE LVKLRQQQQT ARLHLHGMED  180
RLKRTEMKQQ QMMNFLARAM QNPNFLQQLV QKKERKKQLE EAITKKRRRT IEGPSSIEVG  240
ALGQGGGEAV VKVEPEYYGD ISELDVSGLD NFAMDMPVTD ENEKVHDGEE CMEKEEEYER  300
RGKDLSEVFW KEFLNEGVDE EIEVLGVQEE DEEDVSVLIE QLGCLVSKPK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1104117RGQKHLLKNIRRRR
2215229RKKQLEEAITKKRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22830.11e-120HSF family protein