PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cz04g32130.t1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Sphaeropleales; Chromochloridaceae; Chromochloris
Family HSF
Protein Properties Length: 606aa    MW: 64110.3 Da    PI: 6.0257
Description HSF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cz04g32130.t1genomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HSF_DNA-bind1118.4e-35301222103
                    HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXXXXXXXXXXX CS
   HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkkgkkelleki 97 
                    Fl+k+y+++ed++++ ++sw  +g+sf+v+++ ef++++Lp++Fkh+nf+SFvRQLn+YgF+kv+ ++         weF++++F +g+kell++i
  Cz04g32130.t1  30 FLTKTYDLVEDPSTDPIVSWAPDGHSFIVWKPPEFSRDLLPRHFKHNNFSSFVRQLNTYGFRKVDPDR---------WEFANDHFIRGRKELLREI 116
                    9****************************************************************999.........******************* PP

                    XXXXXX CS
   HSF_DNA-bind  98 krkkse 103
                    +r+k +
  Cz04g32130.t1 117 HRRKPS 122
                    ***975 PP

Sequence ? help Back to Top
Protein Sequence    Length: 606 aa     Download sequence    
MSGMRRPGSA SSSGGGHSAD LSAANQPPPF LTKTYDLVED PSTDPIVSWA PDGHSFIVWK  60
PPEFSRDLLP RHFKHNNFSS FVRQLNTYGF RKVDPDRWEF ANDHFIRGRK ELLREIHRRK  120
PSATPTNLAS ALMPQGQTAI ELGNYGGMQD EIDALKRDKN VLMLELVRLR QQQQTADSRI  180
RDLQLRLDST EARQNTIVNF LARVAQNPTV LQQMVSVAQS AGLQRLSSGR SGANRKKRRA  240
RGGDEDSDGS GGPAPEDNHA QIIQYTPNHN GDFTEALLRS MASLIPVSPA GNTAASSFDL  300
SSPFNTMHLG QPLHRHGAEV DIGELAADAD RQLNAMKIDS DLPLDAHQQQ QAVAPASVTI  360
QEQPASSSFH LNSVPSLGAD ATVAMSSALP ANGDSPTAAA IASSPFITND SMPQQQQNHY  420
PPAAAASTSI PGLPNISSPI GAAPAAAVAA PLQPSGGSGF LGGPMPAGMT TVINPLGQAM  480
SVPGPSAAPP AGNPVVSMPP NGTFLPASHS QPSTSAAAVV AVDSKTRVDS PDDDDLDLPM  540
DLINGLQSMG STDLMLAEDL SKDELWSSLF GSQSSDGIAD FSNLHLADGA LPAHAPHHHH  600
VHRQP*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1235241RKKRRAR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G17750.13e-75HSF family protein