PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID 858789
Common NameARALYDRAFT_658233
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis
Family HSF
Protein Properties Length: 296aa    MW: 34979.8 Da    PI: 7.9215
Description HSF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
858789genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HSF_DNA-bind66.56e-2117103296
                   HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHH.HSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXXXXXXXXXX CS
  HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpk.yFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkkgkkellek 96 
                   F++ +y++++d + +++isws++g+sf+++++eef ++ L++  F+ +n++SF   Ln  gF+k+++ +         weF++++F +g+ +l+++
        858789  17 FITTTYDMVDDLSSDSIISWSQSGKSFIIWNPEEFYNNFLQRfCFQGDNINSFFSYLNSHGFRKIDSGN---------WEFANDNFVRGQPHLINN 103
                   9***************************************9977*********************9988.........*************99876 PP

2HSF_DNA-bind59.58.8e-191522432102
                   HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXXXXXXXXXXXX CS
  HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkkgkkellekik 98 
                   F +klye+++d++ + +isws++g+sf++++++ef k++L ++  + ++  F  +L+ + Fkk++ ++         weF++++F +g+ +l+e i 
        858789 152 FPTKLYEMVDDPSSDAIISWSQSGRSFIIWNPKEFCKDLLRRFSNTLHIPLFFHKLQRFSFKKIDPKK---------WEFANDNFVRGQCHLVEIII 239
                   899*************************************************************9998.........**************999887 PP

                   XXXX CS
  HSF_DNA-bind  99 rkks 102
                    +++
        858789 240 SNEK 243
                   6655 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.104.4E-258101IPR011991Winged helix-turn-helix DNA-binding domain
SuperFamilySSF467855.03E-2313103IPR011991Winged helix-turn-helix DNA-binding domain
SMARTSM004151.8E-3713107IPR000232Heat shock factor (HSF)-type, DNA-binding
PRINTSPR000561.7E-71740IPR000232Heat shock factor (HSF)-type, DNA-binding
PfamPF004471.2E-1917103IPR000232Heat shock factor (HSF)-type, DNA-binding
PRINTSPR000561.7E-76981IPR000232Heat shock factor (HSF)-type, DNA-binding
Gene3DG3DSA:1.10.10.101.4E-25143235IPR011991Winged helix-turn-helix DNA-binding domain
SMARTSM004158.1E-27148241IPR000232Heat shock factor (HSF)-type, DNA-binding
SuperFamilySSF467859.25E-23149238IPR011991Winged helix-turn-helix DNA-binding domain
PfamPF004471.0E-18152237IPR000232Heat shock factor (HSF)-type, DNA-binding
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 296 aa     Download sequence    Send to blast
MLNLNENEGS STSISNFITT TYDMVDDLSS DSIISWSQSG KSFIIWNPEE FYNNFLQRFC  60
FQGDNINSFF SYLNSHGFRK IDSGNWEFAN DNFVRGQPHL INNTISCVIE GRVLYDQSMD  120
MFKVRKLFER QVKEVEDQLP PHNSYPTSKR PFPTKLYEMV DDPSSDAIIS WSQSGRSFII  180
WNPKEFCKDL LRRFSNTLHI PLFFHKLQRF SFKKIDPKKW EFANDNFVRG QCHLVEIIIS  240
NEKEKIDQLL KRYDRQKKLG EARELFKLQI EEMKKTKEVK EQEVRLQHHI GLCKL*
Cis-element ? help Back to Top
SourceLink
PlantRegMap858789
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAB4936840.0AB493684.1 Arabidopsis thaliana At4g18870 mRNA for hypothetical protein, partial cds, clone: RAAt4g18870.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002870020.10.0heat stress transcription factor C-1
TrEMBLD7M9E40.0D7M9E4_ARALL; Predicted protein
STRINGAl_scaffold_0007_22880.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM2017156
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G18870.11e-131HSF family protein