PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Rsa1.0_00008.1_g00039.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Brassiceae; Raphanus
Family HSF
Protein Properties Length: 2561aa    MW: 288751 Da    PI: 8.4397
Description HSF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Rsa1.0_00008.1_g00039.1genomeRGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HSF_DNA-bind112.52.8e-35212822192102
                               HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESX CS
             HSF_DNA-bind    2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhks 85  
                               Fl+k+y++++d++++e++sws+ +nsfvv++  ef+k+ LpkyFkh+nf+SFvRQLn+YgF+kv+ ++         weF+++ 
  Rsa1.0_00008.1_g00039.1 2128 FLSKTYDMVDDPSTDEVVSWSSGSNSFVVWNVPEFSKQFLPKYFKHNNFSSFVRQLNTYGFRKVDPDR---------WEFANEG 2202
                               9****************************************************************999.........******* PP

                               XXXXXXXXXXXXXXXXX CS
             HSF_DNA-bind   86 Fkkgkkellekikrkks 102 
                               F kg+k+ll++i r+k 
  Rsa1.0_00008.1_g00039.1 2203 FLKGQKQLLKSIIRRKP 2219
                               ************99985 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF119607.5E-231067IPR021863Fatty acid desaturase, N-terminal
PfamPF141118.6E-28159287IPR025558Domain of unknown function DUF4283
PfamPF143921.3E-6301344IPR025836Zinc knuckle CX2CX4HX4C
PfamPF033726.2E-8752961IPR005135Endonuclease/exonuclease/phosphatase
Gene3DG3DSA:3.60.10.106.7E-12758965IPR005135Endonuclease/exonuclease/phosphatase
SuperFamilySSF562192.09E-23759965IPR005135Endonuclease/exonuclease/phosphatase
PROSITE profilePS5087812.25812091500IPR000477Reverse transcriptase domain
CDDcd016501.43E-5212271499No hitNo description
PfamPF000782.4E-3812291469IPR000477Reverse transcriptase domain
PfamPF139665.1E-1817561845IPR026960Reverse transcriptase zinc-binding domain
Gene3DG3DSA:3.30.420.101.1E-819512080IPR012337Ribonuclease H-like domain
SuperFamilySSF530981.2E-1119532079IPR012337Ribonuclease H-like domain
CDDcd062222.22E-2119552074No hitNo description
PfamPF134564.2E-2319572075No hitNo description
PROSITE profilePS508799.38919742078IPR002156Ribonuclease H domain
Gene3DG3DSA:1.10.10.102.0E-3721222212IPR011991Winged helix-turn-helix DNA-binding domain
SMARTSM004151.0E-5721242217IPR000232Heat shock factor (HSF)-type, DNA-binding
SuperFamilySSF467855.03E-3321242217IPR011991Winged helix-turn-helix DNA-binding domain
PfamPF004476.2E-3021282217IPR000232Heat shock factor (HSF)-type, DNA-binding
PRINTSPR000569.5E-1821282151IPR000232Heat shock factor (HSF)-type, DNA-binding
PRINTSPR000569.5E-1821662178IPR000232Heat shock factor (HSF)-type, DNA-binding
PROSITE patternPS00434021672191IPR000232Heat shock factor (HSF)-type, DNA-binding
PRINTSPR000569.5E-1821792191IPR000232Heat shock factor (HSF)-type, DNA-binding
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0055114Biological Processoxidation-reduction process
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0004523Molecular FunctionRNA-DNA hybrid ribonuclease activity
GO:0016717Molecular Functionoxidoreductase activity, acting on paired donors, with oxidation of a pair of donors resulting in the reduction of molecular oxygen to two molecules of water
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2561 aa     Download sequence    Send to blast
MVVAMDQRTN VNGDARARKE EGFDPSAQPP FKIGDIRAAI PKHCWVKSPL RSMSYVVRDI  60
CAVAALAIAA VYIDSWFLWP LYWVAQGTLF WAIFVLGHDC GHGSFSDIPL LNSVVGHILH  120
SFILVPYHGW FTEEEKGKGI QTEYQDSTVK RIKAPVLDTT RLIQDNALTL IGRVTNPREQ  180
RIWALIPSLP RKWNLQGRAV GSDLGNNCFQ FRFEREDDLQ RVLDNRPYHF AYWMVILQRW  240
EPVISPTFPS QIPFWIRIKG LPLHYWHEAM VRSVGQELGV YVEQELTKTT ARVRVLVDGL  300
KPLVVQPIIE YESGEESTIY LEYERLENHC SYCFSLSHLS DYCLKKEPSI TRQGEGNGTQ  360
MSKPRQVEDL EAPSDRPRSQ REEYDSRRSQ RVTWQRRNSQ DASQLKENDI PRSERVEKAP  420
QLFHQRLDRY RKAYGERVSS RHTRNPPPPS RTRSPTEGVG HSRDKEHVEN QQIYTSPAYV  480
KNRDHQNRRE RPAFSANPRK EISQWRVKPT TMVERQSQEP TIQEAVTPIS ANVNRHPLQN  540
TSVHIPTNEE VMEELHQTTL QYLSCIDPTE AAARRQRVLQ GDSQGQMEQV AAGIIEAATR  600
AAEITVHQNA VRSEENLPHQ NDEASSIIGD QTTRSPEQEI RSSGLHSYAA FLRSPIENLP  660
TNGALRNPPT DAVIPQGQKR KRTTPARLRS AIVSPKHPQG ASSKKRLRAL IRTSPGGATR  720
SSGRDGTKRH TPDRNSDVVG PSRSSNQPRT NVFPAIQRLQ EINRVNAPDI FFLMETKNPT  780
EFVTKELDWL LMETYIVPPH SPGGGGLFLA WRKEVEMTIN SATNNYIDTN ITYKGASFQA  840
TFVYGEPDHT KRQAVWSEIS SLKNVNGGAW FLTGDFNEIT DNSEKQGGPA RAEGTFCAFR  900
TFLSQNDLFD LKHAGNFLSW RGKRSSHVVH CRLDRAISNT DWTEKFPSCR SVYLNYEGSD  960
HRPLLSFCDT TRKKGHRIFR YDRRLKGNEE VEKLISDIWE GFPNLTVESR LAMCRKAISK  1020
WCRTAQVNSQ KLIASLKLQL EKAMSDSREE ETKIQEINTN LLKAYQAEEA FWKQRSRQLW  1080
LSLGDANTGY FHAVTKGRRA KNKLTIIEDE AGKPWHEEEQ IARVISQYYH DLFTAVPFDG  1140
GPTISKALSP CITQEMNETL ISNPTDEEIK RALFAIHADK APGPDGFSAS FFQSNWSVVG  1200
PATVTEVQMF FNSAELPASM NTTHVRLIPK VTGAKTVADY RPIALSNVFY KIISKLISLR  1260
LKPILGSVIS ENQSAFIPGR VITDNVLITH EVLHYLKASQ AEKKCPMAVK TDMSKAYDRM  1320
EWDFIEQVLQ RLGFHEKLIK LIMQCITTVS YSFLINESVY GNVIPQRGIR QGDPMSPYIF  1380
ILCGEVLTGL CKEAERNGSL PGAERNGSLP GVRVARGSPR INHLLFADDT MFFCYSTPTS  1440
CKTLKDILLE YERASGQKIN TNKSSITFSS KTPPMVKDNA KLLLDITKEG GVGKYLGLPE  1500
HFGRRKRDLF TSIVDKIRQR ALSWSTKCLS KAGKMTMIKS VLTAMPSYSM SCFQIPISLC  1560
KRIQSVLTRF WWDGNDEKKK LCWVSWANLS KPKAAGGMGF RDVQVFNQAL LAKNAWRIVT  1620
EPSCLLSRVL RGKYCHKEAF LDVEPSSACS HGWRSVLHGR DLLKQNLGKV IGNGLNTKIW  1680
QDAWISLDTN IKPYGPITED ASDLRVSDLL TTDLKWNTTR IEELLPEFAA KILCLRPSQM  1740
GVEDAFIWQP LNTGIYSTKS GYHSAMTPNE SIIPSSNVDW YKDVWNEKCT PKLKVFVWSI  1800
LQRAFPIGEN LQRRGFNASV TCPRCGNQES ATHLFFSCPY AKKVWSLLPL ASTAHLADFA  1860
SFESAIATFH QMPCLPPSGI TTNILPWVCW QLWTSRNHLI FENRKFSEEE VALRSITTAR  1920
EWITAQETAK ISQPMIHRGS NSLRREEEDS NITKCFTDAA YDKETRQAGL GWIFSNRSAL  1980
TCKGSSFQTY VNSPLMAEAL ACRSSLLHAT SIGLRDLRIF SDNQSLVRAI NSKLASKEIF  2040
GILADIKNLA ASFDSISFSH IARSLNVEAD TLAKAVLRDP SSALDFSLSI SASPLWAISE  2100
KEMGSIPESV PTANSSTTTV VMSSIPPFLS KTYDMVDDPS TDEVVSWSSG SNSFVVWNVP  2160
EFSKQFLPKY FKHNNFSSFV RQLNTYGFRK VDPDRWEFAN EGFLKGQKQL LKSIIRRKPP  2220
QVQPPQQPQV QHSSVGACVE VGKFGLEEEV ERLQRDKNVL MQELVRLRQQ QQVTEHHLQH  2280
VGQKVHVMEQ RQQQMMSFLA KAVQSPGFLN QFAQQQSHNE GSQQHISETN KKRRLPVEDQ  2340
KNSGSNGLNG LSRQIVRYQS SMNESANSML QQIHNMSNNN NHGGFLLGDV PNPNLSDNGS  2400
SSNGPSGVAF ADVSSNPAMT HHNSPCATNQ VLEETNLPYP QADLLVPNQG SGSPSPDLVG  2460
CETDNGECLD PIMAVLDGSM MLETDNELLP GGVQDSLWEQ FFGESSGIGD SDELVSGSVD  2520
NELIMEQLEL QPNLRNVLSN NQQMNHLTEQ MGLLTSDALR K
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5d5u_B3e-252123221724129Heat shock factor protein 1
5d5v_B3e-252123221724129Heat shock factor protein 1
5d5v_D3e-252123221724129Heat shock factor protein 1
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00328DAPTransfer from AT3G02990Download
Motif logo
Cis-element ? help Back to Top
SourceLink
PlantRegMapRsa1.0_00008.1_g00039.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankBNALINDES1e-143L01418.1 Brassica napa linoleate desaturase (fad3) mRNA, complete cds.
GenBankJX8667481e-143JX866748.1 Brassica oleracea isolate Albo-UA fatty acid desaturase 3-2 mRNA, complete cds.
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM229
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G02990.10.0heat shock transcription factor A1E