PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Neem_7649_f_2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family HSF
Protein Properties Length: 664aa    MW: 76266.3 Da    PI: 8.611
Description HSF family protein
Gene Model
Gene Model ID    Type      Source
Neem_7649_f_2    genome    NGD
Signature Domain
No.   Domain          Score    E-value    Start   End   HMM Start   HMM End
1     HSF_DNA-bind    105.5    4.4e-33    25      125   2           103
                    HHHHHHHHHCTG........GGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXXX CS
   HSF_DNA-bind   2 Flkklyeilede........elkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkkg 89 
                    Fl+k+y++le++         +k+++sw+++gn f+v+ + ef++ +Lp+yFkh+nf+SF+RQLn+YgFkk++ ++         weFkh++F++g
  Neem_7649_f_2  25 FLSKTYDLLEEAdvgadyvdGNKKIVSWNAEGNGFIVWSPAEFSELMLPRYFKHNNFSSFIRQLNTYGFKKTSPKQ---------WEFKHEKFQRG 111
                    9*********666666666699**************************************************9988.........*********** PP

                    XXXXXXXXXXXXXX CS
   HSF_DNA-bind  90 kkellekikrkkse 103
                     +++l +i rkks+
  Neem_7649_f_2 112 CRHMLAEITRKKSD 125
                    ***********985 PP

Protein Features
Database          Entry ID            E-value    Start   End   InterPro ID   Description
Gene3D            G3DSA:1.10.10.10    4.3E-35    15      116   IPR011991     Winged helix-turn-helix DNA-binding domain
SuperFamily       SSF46785            3.94E-26   21      122   IPR011991     Winged helix-turn-helix DNA-binding domain
SMART             SM00415             1.9E-42    21      122   IPR000232     Heat shock factor (HSF)-type, DNA-binding
Pfam              PF00447             1.9E-28    25      122   IPR000232     Heat shock factor (HSF)-type, DNA-binding
SuperFamily       SSF48452            2.28E-6    306     367   IPR011990     Tetratricopeptide-like helical domain
Gene3D            G3DSA:1.25.40.10    1.0E-12    309     522   IPR011990     Tetratricopeptide-like helical domain
PROSITE profile   PS51375             10.983     315     349   IPR002885     Pentatricopeptide repeat
Pfam              PF01535             5.8E-6     318     347   IPR002885     Pentatricopeptide repeat
TIGRFAMs          TIGR00756           4.3E-7     318     350   IPR002885     Pentatricopeptide repeat
PROSITE profile   PS51375             9.843      350     384   IPR002885     Pentatricopeptide repeat
Pfam              PF13041             4.7E-8     352     396   IPR002885     Pentatricopeptide repeat
TIGRFAMs          TIGR00756           2.2E-5     353     385   IPR002885     Pentatricopeptide repeat
PROSITE profile   PS51375             7.377      385     419   IPR002885     Pentatricopeptide repeat
PROSITE profile   PS51375             5.689      421     451   IPR002885     Pentatricopeptide repeat
SuperFamily       SSF48452            2.28E-6    428     480   IPR011990     Tetratricopeptide-like helical domain
PROSITE profile   PS51375             6.106      456     486   IPR002885     Pentatricopeptide repeat
PROSITE profile   PS51375             6.555      491     525   IPR002885     Pentatricopeptide repeat
Pfam              PF01535             0.045      495     522   IPR002885     Pentatricopeptide repeat
TIGRFAMs          TIGR00756           0.0031     495     526   IPR002885     Pentatricopeptide repeat
PROSITE profile   PS51375             8.396      526     560   IPR002885     Pentatricopeptide repeat
Pfam              PF01535             0.0013     531     557   IPR002885     Pentatricopeptide repeat
Gene3D            G3DSA:1.25.40.10    1.0E-12    559     592   IPR011990     Tetratricopeptide-like helical domain
PROSITE profile   PS51375             5.119      561     595   IPR002885     Pentatricopeptide repeat
Gene Ontology
GO Term       GO Category          GO Description
GO:0006355    Biological Process   regulation of transcription, DNA-templated
GO:0005634    Cellular Component   nucleus
GO:0003700    Molecular Function   transcription factor activity, sequence-specific DNA binding
GO:0005515    Molecular Function   protein binding
GO:0043565    Molecular Function   sequence-specific DNA binding
Sequence
Protein Sequence    Length: 664 aa
MDRNLLEVKE GTRQQSPRTK NPAPFLSKTY DLLEEADVGA DYVDGNKKIV SWNAEGNGFI  60
VWSPAEFSEL MLPRYFKHNN FSSFIRQLNT YGFKKTSPKQ WEFKHEKFQR GCRHMLAEIT  120
RKKSDPSVFP AYLKASNEGI TTTTAMEEYN SNNNSNHLLL MEENENLRKE KLQLQMQIAE  180
FKALEIKLLD CVSQYMGVHQ NKVRRFYYQN RVNKTTLYSI ISPLGNPKTS VEPELENWVK  240
AGNKVRVGEL QRIIRDLRKR KRFPHALEVS EWMNRKRICV FSPSEHAVQL DLIGRVHGFL  300
SAESYFNNLK EHEKNDKTYG ALLNCYVRQR QTEKALSHFQ KMKEMGFSLS SLSYNDIMCL  360
YTKIGQYEKV PSVLTEMKEQ NVSPDNFSYR ICINSYGART NLEGMEKILR EMESQSHIVM  420
DWNTYAVAAN FYIKANCNDK ALDVLKKAEE RLDKKDGTGY NFLISLYASL GNKAEVLRLW  480
ELEKTDCKRY INRDYITMLE SLVKLGELEE AEKVLKGWES SGNCYDTRLP NTVIIGYCNN  540
GFHEKAEAIL EDLTDKGKVT TPNSWALVAA GYLNAGKREK GFECMKIALS LHVEGKGWKP  600
NPKLMTSILS KLGDEGSVQD VEAFVALLRG AEYCEKHATV TLGPWFLFAL KELVGCISVL  660
VKAD
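
The sequence above is printed in blocks of ten residues with a running position counter at the end of each full line, while the domain tables give Start and End coordinates. A minimal sketch of how the two fit together, assuming the coordinates are 1-based and inclusive (which matches the position labels in the Signature Domain alignment). The function names `parse_numbered_sequence` and `slice_1based` are illustrative and not part of any PlantTFDB tool; only the first two lines of the sequence are used here to keep the example short.

```python
def parse_numbered_sequence(text: str) -> str:
    """Join a block-formatted protein sequence into one string.

    Whitespace is dropped, and so are the numeric position counters
    printed at the end of each line (e.g. "60", "120").
    """
    residues = []
    for line in text.splitlines():
        # Residue blocks are purely alphabetic; counters are numeric.
        residues.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(residues)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


# First two lines of the record's protein sequence, copied as displayed.
first_two_lines = """\
MDRNLLEVKE GTRQQSPRTK NPAPFLSKTY DLLEEADVGA DYVDGNKKIV SWNAEGNGFI  60
VWSPAEFSEL MLPRYFKHNN FSSFIRQLNT YGFKKTSPKQ WEFKHEKFQR GCRHMLAEIT  120
"""

seq = parse_numbered_sequence(first_two_lines)
print(len(seq))                   # number of residues parsed
print(slice_1based(seq, 25, 30))  # start of the HSF_DNA-bind hit
```

With the full twelve-line sequence as input, the same slice call with the Start and End values from a domain table would extract that domain's residues.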
3D Structure
Structure
PDB ID    Evalue   Query Start   Query End   Hit Start   Hit End   Description
5i9f_A    6e-19    236           620         5           429       pentatricopeptide repeat protein dPPR-U10
Annotation -- Protein
Source      Hit ID            E-value   Description
Refseq      XP_006449328.1    0.0       pentatricopeptide repeat-containing protein At4g21705, mitochondrial
Swissprot   Q84JR3            0.0       PP334_ARATH; Pentatricopeptide repeat-containing protein At4g21705, mitochondrial
TrEMBL      A0A498JYW4        0.0       A0A498JYW4_MALDO; Uncharacterized protein
STRING      EOY28262          0.0       (Theobroma cacao)
Orthologous Group
Lineage   Orthologous Group ID   Taxa Number   Gene Number
MalvidsOGEM546323
Best hit in Arabidopsis thaliana
Hit ID         E-value   Description
AT4G17750.1    1e-34     heat shock factor 1
Publications
  1. Lurin C, et al.
    Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis.
    Plant Cell, 2004. 16(8): p. 2089-103
    [PMID:15269332]