PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pd.00g997750.m01
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family bHLH
Protein Properties Length: 949aa    MW: 105960 Da    PI: 7.2717
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pd.00g997750.m01genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH23.59.7e-08175222455
                       HHHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
               HLH   4 ahnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55 
                       +h+ +Er RR riN+++  L+ ++P +    sk +  a +L + ++Y++sLq
  Pd.00g997750.m01 175 SHSLAERVRRGRINERLRCLQNIVPGC----SKTMGMAVMLDEIINYVQSLQ 222
                       8*************************9....788*****************9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 949 aa     Download sequence    
MAEFTSDMQS IAPSLPFLDI APNTNMEPIH QYTDQFNPTV LDFYSSLNFQ TCMPFSNDNY  60
FSSQGPEFQG NLVQNFPNFF DHDNKSNQND EAPAVQHLVG AGAGNGFQES KKRRAMDDVS  120
ASSSGISTPP VSETGVKIKN SSGRGKRLKK SKEKEDEKPK DVVHVRARRG QATDSHSLAE  180
RVRRGRINER LRCLQNIVPG CSKTMGMAVM LDEIINYVQS LQNQVEFLSM KLTAASSFYD  240
FNSETDDMET KQSAKVYEAV ELERMKREGY GGYEIWRILC SKAEIAEKGA GLNVHFVFKY  300
IVRGKTRDGH GKQNQKYARE SSQTNWVPEL SPIANIVVRR CSKILGVPTT ELCEGFNSEA  360
SESIKLPLCY AKNFLEYCCF RALALSTQVT GHLADRKFRR LTYDMMLAWE APAATSQPIL  420
TLDEDLSVGV EAFSRIAPAV PTIANVIISE NIFEVLTAST GGRLLFSTYD KYLSGLERAI  480
RKMRTQSESS LLSAMRSPRG EKILEVDGTV TTQPVLEHVG ISTWPGRLIL TDHAFYFEAL  540
RVVSYDKAKR YDLSDDLKQV VKPELTGPWG TRLFDKAVFY KSISLSEPAV IEFPELKGHT  600
RRDYWLAIIR EILYVHRFIN NFQIKGVKRD EALSKAVLGI LRLQAIQEIC SANPLRYEAL  660
LMFNLCDQLP GGDLILETLA DMSTVRELAR SSNSKPGGGM YSISALDMIS NLGFAFGTSS  720
NNSVEAGLAV GEITVGEVTL LERAVKESKN NYEKVAQAQA TVDGVKVEGI DTNFAVMKEL  780
LFPFMELGKC LLSLAFWEDP MKSLVFCGVF TYIICRGWLS YAFALMLVFI AVFMALTRYF  840
SQGKSIHEVK VLAPPAMNTM EQLLAVQNAI SQAEGIIQDG NVVLLKIRAL LLSLFPQASE  900
KFAVALVVAA LTLAFMPSRY VFLLMFLEMF TRICFYSGVQ FRYATGKEG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1111139KKRRAMDDVSASSSGISTPPVSETGVKIK
2111152KKRRAMDDVSASSSGISTPPVSETGVKIKNSSGRGKRLKKSK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G18400.15e-55bHLH family protein