PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Do006942.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Panicodae; Paniceae; Dichantheliinae; Dichanthelium
Family bHLH
Protein Properties Length: 3171 aa    MW: 337551 Da    pI: 7.4696
Description bHLH family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
Do006942.1     genome  Dichan  View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    HLH     31.2   4e-10    710    752   8          54
                 HHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHH CS
         HLH   8 rErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksL 54 
                  Er RR+++ + f++L +llP +     kK +Ka+++ +Av YIk L
  Do006942.1 710 TERERRKKMENMFSTLHTLLPRL----PKKADKATVVGEAVTYIKTL 752
                 6*********************7....399***************98 PP

2    HLH     36.9   6.3e-12  2335   2378  8          55
                  HHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
         HLH    8 rErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55  
                   Er RR+++++ f+ L  llP++     +K +Ka+i+ kAv YIk L+
  Do006942.1 2335 TERERRKKMKDMFSALHALLPQL----PEKTDKATIVGKAVTYIKTLE 2378
                  7*********************6....39****************995 PP

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End   InterPro ID  Description
SuperFamily      SSF54001           1.43E-8   62     128   No hit       No description
PROSITE pattern  PS00972            0         67     82    IPR018200    Ubiquitin specific protease, conserved site
PROSITE profile  PS50888            13.256    702    752   IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
SuperFamily      SSF47459           1.26E-12  705    772   IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
CDD              cd00083            2.40E-9   707    753   No hit       No description
Gene3D           G3DSA:4.10.280.10  2.1E-12   707    769   IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
SMART            SM00353            2.1E-7    710    758   IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
Pfam             PF00010            1.0E-7    710    752   IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
PROSITE profile  PS50888            14.081    2327   2377  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
SuperFamily      SSF47459           6.67E-14  2331   2394  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
CDD              cd00083            1.04E-12  2332   2382  No hit       No description
Gene3D           G3DSA:4.10.280.10  2.4E-14   2332   2389  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
SMART            SM00353            4.4E-10   2335   2383  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
Pfam             PF00010            1.7E-9    2335   2378  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
CDD              cd02116            0.00355   2507   2555  No hit       No description
Pfam             PF01201            6.6E-29   2697   2840  IPR022309    Ribosomal protein S8e/ribosomal biogenesis NSA2
CDD              cd11382            7.12E-41  2758   2842  No hit       No description
TIGRFAMs         TIGR00307          5.2E-25   2758   2842  IPR001047    Ribosomal protein S8e
Gene Ontology
GO Term     GO Category         GO Description
GO:0006412  Biological Process  translation
GO:0006511  Biological Process  ubiquitin-dependent protein catabolic process
GO:0005840  Cellular Component  ribosome
GO:0003735  Molecular Function  structural constituent of ribosome
GO:0036459  Molecular Function  thiol-dependent ubiquitinyl hydrolase activity
GO:0046983  Molecular Function  protein dimerization activity
Sequence
Protein Sequence    Length: 3171 aa
MGKFMHDNPL VPHSEVNRHH YSLAAVAVGL GLGVAGLCKV LYSSLSMPWV SPRNLLLGSE  60
RVYYVGGLRN LGNNCFLNVI LQALASCDSF VSSLYGLLAT DGLLPEEKDE RMPLLLALSP  120
LLEDLSIVRD ERIVLNPDGL SLDFENFHCL PLSPVLNTNG DIVLHESSPS QLSPLTSFSP  180
LPCNHCAMSQ EGANLPHEVE RDHTTNANGS THAGAGSAIL RPTSSISGSI SKPANEIDND  240
WSKLASPPVL HAGEDNNIGS KISGPAVLHA GEDDNTGSTL TSPSALHAGE DKDNNAGSKL  300
TNPTVLYVGE DNKTGSKLAI QAMPDAGEDN NAGSKLTCPV ALHAGEDNNI GSKLDSPAML  360
HTSVDNNVGF KLDIPEALYV SENNNARSKL ISPVALHASE DKDNNTRSNL ASPVVLHASK  420
EINTGSKLAS PVALHAAGED KENKAESKLA GLVALHVGED KKEGSKLVIQ AVPHHGEDNN  480
VGSNSKVSSP VVLFAGEDNN AGSKINSPTV LHAGVDKNTT LVLYAIKNNN AKSKLAAIPT  540
VLHAGEDKDN NSGSNLGSQV VLHAGEDNKV GSNLAVQPVP HVGEDPSAES KLAYPGGKDN  600
HTVSKLAIQA TLHACEDNNT SSKPTSLATV DAGENNTKEG KNSEEEGGGN VVAAGSGGGA  660
SSSTSKKSTV DGGSDHGAIK DAPAAIADGG RGKGKGAKAV KEDHVLHIWT ERERRKKMEN  720
MFSTLHTLLP RLPKKADKAT VVGEAVTYIK TLKGTVKKLE KLKQERMRAQ VEQLLVGAGS  780
SSAAAAPSSA RHPAPAPATR EAILADKVHD WNTQEAVMAE LKAAATAVIE AAGTSSAAPL  840
GTMVAAAAPA PVLAPAPGPP IQTWSTPNIV VCVAGNAAFI NLRTPRHPGM LTKLMYVLEK  900
HRINVMATTV SSDQSHSLLS IQARVSQTGL GSCAIFLISY INAATPAPPH FPEDLTIEDS  960
PTHPQTRSHQ PELKPLLSTW LPTTQPLLRS PSDASLCCVD REAPLTLPAT SAPDRMLLES  1020
QLSNETAPQG SGSLPVVEVF APNPTSPAPA ATSLDVVAAS SSGGASPEPP SPETVPTSLP  1080
SFAQALGKPL DPPVLPSPPL RRPRNRVVPS APPRRSHRLA KKAMGRTPAL MAAQNILMRK  1140
LGLSVGTQLQ TADFDRYIQI FKNGLTEEQT KMIREPFANS VTAPADLGVS KSEKFITMGI  1200
CIIKPIDAAV LLSIYRAISL NTKRISMVVP FYTNCNDGEA ETAKKCCCLS VNLLCANCKV  1260
YAFSCVLAAF AFMSSPSQLC SPLTSFSPLP CNHRAMSQEG ANLPHEVERS HDQATVPDGI  1320
TLAGVGLAIV RVDSSSSSSV SNLDSESNNT RSKIANLVVL HAGEDNKVES KLSSPVVLHE  1380
GMESNIGSKL ASPVVLHAIK DKDKKAKSRL ASSVVLHVGE DNKVGSNLTI QAIPHVVEER  1440
NTGSNSKPTC LVVLYAGEDN NVEPKINNPV LLHAGVDNNA AGVLYAGEDN NARSKLTSLG  1500
VVHSSEGKEN NIGSNLVGLV VLHVGKDNKV SEVNKVRSNL AIQSVPHTGE ETNAGSKLAS  1560
PAMLHGNEDN NAASKLIIQL VPHVAKDNNI GSETASSAVV DAGENKTKEG HDVEEEEGNT  1620
ILAAGSNGSA LSNTIKKLKK DDGSAHDAMK DAPSSVADGC RGKGKGKGST AVEVDSALHI  1680
WTKQEWIMKM NNMFNILHAL LPQLPKNVTE VNEATVVWEV LSYIKTLDPE VTVQWLEKLT  1740
QERMRLEAEQ LVVGAGSSSS AAPASSSARH PAPAPAPARA TREVILADMV HDWNAQEAIL  1800
AELKAAASAV VEAASWPNVE VHVAGNNAFI NLRTPRHPGI LTKLQYVLEK HRINVMASTV  1860
SSDQIHNFFS IEASIDAATP APSQIPENLT IKDRFCAFIC YVLRSVFMGH IYFPLLLNLF  1920
PFAGETAKKC CCWKTFLCAK CKVYALPCVL GAVAVYKSSP SLIFPLRSSS PLPCNHCAMS  1980
QEGTNLPHEV EQSHDHATTP DDSILAGVEP TIVRFIGSSS SSVSNLASET NNIGSKPGSP  2040
TVLHASEDNN AESMMTSSVV LHAGEDNNVG AKLTNQAAHH ADEDSNTGSK LACPVVLHAD  2100
EDKNYNIESK PSSSVLLHAG EGNKAESNLA VQVVPYVDKY NNIGSNSKLT CPAVLYAGED  2160
KNSRSKIESP VMLHAVMDNN TTLVVYAGET NNTESKIASP PILHADENKD NNVGYNIASP  2220
EELHASEDNR DGSNLAIQKV PHGGKDNNTR SKPTNLVAVD TGRKNAKEGK NAREEGGGGD  2280
TPSNANMKSI VDGHGAHDTI KNALAAVADG GGGKGRGRGK GKGAAEVDHA LHIWTERERR  2340
KKMKDMFSAL HALLPQLPEK TDKATIVGKA VTYIKTLEGN VQSLEKLKQE RICAQAEQLL  2400
LGADSSSAAA ASSSARHPAP APAPATREAI LANMVHDWNA QEAIMAELKA AASAVVEAAG  2460
SSSAAPRGTN VAVEAPVAAP ILALAPPLQT WSAPNIVVCV AGNDAFINLR TPRRPGMLTK  2520
VLSVLEKHRI NVMATTVSAD QSHSLFSIQT RINGTTPAPP QLPENLTIED RSEPPNYPSP  2580
MDPSPHQYHD SPAYTTHNDF QDALDQFKAQ IGRDFTFKAS WHALKNTRKW LDSQNSSTGT  2640
AAGENSAGTD EPLGSEPPRP IGRDAAKKHR SDGARTESSS AAAGFFERLA VSREGKREDE  2700
AARAAKSEAA MQRQLDLQDR ANNLVEKDIT LRCFAVHVVS CSVLCMYCLM LLRIMEIRYE  2760
LGRQPANTKL SSNKTVRRVR VRGGNVKWRA LRLDTGNYSW GSEAVTRKTR ILDVVYNASN  2820
NELVRTQTLV KSAIVQGQEG EAAAEETKKS NHVQRKLDKL EGKELEFYMK KLQRKKGKDF  2880
RIKGNVASGG TMITPPPSTE AHLPVTDRRS LRLSGSGSGS ICNLASETNN VGSKPTSLVV  2940
LHAGEDKDAR SNFASQATLY DDEDNNLGSK PASLVVLNAS EYKENNVGSK LTNAMVLHAS  3000
EDNKLWRRRR RRRSRRRRRR RRRRRGGIIL TAGSSCGNLS NTIKKGCNIP EASSGGGAPF  3060
NTSKKLTTDG DPSHDTIEDA PAAMVDVGKG KGRGKSKGTV AEEVDHWLHI WTKTERMKKM  3120
KNMFSTMHAL LPELPKKINA ATPAPPFLPK NMTNEDRYKL AVEEMLHVVA N
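
The sequence block above is printed in groups of ten residues with a running position count at the end of each line, while the domain and feature tables refer to residues by Start and End positions. The following is a minimal sketch of how such a block could be joined into a plain string and sliced by position. The helper names `parse_sequence_block` and `subsequence` are hypothetical, and the sketch assumes that Start and End are 1-based and inclusive, which this record does not state explicitly. Only the first two lines of the sequence are used as sample input.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a grouped, position-annotated sequence block into one string."""
    # Keep letters only: this drops spaces, newlines and the running
    # position counts printed at the end of each line.
    return re.sub(r"[^A-Za-z]", "", text).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end (assumed 1-based, inclusive)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# First two lines of the protein sequence from this record.
block = """
MGKFMHDNPL VPHSEVNRHH YSLAAVAVGL GLGVAGLCKV LYSSLSMPWV SPRNLLLGSE  60
RVYYVGGLRN LGNNCFLNVI LQALASCDSF VSSLYGLLAT DGLLPEEKDE RMPLLLALSP  120
"""

seq = parse_sequence_block(block)
print(len(seq))
# Residues 67-82, the span listed for the PROSITE pattern PS00972.
print(subsequence(seq, 67, 82))
```

Applied to the full block, the same parser should give a string whose length can be checked against the stated 3171 aa before any coordinates are trusted.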
3D Structure
Structure
PDB ID   Evalue  Query Start  Query End  Hit Start  Hit End  Description
4v7e_BI  6e-54   2758         2878       26         216      40S ribosomal protein S8E
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    3010   3024  RRRSRRRRRRRRRRR
2    3014   3024  RRRRRRRRRRR
3    3016   3024  RRRRRRRRR
4    3017   3024  RRRRRRRR
5    3018   3024  RRRRRRR
Cis-element
Source       Link
PlantRegMap  Do006942.1
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  BT035765  1e-137   BT035765.1 Zea mays full-length cDNA clone ZM_BFb0082H16 mRNA, complete cds.
GenBank  BT068975  1e-137   BT068975.1 Zea mays full-length cDNA clone ZM_BFb0091C15 mRNA, complete cds.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_025825310.1  0.0      uncharacterized protein LOC112900700
TrEMBL  A0A3L6PWA2      0.0      A0A3L6PWA2_PANMI; Uncharacterized protein
STRING  Si009419m       0.0      (Setaria italica)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G49770.1  3e-22    bHLH family protein