PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bobra.0015s0106.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Elliptochloris clade; Botryococcus
Family MYB
Protein Properties Length: 2351aa    MW: 253027 Da    PI: 5.0322
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bobra.0015s0106.1.pgenomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding22.13.5e-0710501094147
                           TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
      Myb_DNA-binding    1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47  
                           + +W+++E  ++ d +  ++++ +++Ia++++ +Rt  +c+ +++k+
  Bobra.0015s0106.1.p 1050 KRPWSADEKRIFMDKFLAFPKD-FRKIASYLP-HRTTGDCVAFYYKV 1094
                           569*****************99.*********.***********996 PP

2Myb_DNA-binding27.66.8e-0912061247244
                           SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHH CS
      Myb_DNA-binding    2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrw 44  
                           ++WT+ E+ +lv++vk++G +  +++ + mg +R+  ++k+++
  Bobra.0015s0106.1.p 1206 LPWTPAEEARLVEGVKKYGRN-LDLVREFMGSNRSKASIKDFY 1247
                           69*****************66.*******************99 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2351 aa     Download sequence    
MSDRHNRQNR EWRSPYPRNE RERPFSKRDV NASSHADRDL FPAQHGQMPF ASQGSRFDLA  60
AQNRHGRDRP LPGPPHPLPT TPTASSMWRA HEAPRRSSPH RGSPQRIAHG GSTSGPLDRR  120
YSGLAHDRIT GRSQDSYVPR TASWEGPPRN HLDPSQSHMD RDVRRGHLPP PPPPPLVPPH  180
QLMRTIASST PLPPPPPLPP NSRGLARWPP GGGLMPADFE SGEFRNESSV NSFSGGLPHP  240
PSLGGPPHLP HTSSLGFTDD RTARRTLPGP PPVGPHTSLP PHLAQMDAAP MTDKASSSWT  300
PKLELRGPER VWGIGNRAVE EVTSLVDPAD VKLEPRRPVD AELLQQRLQA VQAVRSDPPL  360
RSDSSFEIKS EALGRGEGGA ITPPLSGTDG TPGDLPMPTV GSGGLEGHSD EAPRRKKQRL  420
KWGEGLNARN KAAKAPEAEV RSQGEAVVPQ DDMCSPSVPK IEAGDAHSQE APAVADTTLP  480
AKSVPAPERD SIAAAKDLLK GPSSPAEVSA LIPLASRMGS ADAAARTPPP GPVAEEMPST  540
GEAGPVPAAD GDGLAIGGPP GLGSPVEVLT DKLLGLDKEI GDREKELEEL EAAKLTAVRA  600
AEAARAERAE VEKDKIPDAE FPQLSDDELD EPALPDALPE SSEEEPLEPE VRFDDSEEDM  660
EVEEEEVTSE DEVQEEEEDE DEDEDEEEDE EVEDPAHSHL ERKRSQGVPP GGESNTSVLP  720
KLPQMAGGAQ QANREEALEA ELESVPEAIY NAVSTTNDKV EVDLDGTPAA EALPEPEAPE  780
TEEVVRDWVS EKKGRWALHL ARASAVEGGK DMTSEDIDSE GDADILQWSP HRKQSLLTLD  840
ERFELCSERI ALAAGEVVKA MNGQLLSCLP DYLQPEPGTS WWPKALCLNH TQMPEYRNTM  900
LTGWLNRASF KRIIMERKWL TQLRWRELAE EYRAKFEDWQ VYFAEMGLDQ LQAPVQDPIL  960
ALTRRGAGSN STRNAQRTST RSNLAMGGPP RQDLEQLAAK ERQCLDQLKN MCQMPDQLVE  1020
EDDRRFQGFE NNNLRVEDPV QQLKDEEDAK RPWSADEKRI FMDKFLAFPK DFRKIASYLP  1080
HRTTGDCVAF YYKVQKLDDF AAVRRKQQLK KRRQQSEINR STTYMGMGPA ARMPDATTTR  1140
ALPSRPTPQP YADGRPIAKT RGTRSKPAGR SQRYAPVLEP SPGIEAVEAH VAEPPPSEVA  1200
VEESNLPWTP AEEARLVEGV KKYGRNLDLV REFMGSNRSK ASIKDFYTRS RKRLKLDQLV  1260
RARGHPGGPV GELEELAPTP IGGAPEPARP ASPEDAGLNL HTTERLPEPS PEHEEGELGE  1320
DAAPRGKFRT LEDNAAVSQA VDLSARQGSG AVRQLLSLPA LTSGPPMFPA MAQHMLSLLF  1380
AHQAQQTAQA QAHAQQQANA QNAMFAQLLM NSQLPPHPFL NFAQSTTGID MLKLQTGFPH  1440
HLGIQHLLTP PLTPFHPQTF SGVHQGLSHM FPGQQFVPPG ATHPFFNPQQ AAALFRQAAA  1500
FRGCWACACP GSEADNLEDL QSARGSDQWE GVEGLPKGDP VDGGSESLEN APTRKQQAVW  1560
THQEKEAYQA WKKSGKDFSA LERALPNKTT QQIKNFQQNS RKQALNAAGE TSRKRQRPDP  1620
ASRGASSLQT PNSTPSATPA PTPPPEPPLA ADLPPRASSG GPSPPQSSVG TPGYNQAQML  1680
LLEGIRKRSQ GDPSELGTER PAGDPIGGLG DRLGPSEQDR DMSAWLTAMK EGGIGRPAAA  1740
VSPLVGLAGA LTASASASRR ESSFLETTNP EPSLASRLDQ LSRAMAPLDV GARPLGTFDT  1800
EASAFGEDGS GSAKDRASML DDGSPLSKKA KRVHVGPAIP LSPNPQGLAS ISSGGGKRDT  1860
AGPSTPSGPT ITTGGGLAHA SSAPSPAQCM DTLKQLRTLA ELTRVDDMPE APPIFPDPFT  1920
TPDLPPKGLK ESHVQLLRNL AMARGASQSL AAEPDATPAR ALPHSWLSSI SVSRATGGLS  1980
VPLAYGNAVP LGPQTLSALL PSSGLDPLPA PRIAPLMDSS PGGAVPLSAL QQEPPATLPS  2040
AGDTSGLWGG GQPLRDKFPA VRQPAEEPTL VEKLELLTKR PVKLPELRCT ESVPPRTSPE  2100
ADVVQRNVPQ SLPSGAQHRS AQRQPSPSPH PLPMGTPRQA ATEAGEEETP EHPERKAHAA  2160
FFPRRPSPEH PENPGTTFAH RPQSPGYPGD SPSGQGTVAG EELADTYLGG PSKKESLGSV  2220
LQRTPGSKYG TPGEASRQEA VDMDIDTEAP SEVLPDALDQ TSLQSQEIPD QGDLAGLGSP  2280
SMGPHHDENL GLPGKAPPGQ ARSTLTPLPG REEDQVEDLP HTSPPAEANM DSEDTPMEQQ  2340
KIVEEGFPHS *
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.13e-15MYB family protein