PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG68262.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family MYB
Protein Properties Length: 2591aa    MW: 278724 Da    PI: 7.116
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG68262.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding23.11.7e-07562606246
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      g+W++ Ed++l+   ++    +W++Ia+++g gRt+ qc  r+q 
       GBG68262.1 562 GPWSKVEDKRLLSIAREEKYVNWSRIAERLGSGRTAGQCLTRFQR 606
                      89****************999*********************995 PP

2Myb_DNA-binding46.39.9e-15616658346
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +W ++Edell +av+++G ++W  Ia++++ gRt+ qc +rw k
       GBG68262.1 616 KWIPQEDELLRKAVRKYGEHRWQDIAQCVP-GRTGHQCLHRWTK 658
                      899***************************.***********76 PP

3Myb_DNA-binding50.83.9e-16666712148
                      TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
  Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                      +g+W +eEd+ l++a+ ++G ++W++Ia++++ gRt+ qc++rw ++l
       GBG68262.1 666 KGKWLPEEDKALIHAIGMFGVKQWSKIAAHVP-GRTDVQCRERWCNVL 712
                      79******************************.***********9975 PP

4Myb_DNA-binding51.42.5e-16720762346
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT+eEd++l +av+++  g W+++a++m+ gRt+++c  rw  
       GBG68262.1 720 SWTPEEDKKLMEAVAKHEVGKWSAVASEMP-GRTDNHCWRRWKV 762
                      7*****************************.***********86 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2591 aa     Download sequence    
MNPVVPDDVI VLEDDPEEGE EDDREECGGE ARQCRGEDER NAVVGRYAVD SDEEGLDGEC  60
RDPNVEDRGD GHGDGDENDD DAILRSVQAD IGIGWGPRRP ATLSHQRASD SDEDKDDDEV  120
LESIKRRYRR MDDERHGDKG KGRSNERRSA DKSDLAVTTF DDRQARVLGH DSSNNAGSVV  180
ERRYHSEFAK RAGTGSAAAK VGRNGGASYA IDSAGGGPSM GVIRPDADPD PDGVERGKRH  240
PVIPNAHNAI VHVSDPLPVT AAAGAAAECD PILLLPAPDG DQSSDADRDP DGDVDGDEDG  300
DKQEVIKEAV IAEGGIGAIE DALEKNRRCQ AAHREVLERI AIRQRENERI LRRVRVLLGY  360
ENSVKKRRQP TRAIPGAAGK EDDEEGVEAL FGEADGIGAG MPLADLLFEW KKKSLKKKGK  420
SAAASAPANE EGGEQEWGLP ENADMKKLRP VMERLPLILD IRKWTDQERS KLKNAMKMKA  480
QEKILSDLYQ ELGDRDITPE KASDLDQRLA DLGTREVTAD ELAAIVPSLD WEDFASCYLP  540
SRSGIDCKIQ WLNQDDPKIK QGPWSKVEDK RLLSIAREEK YVNWSRIAER LGSGRTAGQC  600
LTRFQRSLNA QILNSKWIPQ EDELLRKAVR KYGEHRWQDI AQCVPGRTGH QCLHRWTKGL  660
KPDIRKGKWL PEEDKALIHA IGMFGVKQWS KIAAHVPGRT DVQCRERWCN VLDPGLKQDS  720
WTPEEDKKLM EAVAKHEVGK WSAVASEMPG RTDNHCWRRW KVLHAEQVGP YIRSVKIKRV  780
AMVSNFVGRE SERPELGPQD FRITNSLEEL EPTATVNRRP KMKPRRRCRG QSMAGEAVDD  840
ENVDCALSAG QGSEAGQSSH EEAVAGSEAP VHGKGRHAGK EKRRRAMARK VRSRDGGSLG  900
GEDGMLDGAP SATQSVVGAE GSPVDDNATC PAVNTGNNRV RGAQKKSRRR NVVAGLPDSL  960
KANGTRKRRT NAGKGGQQYD ESQQQESTPA GDSAAAMGQE GCPLGTEKRP AALTSQEAFA  1020
LQKILEWHGH LAGIPQGGGA SVVCDVGDTS VEAGAPVTPQ EMLCARGVAP ANGEALTRRG  1080
ERVHVDGGKR RPRKRSAPVS EKNNNTSEQE PNGRGRGGGG GRSRRKQSAA LLEQGIVECE  1140
EETVRKKRPR RGGKHADERR PPSTIPTGDE NSQVREVSVL WNADNGFEAG ADGEEESLPL  1200
ALHSPLGAAR EDLAIGDVLC RNGVDDTEEM RIPLSMSMGS KTHNKRKRLS KNPLVDLEAE  1260
ADGGHGGKRK KRLSSGLEHR NCHSDEPASP SENGQTKKGR RRRQSKEDQT HFKPGGPPHE  1320
VLTDGAPPIL DNPSQEAANG RKNERRNVGG RRRKDLSSSG LENGKSPRAA KSPSSQSSQR  1380
RKGRRQSKAE EAHSKRDGPP NKEVMDGNPL ILDDASQPVL MMANEHGTAS SALAQRETHC  1440
EQGEAPLQIV EEAIVQQDEN GNVGLKHHES GNGHGCATSR DMECSEHGCL TLAEVNTTPV  1500
VPPTGVEVGG VDSSMAGIAA GARALGTICD RNKESVILGS QSAGVAKAFQ YNDQLARLYE  1560
PERVTAARQK RRERKMLMMQ GRSHGVRKGK KALPKGEGSA DRQVGRGGYL EGVENHVVAK  1620
EVAGGKNRVG VEGEGDDAGE VRDGGNPTDP TDSTEIEQRN NLSCSRGTSA LPAYPAWAAA  1680
LRRRATGSEV SPRSLRPSGY LPPSQCLFSP RPVASSGRPP LLNCFAGFSG PYSPAVRPSP  1740
VRPPMRQPTT QVSARPHPNF TTLLLGQTNS KPSTPSRSSS SQDASSPFIP HTSAVSVPVQ  1800
FAGSSPGMAE RASNSLAATS TTDQLPRGHK KALPVAHLLD FTSAVLASNA ATRLTGPETP  1860
AMHNCNEAES GMKPSSSSLP ENPLDDRAQS SESPVDVESS GRETCQPNDG ICLEGDAAEA  1920
EGICDLEEAE PRMNPSSLHE TQLLNTSQLS GAPLNLSCKR ADQSHGEGLE TVAAETPAMC  1980
HHKESEHGMN PSSLPERPLP NNTRTLSPAP ADSSGRKTDQ PNGIRLPIQA AASGHGLQET  2040
GVCKCADRSS AEHVEEPSER SSEKQCRLPA GSSQRVLQNG ESAGKQRWRA EEHFAGAENA  2100
TATVQLDYAG RSPSKLDAQR FVVATAHSNR DENICGSLSH TMDSPAKSSN GLCALPYRSS  2160
EHNRCIDYGT AVCDSNAEHA HRLEGSDAAA AEVIRLPAVG VSETVGAEPG HYEVNRVAMG  2220
QTVPANPSKA GAKAADACSP RKMRRLCNSS SSADGDDSEV SAARRSMSKR RRWTISDGHW  2280
CGDVSSFLEG RSLNSIDGAE SKLVKRRPQG ATEEEGVRKS HYPSCASADE GRFELSVGCA  2340
TKGGKVDGLI TPLKDIHTAG ETGRRRVTRG RCATCNQLIS TGEDDGPDTA CSKDPITSPG  2400
KAPGGGPLPG GGPLPGGGPL PGGGPLPGGS QQQWNAKTHD CLLWRGCLPQ LKGCQQTDWR  2460
PLKSCCAAIG CRRRRRGESM KCCDGSERGA GASAKTFLLR AQATAMVASG APGFAPCCCM  2520
IEMGSLVRKR REFGNALVTN KASNLKSGVS AAEEAVSGRE RSGCLSVSLR KQEPEVLKLV  2580
EWLWTMWSDG E
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1415421LKKKGKS
2948969RRRNVVAGLPDSLKANGTRKRR
3965970TRKRRT
413791385QRRKGRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18100.25e-85MYB family protein