PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG89833.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family MYB_related
Protein Properties Length: 3994aa    MW: 421182 Da    PI: 4.9242
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG89833.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding262.1e-0828702911346
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46  
                        W++eE ++  ++++++G++ W++  ++++ +++l q+k + q+
       GBG89833.1 2870 TWSQEEKDKVFEVIERHGKD-WNLLQECLP-RKSLVQIKTFVQN 2911
                       7*****************99.*********.********99887 PP

Sequence ? help Back to Top
Protein Sequence    Length: 3994 aa     Download sequence    
MKSVRRGVMM AVMRVRKSVR RVMMTSRRGL MSEEWGCDDD EDDDEDDDED DDKDDDEDDD  60
EDDDEDDDKD DDEDDDEEIR VCKLEMHQSN QADSRATALP NARDSSNDDA GHLAGTAISS  120
EHSWPAAERA HTVDVRQPYS RDARLDRDRD RAVGKREWAE RSGFPRMTPV TNGGGSLGMS  180
KRRGAAGSSL SVSRECSGRQ MFYGDSCRYN IGSGGGQRNS DDGTGFASSG PGSGLENGVG  240
ARKGERGDNF FGVSPSFNLV ATGGEHRLKD AEERRSALER EPGLTAYSGS GASVDARGEG  300
FLSAEAAVIR PYHSLRRLEE ERFGRLKVGN GGGAGSSLET SGGSGGVGGA AAAVGGATVN  360
LGVAGAHGVG WDLPPRAERE HRTDRIRGLK GECGYRKVGS FGSSSFISLV QESGREWNNS  420
TDRDREGMRV ERYGRDSSRG CIKANGDGSG GCGREQTPTL STMSPQQQGS SPLEWRSGGY  480
SGLRDRLGGT GTAGSWEGSS LVPNQLASMD NSARSDRRVM HHGLEDGGSG ASHVLSDPLP  540
SLSTLSSLGT PLRDDTHVSP PKRQRLGWGQ GLAKYEKEKV EEVGDGNSTA AVQSASGQKV  600
ADSVVPAHTA SSESPVEAST ASTTNPVPVP VPASDGAACS GKDDDALAPT SGGAKEGKGD  660
ASRGFTSLRP QRASSSVSLL GTSADAPSTC MPLPFAGLNL PPRTAGSLTP SPAVLTCASP  720
LPPAPSSSLF SGSHAVRSGS TQAGLLALGS AASLPTVCSS GAMGAEVATN GHRAQMGADI  780
GGPYGQIGRF PPPGSRSTST LPLNFTSGLI FPSNALRIPG YEPPITRPLK SPPPPLSPLP  840
VAPLLKTSVA PVTAMMAPKI GGASTFSASS LTPSMHVPQF SLVGNFTGLT STPSSAGVRA  900
ATTWINSVTV SATGPSAPPR CSGKAGAPQD GLRGDGHTSV GSGEQSVASS QCLTKSEKSP  960
CVGERASSLG NSHDMEKLDA EVAPPENAAE GISKLGDVGG GNVSCLSKEG KSGVVEKVES  1020
ETERREKDLA ALNVENGKND GVEPGNGSLQ PECGGNAGST SLVPNAEGEQ KNGAAGSVPA  1080
AGRDGLSQPD RSANAGSDLP VAGTEGEQKS MSASVPNQDD PMEAIASDGC ERSGGAVAET  1140
DGWTPSQPLG GDDNAEQADD AAAKRQMASM QQTDGDEVAA AMETVVCGTE EQRVREAETD  1200
APEGAAAALP EPVGMGLCLS SPDAEMGCET GAEVHDKDTG EKPCPVLEAG VAGAQPVEDK  1260
CETTGAVSVE ADMDTENVKR AAESVEQIAA AALQSISEVM CEHSSPPEEG LHVDTVAVDA  1320
VATGTVSSEA QGQEKTVDVM DIVDSTIRSS LDKLCENRRR AQLAHDALSH LLPTGMSLDP  1380
SDANKLYSTP EEAPRWKENT ASHKQVAKAM EAVLLERMKD LQFKEKVLAL KFRALRESWK  1440
HEQAMSSRKG RPVRDKHSRK SDGDKRLGPT LQTQRRARPH GGGSSSNVVD AEGFQVVKKL  1500
LAEQGVEKMR AESKMPPMIL DEKERNTNRF INTNALIRDP IALEEEKKWT KPWTAEEKKI  1560
FMEKFALFNK NFKKIASFLD DRSTGDCVMF YYLHQKSEEF TKIRCPQQLK KRSFVKPNPI  1620
FMDLSQSGVS SAGRSREATV TAYQKPAGGE GTVSARAERT AAVEPHPVIR HARTPDRGRA  1680
GPPFALEGVG SDGSKGTGQA VLAGPVDFAP GGRDSGQHHY LPPGTSGKDG SSCTVKAQRR  1740
ACKDGGGRLE VIAEELPFAS SFHPSSAELA TIGGGDGSAA AVHGAESAAT ADKINYHTSA  1800
KSGSAHPAAI PNADVVTRAW SRRQDVSQNQ SWGYVDAQNC DVSEGAHLEG HPFSQQVAAM  1860
SKRDPDVRLQ QPPAWKPEGG EWRRRGHECS PALIGGEEKD ISASVSDEDL EHKSDKTKEI  1920
SADKDMKGSS AGALAERVDG PDNCAVSVAK DGEGGGSAEK LQQNCEKSAD GRNLPAEAMD  1980
ITPAAGGRCA RTDAGIPMAE EAEDITSDVP RVNRIQDAGA ADEVSCRVEN ATLEKAHSRL  2040
AIVELPIDKV DAILSTADMQ QEGAAFAEAA RAEATALPKG LQIPDRPEEG EPQANDSSLC  2100
RDEPMEGGAD DVVYKADDVE TAATPEAGCY DSYEKSTGRS PADNNAYEVK TEGEQDAKDQ  2160
YGRGGSEGGL GAESDDRKMS RHAVGGQGEL PSGRSNVSCA DARLDGNVCQ GVDVHTDGGN  2220
HHTCKTDRGS VMSEEIEGVT GNGPLLDQQQ GTAAILNEVT HRVENTLLVM ADGKVGSLMR  2280
PLDEVGGILD ATRIQEASMA FTESERVEGN TLPECSRILN PPGEGEPEVS GSSMFRDEPM  2340
EERSADVRLK ADDIVAAPIP EMEYYDRYRV SAECSPVDDT AYEVKTERGE EEGKGQCARG  2400
GSGGGGAECE DMDIGRHGSE GQGELSGARS SASEGDESVE GNVCEAGDMA GHGSSVSCKG  2460
SVDAQVNECE AKPVHPVADN RGVERVEDGN DADLDEEEEE RQFATEIEAA VEHADDDEDE  2520
QELETEEQND DGTADADDDD DGHDGNADDE EQEGIKGDGG SGDGGSGGGG DSQRGEDVLL  2580
GRIQEMRMAV LPVKGESDPN HGVDIILDDR TASSESVENI KVEVEVEERE MMRTEGAGFH  2640
EERPWDDDGQ EVMGCRKWCE PRLRGPAWDC RNEQMHPLFV TDLRGGMEGN ERESAPCKAQ  2700
STLARPVPTL KAAGEAKAKV PKMELCSVLD IGYDRSESGK ASLGGFAQCN TSTTQYYSQE  2760
KKPSLNDHIM SKRTPPSTNL RLDGLARAGG GWENLAKPLA QKPVPASVAL PVKREKDHTG  2820
MTVGVPLNQG PGSSGSVAAG VHSVAEKDRS SSKGSGGGVG EGKVRREPTT WSQEEKDKVF  2880
EVIERHGKDW NLLQECLPRK SLVQIKTFVQ NSKAKFGAMA EGGAGALHRV GERGRKRKSE  2940
DDLVGSSSSY SVPAMASSLH RQQMSQMQVR DQHQDGQPSQ PIHPIPVAAV ISVHGSPGSP  3000
SPARVVADEA NLTGVSGTPS KGGRVQPGVV GGDATAFSPP TGIPRHMGSE MLHEHATLAA  3060
FQQMMMQMFP RNSYGPPPGA AAGMVPVFHP AHPMFTPAFT AMSRPVPVPR HPAAMPPFGL  3120
LPAGMAPVGL TPVVSPAAAQ IPGMKKADTT TSIGVPPDID VCMAPMVRGI IGGREEELLS  3180
PFAQSVKPVA TVAQAHGQGQ GQCQTHAQAQ GPSQPQAQAQ PQGQSPTHTQ PQSVWNAQVD  3240
CVPQPHSHRS QTSGDVKLFG QSLLSQPQPS NAQNEVTASG MYLRETSAPV AVVVKRAATV  3300
NTSQDGGGES DAVGQSGCCG VDGTNTCLGS SSVAGGGAFG ILSPGAGGGF VAPAAFGGPQ  3360
LMLPYASGNL HRLSALADGR EIPTVAAIPS AVRIGGPAVW TVPPRPAMAG DLWASFMAHH  3420
HHHHHPNVGQ PVAMVGREVA VDEALSRERS VHESVSRVTE RLSRKEPEGG AGSATRRSAD  3480
SGSSHSRKSG ADGGRRICPV EVVPQERMQT GPTSECERTE GASLAAADHR TEPSVAIPRS  3540
RGRGASATVE EPMDAVHDSA RDGSEQQQQQ HDSPRGISPQ HGIRTSVQSV GGPSTPGHSP  3600
TAAASVASHP RTMWDFMAAA YAQEWPGGLG GQHPSMGLGR FDCSPAGMVW DNFRRVGLVP  3660
ARGRAPVVNP LARGVLGSVG AGLNLAMPEG LQGAVREGSI YYPAAPVMPA MNMPIPPQAS  3720
AWTPATISAQ RNDAAARVMI MAMAVNKQQQ QQQQQQQHQQ QQQQQQQQQQ QQHQHQQQQQ  3780
QQQQQQQQGE GAGAGVGGDR SEDHHLPTEG RDLDMRGARD AGVTSGMDID GEDNRDMLTT  3840
TERTTVAEAV NVVLSRCQME SSEGKLKATV RAHRKLTALP TTGDGFDIGA ISDAILQVCY  3900
AMGCSAFPRA TPRWWIKRRT GGTWEDLRQC DDATIDYFKE KLRMLPRVFR EIARTLSPYL  3960
QRRVMLYRVS LQPDQIIAYT LYRWALGETH DSGT
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.16e-30MYB family protein