PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID fgenesh1_pg.1_#_520
Common Name COCSUDRAFT_39327
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Trebouxiophyceae incertae sedis; Coccomyxaceae; Coccomyxa; Coccomyxa subellipsoidea
Family MYB
Protein Properties Length: 2275aa    MW: 235391 Da    PI: 6.6647
Description MYB family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
fgenesh1_pg.1_#_520  genome  JGI     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  27.7   6.3e-09  1080   1123  3          47
                           SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
      Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47  
                           +W++ E++l+v++v+++G + + +I r+mg+ Rt   +ks++ k 
  fgenesh1_pg.1_#_520 1080 SWSQAEEKLFVEGVQLYGLD-FPAIRRHMGQSRTIGAVKSFFSKN 1123
                           7*****************99.********************9876 PP

2    Myb_DNA-binding  36.7   9.5e-12  1577   1619  4          48
                           S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
      Myb_DNA-binding    4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48  
                           WT+ E   +++a k++G + W++ ++ ++  +tl q+k+++q+y+
  fgenesh1_pg.1_#_520 1577 WTEKEKVAFIEAYKMHGRN-WARLSEAVP-SKTLTQIKNYYQNYK 1619
                           *****************66.*********.**************7 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          1.64E-11  880    940   IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  9.7E-4    886    938   IPR009057    Homeodomain-like
PROSITE profile  PS51293           13.065    891    942   IPR017884    SANT domain
SMART            SM00717           9.8E-4    892    940   IPR001005    SANT/Myb domain
CDD              cd00167           9.11E-4   895    937   No hit       No description
PROSITE profile  PS51293           9.994     1076   1128  IPR017884    SANT domain
SMART            SM00717           1.6E-7    1077   1126  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          1.2E-7    1079   1130  IPR009057    Homeodomain-like
Pfam             PF00249           2.7E-6    1080   1122  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          2.39E-12  1571   1623  IPR009057    Homeodomain-like
PROSITE profile  PS51293           12.68     1572   1623  IPR017884    SANT domain
SMART            SM00717           1.5E-10   1573   1621  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  5.7E-8    1575   1618  IPR009057    Homeodomain-like
Pfam             PF00249           1.4E-9    1577   1619  IPR001005    SANT/Myb domain
CDD              cd00167           2.22E-8   1577   1619  No hit       No description
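
The hits in the table above cluster into a few stretches of the protein. The following is a minimal Python sketch of how they could be collapsed into merged regions. It assumes the Start and End columns are inclusive residue positions, and `merge_hits` is a hypothetical helper written for this illustration, not part of any PlantTFDB tool.

```python
# (start, end) pairs copied from the Start/End columns of the table above.
HITS = [
    (880, 940), (886, 938), (891, 942), (892, 940), (895, 937),
    (1076, 1128), (1077, 1126), (1079, 1130), (1080, 1122),
    (1571, 1623), (1572, 1623), (1573, 1621), (1575, 1618),
    (1577, 1619), (1577, 1619),
]


def merge_hits(hits):
    """Collapse overlapping (start, end) intervals into merged regions.

    Hypothetical helper for illustration only.
    """
    regions = []
    for start, end in sorted(hits):
        if regions and start <= regions[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            regions[-1][1] = max(regions[-1][1], end)
        else:
            # No overlap: open a new region.
            regions.append([start, end])
    return [tuple(region) for region in regions]


REGIONS = merge_hits(HITS)
for start, end in REGIONS:
    print(f"{start}-{end} ({end - start + 1} aa)")
```

With the coordinates listed here, the sketch reports three regions: 880-942, 1076-1130 and 1571-1623. The last two contain the two Myb_DNA-binding hits reported under Signature Domain.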
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 2275 aa
MPQERGEAPP GGPYRPERFG KKREWPGPPN GFERDHGPPP FSGVGGSGYP PGMEPRGRGS  60
PTQGDVRGSF DRGALPPPPP PARNDRASGW DSWGPSSREG GGWRGGASRL PPPPMRRQHS  120
SGREAPDFEA GELAGPGEPP LSRSGWDQKP GLASGGAREG RYGGGISRTA SLPLGGRMSA  180
PEAMPLAPDL PGPPPLLRSV SHPAARPVSP LHSSANRPLP NGSSSFEAHW PMAEEQHTRS  240
SNFPVAYTPL PHHLVSQPSH SLPPGIMSRR TDLPLPPAPP LLASPVLPDL AMPAADGGAP  300
SPALLRPAEA ASVGGVTPAR QLPEADYEAG ELSTPAGSVA GDANATPVSD DPPKRKRLGW  360
GQGLARLRSV DKRGLPADDH SEHSPRDSEA SRHTDAGAHD SSALNSPSFS ARTDDAAPAA  420
ALSPVVPGPD AAAAAATAVA AFTEEPAGSA LPPPPALLMV EPAVVIEAAA EVATPAPPPS  480
PLQPAALTPI RGPEEAPAAA AAPLASTPQS PKDRPTIFTP SPEARAPVVS PSLPPQLPPS  540
VSAEPSPLKQ EAIAAHEGES PIPEPVKAAG QQGAEQKPSK EVIMRKIDTA DSEIEALERQ  600
MAELSTALDA DVAAARDLAE EVTHLETARP METQPSEEES SIAQSTQGQL EAAMDVDVAE  660
KPTAVLHPLT KKAKALKRAR AGGVARITPP KLPSLATGEM SEGDRHLLRL PRRWQVEAVV  720
KTNHERAAEA RSVFLPLLHP DMLEDSLRGV MYGLVEEAPA YHHNLRTHQR LFDALLEHVR  780
RRRSALRRKE AALGRQYLAL YEQWRCHFTG KSGVRKDRPG MSLARAASGR KTARSDYEQA  840
QIMLQLQAVE RMKEMVHVPP QVLCPYERAA RRFVCRNARV EDPVELLKQE RQLRPWTPEE  900
KRIFNEKFLV HPKDFRRIAM HLDIRTTGDC VMHYYRIQKL DEFAAVRRKQ QLKKRRQQSE  960
VNRSITYLGI GGAAAAAKRG DPLAPHHAVV HADQLTSRGS KGPRGGGRGG RGGGRGSRLG  1020
GNTAAAAAAA AAAAAAAANI AAEAMAEVGA TRGDEDWTPA DEGGALDVGK KLGRGGESSS  1080
WSQAEEKLFV EGVQLYGLDF PAIRRHMGQS RTIGAVKSFF SKNRRRLDLD RLADDAAAKM  1140
EADEMANADL QHLAAPLARR PPSRSATPAA DARMPPPADI ALRSAEAAAA AAAAAAAAVV  1200
HAQDGPGFAA AGDAEMADAA DMLSSLQALS AAGGHPLSAP PLGPPLPLGV GPPGFGGLQP  1260
PFAPQVLQHL AAQGMGDVGG MHGPFPPNIP PQLAGLMQAQ AHAQAQQQQQ PNPVQALHFM  1320
QQMGLPPHVL HMLVHQMAAA GGPPGMGAHP PPGGLPPGLN PTLLMPGVMP GLHPLNLNLQ  1380
NYALFQALQG AGGMAPMSAA MHSVVPERQL PGQLPGQLPL GSGGFQAAAD ERAFHFQQQQ  1440
LQQQMGVQQQ MGVQQQLDNL QAMAEFKEAQ MKSTRPLSPP DAAGMEGLGL DPSQEARWRY  1500
DNGLHARMDV DPHQGPPQGG PGPPGPPPPF LVGGPPPRGE ALIMKKSASA DSLPVFCAAP  1560
VDEVAESVVA KRQMSLWTEK EKVAFIEAYK MHGRNWARLS EAVPSKTLTQ IKNYYQNYKV  1620
KLGLDRMELP ISAVQPISRK RSRTDAADSP AVSGTPSVPP PSTAAAAGSG AAPADLATAA  1680
LERLQVQQPE RPFSTPPGAS LEAVAAAADA ARDKSQSAPP PGLYSGQLPP AAPAFSDLER  1740
GPSPGARAAA AAEVAEAVDG QQLLPAAAAR QAAAKSMPHQ GLLPLIAREN VRPGHGGSLG  1800
GARMQQQLFS TGGGSSLAAR LGVAGSMSSS LPTPSPAGSP APSQRPEGVR ARASLPREAL  1860
AASDDLSDRM ARFLQHRRAG PPSSLPEAPA ITTFSPRVSA ASPPLLSGDL AIGIGVSAGH  1920
VSTPALLEVL RSIGQAGPAS PQQLIPPALP LANASPPPPQ PWPPSTEAVK AMLAAATAAA  1980
SATADAHRPS VADSAPGPGP GAAGQPSPAD APKQPLAADA RNEEPSPLGI GSLDQAGGRL  2040
QPDELFGQAA AAAAAMESDV PTMSAAEGPP QGSPQGAGQP VVQQADAAVS PAVVSDSAAV  2100
VPRQPAAQTP LQPQPAAAEL TSSSTAGPEV SMEAVAEHEA ASAAQVENAH ELYVAEQPTA  2160
DVQSPSRVQN GSVLAAPAEE AAAPDPAEGV APDLAASGRG IALADAPDAH AAPAPGQAAE  2220
TAQVAAPEKA SRAVVDAPAS SPELAGHAAE PLQPAVMAAM GAGDAAAVTL SDNQ*
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1005   1013  GGRGGRGGG
2    1007   1015  RGGRGGGRG
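
To relate these coordinates to the Protein Sequence block, the rows can be indexed by residue position and sliced. The sketch below is a rough illustration in Python, assuming each sequence row ends with the 1-based position of its last residue (as the running counters suggest); `index_rows` and `span` are hypothetical helpers. In this record the listed NLS strings appear to line up only when Start is read as a zero-based offset (residues Start+1 to End+1), whereas the Signature Domain start at 1080 matches a 1-based reading.

```python
# Three rows copied from the Protein Sequence block (residues 961-1140).
ROWS = [
    "VNRSITYLGI GGAAAAAKRG DPLAPHHAVV HADQLTSRGS KGPRGGGRGG RGGGRGSRLG  1020",
    "GNTAAAAAAA AAAAAAAANI AAEAMAEVGA TRGDEDWTPA DEGGALDVGK KLGRGGESSS  1080",
    "WSQAEEKLFV EGVQLYGLDF PAIRRHMGQS RTIGAVKSFF SKNRRRLDLD RLADDAAAKM  1140",
]


def index_rows(rows):
    """Map 1-based residue positions to residues (hypothetical helper)."""
    by_pos = {}
    for row in rows:
        *groups, counter = row.split()
        residues = "".join(groups)
        # The trailing counter is assumed to be the position of the last residue.
        first = int(counter) - len(residues) + 1
        for offset, aa in enumerate(residues):
            by_pos[first + offset] = aa
    return by_pos


def span(by_pos, first, last):
    """Return residues first..last, 1-based and inclusive."""
    return "".join(by_pos[p] for p in range(first, last + 1))


BY_POS = index_rows(ROWS)

# (Start, End, Sequence) as listed in the NLS table above.
NLS = [(1005, 1013, "GGRGGRGGG"), (1007, 1015, "RGGRGGGRG")]
for start, end, listed in NLS:
    as_one_based = span(BY_POS, start, end) == listed
    as_zero_based = span(BY_POS, start + 1, end + 1) == listed
    print(start, end, listed, as_one_based, as_zero_based)

# The first Signature Domain hit starts at 1080 with "SWSQAEEK" (1-based).
print(span(BY_POS, 1080, 1087))
```

For both NLS entries the 1-based comparison fails and the shifted one succeeds, so it is worth checking the convention before slicing the sequence with coordinates from different tables.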
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_005652288.1  0.0      hypothetical protein COCSUDRAFT_39327
TrEMBL  I0ZAS5          0.0      I0ZAS5_COCSC; Uncharacterized protein
STRING  XP_005652288.1  0.0      (Coccomyxa subellipsoidea)
Orthologous Group
Lineage       Orthologous Group ID  Taxa Number  Gene Number
Chlorophytae  OGCP2883              10           10
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  8e-18    MYB family protein
Publications
  1. Blanc G, et al.
    The genome of the polar eukaryotic microalga Coccomyxa subellipsoidea reveals traits of cold adaptation.
    Genome Biol., 2012. 13(5): p. R39
    [PMID:22630137]