PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sphfalx0002s0156.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Bryophyta; Sphagnophytina; Sphagnopsida; Sphagnales; Sphagnaceae; Sphagnum
Family MYB
Protein Properties Length: 1110aa    MW: 118720 Da    PI: 6.3533
Description MYB family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sphfalx0002s0156.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  48.9   1.5e-15  55     101  1          48
                           TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
       Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                           +g WT+eEde l +av+ + g++Wk+Ia+ +   Rt+ qc +rwqk+l
  Sphfalx0002s0156.1.p  55 KGGWTPEEDEVLRRAVQWHKGKNWKKIAEFFT-DRTDVQCLHRWQKVL 101
                           688****************************9.************986 PP

2    Myb_DNA-binding  58.4   1.6e-18  107    153  1          48
                           TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
       Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                           +g+WT+ Ed++++++v+++G   W+ Ia++++ gR +kqc++rw+++l
  Sphfalx0002s0156.1.p 107 KGAWTKSEDDRILELVTKHGATKWSMIAQHLP-GRIGKQCRERWHNHL 153
                           79******************************.*************97 PP

3    Myb_DNA-binding  57.2   3.8e-18  159    203  1          47
                           TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
       Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                           r +WT++Ed+ l++a+k++G++ W+ Ia+ ++ gRt++++k++w++ 
  Sphfalx0002s0156.1.p 159 REAWTEQEDLTLIHAHKLYGNK-WAEIAKFLP-GRTDNSIKNHWHST 203
                           789*******************.*********.***********975 PP

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS51294           16.383    50     101  IPR017930    Myb domain
SuperFamily      SSF46689          1.15E-15  52     108  IPR009057    Homeodomain-like
SMART            SM00717           3.7E-13   54     103  IPR001005    SANT/Myb domain
Pfam             PF00249           1.4E-13   55     101  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  6.5E-21   57     113  IPR009057    Homeodomain-like
CDD              cd00167           2.19E-12  58     101  No hit       No description
PROSITE profile  PS51294           31.742    102    157  IPR017930    Myb domain
SuperFamily      SSF46689          4.87E-31  104    200  IPR009057    Homeodomain-like
SMART            SM00717           3.8E-16   106    155  IPR001005    SANT/Myb domain
Pfam             PF00249           1.4E-16   107    153  IPR001005    SANT/Myb domain
CDD              cd00167           2.14E-14  109    153  No hit       No description
Gene3D           G3DSA:1.10.10.60  3.1E-26   114    160  IPR009057    Homeodomain-like
SMART            SM00717           9.0E-18   158    206  IPR001005    SANT/Myb domain
PROSITE profile  PS51294           24.751    158    208  IPR017930    Myb domain
Pfam             PF00249           1.4E-16   159    202  IPR001005    SANT/Myb domain
CDD              cd00167           2.11E-13  161    201  No hit       No description
Gene3D           G3DSA:1.10.10.60  1.3E-23   161    208  IPR009057    Homeodomain-like
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1110 aa
MASFNVPPGN NGITENVQQQ RSSGCSSGAE SDDDTSHQLP PVHGRTGGPT RRSSKGGWTP  60
EEDEVLRRAV QWHKGKNWKK IAEFFTDRTD VQCLHRWQKV LNPDLVKGAW TKSEDDRILE  120
LVTKHGATKW SMIAQHLPGR IGKQCRERWH NHLNPNIKRE AWTEQEDLTL IHAHKLYGNK  180
WAEIAKFLPG RTDNSIKNHW HSTMKKKVDP ATATDPVSKA LAAYQAHQES MNPINSVDQL  240
ESSVSMVGKV GLLTNEATTS SQLIRDIHAA SGTPSMTQPL QFGEQYNEAK SANAAPSSTA  300
YCILSANHEN DPKKIGKGAH MQKEHSSCEA LAVHSGSGTL QQSCDVSPQS SGQSQGYSGW  360
SSNQSLGYSV GFAGVPMSRT LVPSLLSQQL PLGSLPESGL LSTVELPILP SFSVVRPALE  420
MASSSSQEFQ NSSVPISMCA TGSFISAGNV PVTSSNLMPS NDTLSNRSMV TNEGNLTLMD  480
PNMQTFEWMR SCLFSKSPFA APVALPSGHC DTEAEECSME VVGYQDEHLK AANQAVNLKM  540
SGPAPELDGS MGIALDALFY EPPRIACLAD SPFPSYDLIQ GGQSLQAYSP LGVRQMIMPP  600
GNCITPPPYE LPQSPLQDCS PQSILRSAAR SFSGSPSILR KRPRPLTIPP KPRSVCDKRD  660
EQLGQTRMSR GSQDNAVGLS GSMKTADDSM DGKPSEGSAK AAACQARTAL FVSPPYHLGK  720
TSSKVSGAED TGSDVPNDTQ YTSPDCAIDS SGKVGGNSTT LSARDGRHGA ACSNGPWESA  780
GGKYNKRGRP QGLNHECDMG TRRRGGTTVP VKTFQDGSLL FEHQTAGQGQ PCFPLDANTL  840
PTHLVMLPGA ENDGNPKVLG SKHGFSDMRA SSCEAGVVGT GGLMSGTVLS RSDRREMMSP  900
GAMSTFGVWG SVTCASTGVA VLHHTPAKDW LTELESFFAS PTTIWREPWR MDPSPSAQKD  960
GGISFLEELE AAVDRGVDDA LGIMQHLSEQ NASAYCEAEE ILAKPPSAEC MPSPPPCYQT  1020
RADANQNAHD CKENIRGGSD SVDGGTSSFV NWLSPFHVDF LSPDGAVAGT PVRGVSSVSL  1080
TGGADDNHLV SGGACDAFSP SMYLLKECR*
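
The domain coordinates in the Signature Domain table can be checked against this sequence listing. The following is a minimal sketch, not part of the database record: it assumes the listing format shown above (groups of 10 residues, a running position counter at the end of each row, `*` marking the stop), and the helper names `parse_sequence_block` and `slice_1based` are hypothetical. Only the first two rows of the sequence are used, which is enough to recover domain 1 (start 55, end 101).

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a numbered, space-grouped protein listing into one string."""
    parts = []
    for line in block.splitlines():
        # Keep letters only: drops group spaces, the trailing position
        # counter, and the '*' stop marker.
        parts.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(parts)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive."""
    return seq[start - 1:end]


# First two rows of the protein sequence from this record (residues 1-120).
block = """\
MASFNVPPGN NGITENVQQQ RSSGCSSGAE SDDDTSHQLP PVHGRTGGPT RRSSKGGWTP  60
EEDEVLRRAV QWHKGKNWKK IAEFFTDRTD VQCLHRWQKV LNPDLVKGAW TKSEDDRILE  120
"""

seq = parse_sequence_block(block)
domain1 = slice_1based(seq, 55, 101)  # coordinates of Myb domain no. 1
print(len(seq))   # 120
print(domain1)    # KGGWTPEEDEVLRRAVQWHKGKNWKKIAEFFTDRTDVQCLHRWQKVL
```

The extracted string matches the query line of the first Myb_DNA-binding alignment once its gap character is removed, which is consistent with the domain table using 1-based, inclusive coordinates.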
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
1h88_C  4e-70    55           208        6          159      MYB PROTO-ONCOGENE PROTEIN
1h89_C  4e-70    55           208        6          159      MYB PROTO-ONCOGENE PROTEIN
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    640    644  KRPRP
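
The NLS coordinates can be cross-checked against the protein sequence listed above. The sketch below is an illustration only: it takes the sequence row that ends at position 660 (residues 601-660) and locates the motif KRPRP in 1-based coordinates; the helper name `find_1based` is hypothetical.

```python
# Sequence row covering residues 601-660 of this record, spaces removed.
row = (
    "GNCITPPPYE LPQSPLQDCS PQSILRSAAR "
    "SFSGSPSILR KRPRPLTIPP KPRSVCDKRD"
).replace(" ", "")
row_first = 601  # 1-based position of the first residue in this row


def find_1based(motif: str, fragment: str, first_pos: int):
    """Return the 1-based inclusive (start, end) of motif, or None if absent."""
    i = fragment.find(motif)
    if i < 0:
        return None
    start = first_pos + i
    return start, start + len(motif) - 1


nls_span = find_1based("KRPRP", row, row_first)
print(nls_span)  # (641, 645)
```

In 1-based coordinates the motif sits at 641-645, one position past the 640-644 given in the NLS table. This would be consistent with the NLS table counting from 0, unlike the domain table, but the record does not state its convention, so treat that reading as an assumption.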
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G11510.2  8e-85    myb domain protein 3r-4