PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sphfalx0306s0012.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Bryophyta; Sphagnophytina; Sphagnopsida; Sphagnales; Sphagnaceae; Sphagnum
Family MYB
Protein Properties Length: 1678 aa    MW: 181503 Da    pI: 6.0024
Description MYB family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sphfalx0306s0012.1.p  genome  JGI     View CDS
Signature Domain
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  51.5   2.3e-16  769    815  1          48
                           TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
       Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                           +g WT+eEd  l ++v+++G  +W++Ia ++  gR +kqc++rw+++l
  Sphfalx0306s0012.1.p 769 KGQWTPEEDRFLMELVERHGQQRWSLIATYLT-GRIGKQCRERWHNHL 815
                           799****************************9.*************97 PP

2    Myb_DNA-binding  48.9   1.5e-15  821    863  1          45
                           TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
       Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                           r+ W  eE+e lv a+++lG++ W+ Ia+ ++ gRt++ +k++w+
  Sphfalx0306s0012.1.p 821 RDGWNIEEEEALVSAHNKLGNR-WADIAKLIP-GRTENAIKNHWN 863
                           56799*****************.*********.***********8 PP

Protein Features
Database         Entry ID            E-value   Start  End   InterPro ID  Description
PROSITE profile  PS51294             30.664    764    819   IPR017930    Myb domain
SuperFamily      SSF46689            1.12E-29  766    862   IPR009057    Homeodomain-like
SMART            SM00717             8.8E-16   768    817   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60    1.7E-26   770    821   IPR009057    Homeodomain-like
Pfam             PF13921             7.6E-17   772    832   No hit       No description
CDD              cd00167             8.54E-14  772    815   No hit       No description
SMART            SM00717             9.6E-14   820    868   IPR001005    SANT/Myb domain
PROSITE profile  PS51294             19.531    820    870   IPR017930    Myb domain
Gene3D           G3DSA:1.10.10.60    2.5E-19   822    869   IPR009057    Homeodomain-like
CDD              cd00167             1.70E-10  824    863   No hit       No description
Gene3D           G3DSA:3.90.1170.40  9.0E-10   1601   1669  IPR003448    Molybdopterin biosynthesis MoaE
SuperFamily      SSF54690            1.44E-7   1601   1669  IPR003448    Molybdopterin biosynthesis MoaE
Pfam             PF02391             1.2E-8    1601   1666  IPR003448    Molybdopterin biosynthesis MoaE
Gene Ontology
GO Term     GO Category         GO Description
GO:0006777  Biological Process  Mo-molybdopterin cofactor biosynthetic process
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1678 aa
MKRTRPDQEA AAAASGMQQL LPTSAAPSPS GPLSPRADAA LSPGGGQLYL QLESGAVTTS  60
IMQVSAAAAA TVDQGKKKKK KKKQSTTVRS KSPHHVSSSY AAPAPAPTDM QLAMISLQQV  120
AHGVSSSNSN SSSSNTIPAQ QLLQLELNNS SSSFLGCSLQ NQNLGSLHHH PASSLAACLL  180
TTGDPNNNNN QDGEQQRHMC PSATTPDDHI IQEQRFSPEE DHMRSHHDAG SSWQSSCEFT  240
PATVSGGCSF NAHQLAGSSA AVNQRIEGGI LLQQHHDDHE PAGFMLQQVG IREEQWSDKS  300
NHHHPQTMLN DLPGQQQQVM DSSWWGQKRV KTSQGFGFNM VTSSCAAVEA QPAAANPSPN  360
QLEHGCMMQS GWKTLGPLQP LRSGLEMDNK QAAVVPRDHH QNNSSLTEFT SHCKSMELDS  420
SNMLHESQQT MLTEMRNELP ADPQQQQQQQ QTLLSSALTT ATTCEITSNP NPRISIARLT  480
DDQALMDFEN HFINHSMQNI AAAAAIDEEN MSMLHDSPPQ QQLQQLEVVK NRVQGGGQLA  540
AQAGSSVGFL HHGHVFAAAG SCDQTGLEWQ GEGLGATCVN DYLLLTSAIH KAQLLDLQLK  600
ENQRHNNYMK QSEATPASTH VAGVHESSLY HHHHLHAHDA QMVGHDGSGK SLLSSSSSLS  660
DLIDSHQHQQ DHERRIVVAS GQDVVSHAAA AAAVQELYFS DQLLLDALAA DHHQWFTASS  720
REAYEKKTHH HDDPQQQQHH HDLQILLPSD KRVFKRSSKG GPRRPNIIKG QWTPEEDRFL  780
MELVERHGQQ RWSLIATYLT GRIGKQCRER WHNHLRPDIK RDGWNIEEEE ALVSAHNKLG  840
NRWADIAKLI PGRTENAIKN HWNATMRRKD LRRKHRRPID GTLDGLEVIP RCTVLRDYQQ  900
KVVATQTERN NNNNSSSKPD LSITHDHDLD CLGGGATTGG HGESHSGTPD SGSPSPQLAC  960
ESSTGWTNSP QTKIQMDEAS LTPGHFEEMD SVDVDNIMHL MCATTGEDDE EDHGLGDEHH  1020
HHQCASYALA AATQGNHPLT LQSASCYVPF ANSSMTTTAV TTTPSLLQPR VSVWGQQGGA  1080
RSTSSNNVDG YLSCSSSSPV TVWGGGSGGG GVGTSGAGAW GLSSGEGELN DIATGGGGGG  1140
GGESLGHLLR PLWIPNHGTK TCCMGHNCSC TTSVAAGALA SSWKTCNTED DDDDDDDDDV  1200
DDNAVLIQNL QASTSELTTL DMDIPGKSAV ATQLTGTLEQ VVSKSFTLHQ AASLPYPLEP  1260
ENASECSRPA GEGLGQGSQQ HVQQIYFNSM NTNVLSSVQS SYPLEIPSPP QHHECASSSC  1320
FSAHDTLQLQ QHQQQLDSDR TFMQNPNGSC CGGSTNLMQL AMYPPVGIPE VSHLPHIMMM  1380
PETLNLQDGS HSMHTVVTAK PSTYTANCKL PAAPEDSFQK SMQAQTLNAT TASSYYDNPC  1440
SISIVPDQHE QQSGTSGTMA CLLNLPEVSS AAVGPSPQLL FNALNPMQHQ QQQQQQQQQQ  1500
WDHHNHHNPM VGVPICCSES DSIATAEDHS GGRQNHELDL IEFVDRPLDT TYYMERVKLS  1560
TAPEVGSYSL NVPHLWGGGG GSGSGDGCGQ LCIKGIDYPG HDTPLAEAKL REICHQMHAQ  1620
WQLGCIGLGH RIGVMKAGET RLAIAVSSSN FMDSMNAVHF AVEQLQNEAL LDHQFIM*
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
1mse_C  3e-45    767          869        2          104      C-Myb DNA-Binding Domain
1msf_C  3e-45    767          869        2          104      C-Myb DNA-Binding Domain
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    76     82   KKKKKKK
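
The Start and End columns in this record can be checked against the Protein Sequence block. The following is a minimal Python sketch, not part of PlantTFDB: the helper names `clean_sequence` and `region` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which is consistent with the seven-residue NLS reported at 76-82. Only the first two lines of the sequence block are embedded, since the NLS falls within the first 120 residues.

```python
import re


def clean_sequence(block: str) -> str:
    """Strip position counters, whitespace and a trailing '*' from a formatted block."""
    letters = re.sub(r"[^A-Za-z*]", "", block)
    return letters.rstrip("*").upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive (assumption)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates out of range")
    return seq[start - 1:end]


# First two lines of the Protein Sequence block from this record.
BLOCK = """
MKRTRPDQEA AAAASGMQQL LPTSAAPSPS GPLSPRADAA LSPGGGQLYL QLESGAVTTS  60
IMQVSAAAAA TVDQGKKKKK KKKQSTTVRS KSPHHVSSSY AAPAPAPTDM QLAMISLQQV  120
"""

seq = clean_sequence(BLOCK)
nls = region(seq, 76, 82)
print(len(seq), nls)  # prints: 120 KKKKKKK
```

Passing the full sequence block to `clean_sequence` in the same way allows the other Start/End pairs in this record (for example the Myb_DNA-binding hits at 769-815 and 821-863) to be extracted with `region`.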
Binding Motif
Motif ID  Method  Source                   Motif file
MP00567   DAP     Transfer from AT5G58850  Download
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G58850.1  6e-48    myb domain protein 119