PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopim03g119050.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family MYB
Protein Properties Length: 1071 aa    MW: 120152 Da    pI: 7.1834
Description MYB family protein
Gene Model
Gene Model ID       Type    Source
Sopim03g119050.0.1  genome  CSHL
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  22.9   2.1e-07  295    388  1          47
                         TSSS-HHHHHHHHHHHHHTTTT...............................................-HHHHHHHHTTTS-HHHHHHHH CS
     Myb_DNA-binding   1 rgrWTteEdellvdavkqlGgg...............................................tWktIartmgkgRtlkqcksrw 44 
                         r+ W++eE e lv++ kq   +                                               +W+ +a++   gR++ +c+srw
  Sopim03g119050.0.1 295 RKEWSKEESENLVKGLKQQFQEmllqrsvnllsdedgcsresgdlddviasirdlaitpetmrlflpkvNWDQVASMYLPGRSGAECQSRW 385
                         789****************999******************************************************9888*********** PP

                         HHH CS
     Myb_DNA-binding  45 qky 47 
                         +++
  Sopim03g119050.0.1 386 LNW 388
                         *98 PP

2    Myb_DNA-binding  38.2   3.2e-12  448    492  1          46
                         TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         r  WT eEd +l  av+ +G  +W  +a+ +  gRt+ qc +rw k
  Sopim03g119050.0.1 448 RREWTDEEDIKLSAAVETFGESNWQFVASVIE-GRTGTQCSNRWIK 492
                         678*****************************.***********76 PP

3    Myb_DNA-binding  38.5   2.7e-12  502    559  2          46
                         SSS-HHHHHHHHHHHHHTTTT..............-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   2 grWTteEdellvdavkqlGgg..............tWktIartmgkgRtlkqcksrwqk 46 
                         g+W+++Ed++l+ av ++ ++               Wk++a++++ gRt  qc++rw +
  Sopim03g119050.0.1 502 GKWSADEDKRLKVAVMLFYPKtwrnvvqsvpwrtpIWKKVAQYVP-GRTHVQCRERWVN 559
                         89*****************99************************.***********87 PP

4    Myb_DNA-binding  46.3   9.6e-15  569    610  3          46
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                          WT+eEd++l+ a+ ++G   W+++a++++  Rt++qc+ rw  
  Sopim03g119050.0.1 569 EWTEEEDLKLKSAIDEHGYS-WSKVAACIP-PRTDNQCRRRWIV 610
                         7*****************99.*********.9**********75 PP
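
The four Myb_DNA-binding hits listed above can be compared with a short script. The following is a minimal sketch, not part of the database output: the `DomainHit` type and the helper names are hypothetical, and it assumes the Start/End columns are 1-based, inclusive protein coordinates.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the Signature Domain table (hypothetical container)."""
    number: int
    score: float
    e_value: float
    start: int      # assumed 1-based, inclusive position on the protein
    end: int
    hmm_start: int
    hmm_end: int


# Values copied from the four Myb_DNA-binding rows of this record.
hits = [
    DomainHit(1, 22.9, 2.1e-07, 295, 388, 1, 47),
    DomainHit(2, 38.2, 3.2e-12, 448, 492, 1, 46),
    DomainHit(3, 38.5, 2.7e-12, 502, 559, 2, 46),
    DomainHit(4, 46.3, 9.6e-15, 569, 610, 3, 46),
]


def protein_span(hit: DomainHit) -> int:
    """Number of protein residues covered by the hit."""
    return hit.end - hit.start + 1


def hmm_span(hit: DomainHit) -> int:
    """Number of HMM positions covered by the hit."""
    return hit.hmm_end - hit.hmm_start + 1


for hit in hits:
    print(hit.number, protein_span(hit), hmm_span(hit))

# The hit with the smallest E-value is the most significant one.
best = min(hits, key=lambda h: h.e_value)
print("most significant hit:", best.number)
```

Hit 1 covers far more protein residues than HMM positions, which is consistent with the long insertion visible in its alignment.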

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50090           7.155     290    389  IPR017877    Myb-like domain
SMART            SM00717           2.4E-6    294    391  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  4.3E-10   297    313  IPR009057    Homeodomain-like
CDD              cd00167           0.00317   352    388  No hit       No description
Gene3D           G3DSA:1.10.10.60  4.3E-10   362    403  IPR009057    Homeodomain-like
SuperFamily      SSF46689          5.14E-12  371    439  IPR009057    Homeodomain-like
PROSITE profile  PS50090           6.62      390    442  IPR017877    Myb-like domain
SMART            SM00717           0.0031    394    444  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  2.1E-8    404    450  IPR009057    Homeodomain-like
SuperFamily      SSF46689          3.23E-18  428    500  IPR009057    Homeodomain-like
PROSITE profile  PS51294           18.755    443    498  IPR017930    Myb domain
SMART            SM00717           2.8E-12   447    496  IPR001005    SANT/Myb domain
CDD              cd00167           6.34E-10  450    492  No hit       No description
Gene3D           G3DSA:1.10.10.60  5.2E-16   451    499  IPR009057    Homeodomain-like
Pfam             PF13921           1.6E-11   451    512  No hit       No description
Gene3D           G3DSA:1.10.10.60  1.1E-12   500    551  IPR009057    Homeodomain-like
SMART            SM00717           1.6E-9    500    563  IPR001005    SANT/Myb domain
PROSITE profile  PS50090           8.676     502    561  IPR017877    Myb-like domain
CDD              cd00167           1.14E-6   503    561  No hit       No description
Pfam             PF00249           1.7E-5    538    559  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          1.12E-24  541    614  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  2.0E-20   552    608  IPR009057    Homeodomain-like
PROSITE profile  PS51294           21.937    562    616  IPR017930    Myb domain
SMART            SM00717           1.3E-13   566    614  IPR001005    SANT/Myb domain
Pfam             PF00249           1.3E-13   569    609  IPR001005    SANT/Myb domain
CDD              cd00167           1.58E-11  569    612  No hit       No description
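
The feature hits above overlap heavily, so it can help to collapse them into the regions they jointly cover. The sketch below is an illustration under stated assumptions: the Start/End pairs are as read from the Protein Features rows of this record and treated as closed, 1-based intervals, and `merge_intervals` is a hypothetical helper rather than anything provided by the database.

```python
def merge_intervals(intervals):
    """Merge closed integer intervals that overlap or directly touch."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current region if this interval reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# (Start, End) of every Protein Features row, in table order.
feature_spans = [
    (290, 389), (294, 391), (297, 313), (352, 388), (362, 403),
    (371, 439), (390, 442), (394, 444), (404, 450), (428, 500),
    (443, 498), (447, 496), (450, 492), (451, 499), (451, 512),
    (500, 551), (500, 563), (502, 561), (503, 561), (538, 559),
    (541, 614), (552, 608), (562, 616), (566, 614), (569, 609),
    (569, 612),
]

covered = merge_intervals(feature_spans)
print(covered)
```

With these coordinates all hits collapse into a single region spanning residues 290 to 616 of the 1071-residue protein.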
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1071 aa
MAFDSDDDFS DNDYDDGFQQ DMEALKKACL FAGKDADDLQ PSSSTGDVAA GDDDDDDDDV  60
TPSVSDANEY DDDIACLRNL QERFSLSTEL CEPINLKPLC SIFPPGSEGD EENDLETLRA  120
IERRFAAYDD DSGTRRKESP LDKFEQISEG CSNDVASSQN LIEGHNFDAE TTEASLNSFC  180
LPKSAHAFLD AIRKNRSCQK VMRDKMMQTE ARLEELKKLT ERVKILKSFQ LTCKKRMGRA  240
LSQKRDARVQ LISLPKQRFS SKGKKLSATH CGPPENSHVA SYREALTHFA VSLSRKEWSK  300
EESENLVKGL KQQFQEMLLQ RSVNLLSDED GCSRESGDLD DVIASIRDLA ITPETMRLFL  360
PKVNWDQVAS MYLPGRSGAE CQSRWLNWED PLIKHEGWDL LEEKNLLQVV QQKKMSDWVD  420
ISTSLGVCRT PFQCLSHYQR SLNASIIRRE WTDEEDIKLS AAVETFGESN WQFVASVIEG  480
RTGTQCSNRW IKSIHPAMKR CGKWSADEDK RLKVAVMLFY PKTWRNVVQS VPWRTPIWKK  540
VAQYVPGRTH VQCRERWVNS LDPSLKLDEW TEEEDLKLKS AIDEHGYSWS KVAACIPPRT  600
DNQCRRRWIV LFPDEVSMLK EAKKIRREAF ISNFVDREDE RPALRPNDIV PTQKLSSRAG  660
RETTSVNKKR KLRPRATKDD MAPRCDTMSE MEKPHAEGSE GPESSHLLKS SLLPSRDQGC  720
NDAMKNKRPS KLRRRKTKKS TPNDKVPEAS ASTDSTIADG NICKRRRRST SSLVKKKSRT  780
VPSASSMVES TTADGNICKR RRRASSLVKP KSRESSSSLP NLSSSMAVVE EAESLVQDSR  840
KAKNVMDKRN SASEYDDPCI SPQGHPLAHP HLDEGTADLN VGENENASAT GFQDYSLLLQ  900
RNAVVCTDEN ASQFEASATP GTREGEDCGA SVCHKLNKCN QLENNVKSSL DYLPQTADDG  960
MTLASFVCKL RAKVSSSSST KVARLHSGKA PSKAMSGDHC SRSCISGGHD GMEKRSKQEC  1020
TSSNQTSGTK VEDDMPLSSF IGRVKKRECT EVGDDMLLSL FVGRLKRERH *
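
The sequence above is displayed in groups of ten residues with a running position at the end of each line, so it has to be cleaned before it can be used elsewhere. The following is a minimal sketch of that cleanup; `parse_sequence_block` is a hypothetical helper, and only the first two lines of the record are used as sample input.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a grouped, position-numbered sequence block into one string."""
    parts = []
    for line in text.splitlines():
        # Drop the trailing running position (e.g. "  60") and any stop marker "*".
        line = re.sub(r"\s+\d+\s*$", "", line).replace("*", "")
        # Remove the spaces that separate the ten-residue groups.
        parts.append("".join(line.split()))
    return "".join(parts)


# First two display lines of this record's protein sequence.
sample = (
    "MAFDSDDDFS DNDYDDGFQQ DMEALKKACL FAGKDADDLQ PSSSTGDVAA GDDDDDDDDV  60\n"
    "TPSVSDANEY DDDIACLRNL QERFSLSTEL CEPINLKPLC SIFPPGSEGD EENDLETLRA  120"
)

seq = parse_sequence_block(sample)
print(len(seq))
print(seq[:20])
```

Applied to the whole block, the same function yields a plain residue string whose length can be compared with the stated protein length.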
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
1h88_C  7e-27    398          608        6          155      MYB PROTO-ONCOGENE PROTEIN
1h89_C  7e-27    398          608        6          155      MYB PROTO-ONCOGENE PROTEIN
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    732    737  RRRKTK
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975515  0.0      HG975515.1 Solanum lycopersicum chromosome ch03, complete genome.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_010318634.1      0.0      uncharacterized protein LOC101267796 isoform X1
TrEMBL  A0A0V0IVV7          0.0      A0A0V0IVV7_SOLCH; Putative ovule protein (Fragment)
STRING  Solyc03g119050.2.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA108762124
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G18100.1  1e-166   myb domain protein 4r1