PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopen04g018550.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family MYB
Protein Properties Length: 1677aa    MW: 183461 Da    PI: 6.3792
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopen04g018550.1genomespennView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding28.34.1e-09804845346
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
   Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                       +WT+eE e ++d  + +G++ +++Ia+ +  ++t  +c+++++k
  Sopen04g018550.1 804 PWTPEERENFIDKLAAFGKD-FRKIASFLD-HKTTADCIEFYYK 845
                       8*****************99.*********.***********98 PP

2Myb_DNA-binding27.76.1e-0910221062345
                        SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
   Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                         WT eE   +v+av  +G++ +  ++ ++g +R++ qck ++ 
  Sopen04g018550.1 1022 DWTDEEKSTFVQAVSAYGKD-FVMVSGCVG-TRSRDQCKIFFS 1062
                        5*****************99.*********.********8775 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.22E-13792848IPR009057Homeodomain-like
PROSITE profilePS5129315.573800851IPR017884SANT domain
SMARTSM007171.9E-9801849IPR001005SANT/Myb domain
PfamPF002495.9E-7803845IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.608.9E-6804845IPR009057Homeodomain-like
PROSITE profilePS5129310.6110181069IPR017884SANT domain
SMARTSM007175.4E-710191067IPR001005SANT/Myb domain
SuperFamilySSF466891.01E-910201069IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.1E-410221063IPR009057Homeodomain-like
PfamPF002491.6E-610221061IPR001005SANT/Myb domain
CDDcd001674.46E-510231061No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1677 aa     Download sequence    Send to blast
MPPEPLPWDR KDFFKERKHD RWREPTPHHH YTSSRWNPDY RSRATSGHGG KQGSYHMCPE  60
EPGHGFMPSR SNDKIVEDES NRPSRGDGGR YGRNSRENRS FGQRDWRGGH SWEAASPSGS  120
ARQNDATNDQ RSMDVAVPHS LSHPHSEHVN TCDQSHSREQ HNKSGSINGT ASAGQRFERE  180
SSLGSIEWRP LKWTRSGSLS SRGSLSHSGS SKSIGVDSNE TKTELQLGNS KAVKSLTGDA  240
TACVTSATPS EETSSRKKPR LGWGEGLAKY EKKKVEGPED NAVKVGASIS GDSAEPGHSQ  300
PLNLADRSPR VAVFPDCPSP ATPSSVACSS SPGLEDKQLV KATNIDQDVG NLCGSPSVVS  360
QYYSEGSGFN LENWDLAQIS NLNSSINELL LSEDPNSVDS GFMRSTAVNK LIVWKSDITK  420
ALEKTEVEID SLENELKTLI SGPENNQLVP SASCSPPKDC YANSQEDQGA TSNTASRPAP  480
LLVDIPDDLM GQEEADIHGN EPAQVKVEDI DSPGSATSKF VQLPSEKSVE PVVSMRHGGM  540
LISDDSKSRR LNVNMCSITE EKAKSRSSDL KLCNINEEKA RDAIACGESS QPTANHSDSA  600
SNGSSNCGKD ALYNLIIAAN KDSAERAFEV FKNQLPASKC SFDFSRAVRG SSFQIDPAVK  660
ERFVKRKQFQ QFKEKIIALK FRVHQHLWKE DIRMLSVRKF RAKSQKKFDF SLRPVQIGHQ  720
KHRSTIRSRF SATVGSLSLV PSSEILNFAS RLLSELGAKV YRNTLRMPAL ILDKKERTMS  780
RFISKNSLVA DPCAVEEERG LINPWTPEER ENFIDKLAAF GKDFRKIASF LDHKTTADCI  840
EFYYKNHKSD CFERTRKKSE YSKQAKVCSA NTYLVASSGK RWNREANSVS LDILGAASAL  900
AANVEDSIEI QPKGMSKYSV RMVNEYKASR LNELERSNSL DVCHSERETV AADVLAGICG  960
SLSSEAMSSC ITSSVDPGEG NQEWKHLKVG LSTRLPRTPE VTQSVDDETC SDDSCGEMDP  1020
TDWTDEEKST FVQAVSAYGK DFVMVSGCVG TRSRDQCKIF FSKARKCLGL DKILPGSGNL  1080
DRLDMNGGSD PDACVMETKK SSLMLENVSD LCMDAGILKP DLTSSDDRDE AGELDSVDTE  1140
LVSKNSVQVN CHVDKQEVDF NRDCEIQIGV CIGSGQGDED LITVSREGVE IDGDASEIGL  1200
PYIPCEVSTK HLGEEIRGVV SSPVHDLKNR KAEKTEVSRS NCSLEDRKPN MVLFGNNSRL  1260
AAARGGGLCP LNGSRNRTQL ESDSECKLDV NYLESNISFQ RKQISEASNA DKLSELELEN  1320
VGDKQCENAT QSAEQPLSST SRLAQVESCQ ILGSYLLGES TLTENGDPGC RASAASQEIQ  1380
VGRNLQLDTF STTCFLQKCN GTNRGGCSVS DLVPNREQTG SSSSVVEKPC RNGDVKLFGQ  1440
ILSKPCPKAN PSSNAERIDG SNQKMKVGSN SFSASHSLEG NSATAKFERN NFLGSENHPL  1500
RSFGFWDGSR IQTGFSSLPD SAILLAKYPA AFGSYGLSST KMEQPSLHGV VKTTERNLNS  1560
PPVFAARDSS SNSAVAGSDY QVYRNRDVQP FTIEMKQRQD AVFSEMQRRN GFDVVGIPQQ  1620
TRGVVVGRGG ILQCSGVVSD PVAAIKMHYA KAEQFSGQAG SIMREDDSWR SKGDVSR
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754430.0HG975443.1 Solanum pennellii chromosome ch04, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_015073255.10.0uncharacterized protein LOC107017599
TrEMBLA0A3Q7G2Y30.0A0A3Q7G2Y3_SOLLC; Uncharacterized protein
STRINGXP_009803598.10.0(Nicotiana sylvestris)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA59042332
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-155MYB family protein