PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopim04g049120.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family MYB
Protein Properties Length: 1581 aa    MW: 173413 Da    pI: 6.5139
Description MYB family protein
Gene Model
Gene Model ID         Type     Source   Coding Sequence
Sopim04g049120.0.1    genome   CSHL     View CDS
Signature Domain
Signature Domain
No.   Domain            Score   E-value   Start   End   HMM Start   HMM End
1     Myb_DNA-binding   28.4    3.8e-09   804     845   3           46
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         +WT+eE e ++d  + +G++ +++Ia+ +  ++t  +c+++++k
  Sopim04g049120.0.1 804 PWTPEERENFIDKLAAFGKD-FRKIASFLD-HKTTADCIEFYYK 845
                         8*****************99.*********.***********98 PP

2     Myb_DNA-binding   27.8    5.7e-09   925     965   3           45
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                          WT eE   +v+av  +G++ +  ++ ++g +R++ qck ++ 
  Sopim04g049120.0.1 925 DWTDEEKSTFVQAVSAYGKD-FVMVSGCVG-TRSRDQCKIFFS 965
                         5*****************99.*********.********8775 PP

Protein Features
Database          Entry ID           E-value    Start   End   InterPro ID   Description
SuperFamily       SSF46689           7.48E-13   792     848   IPR009057     Homeodomain-like
PROSITE profile   PS51293            15.573     800     851   IPR017884     SANT domain
SMART             SM00717            1.9E-9     801     849   IPR001005     SANT/Myb domain
Pfam              PF00249            5.5E-7     803     845   IPR001005     SANT/Myb domain
Gene3D            G3DSA:1.10.10.60   8.3E-6     804     845   IPR009057     Homeodomain-like
PROSITE profile   PS51293            10.901     921     972   IPR017884     SANT domain
SMART             SM00717            5.7E-7     922     970   IPR001005     SANT/Myb domain
SuperFamily       SSF46689           1.01E-9    923     972   IPR009057     Homeodomain-like
Gene3D            G3DSA:1.10.10.60   1.0E-4     925     966   IPR009057     Homeodomain-like
Pfam              PF00249            1.5E-6     925     964   IPR001005     SANT/Myb domain
CDD               cd00167            4.17E-5    926     964   No hit        No description
Gene Ontology
GO Term      GO Category          GO Description
GO:0003677   Molecular Function   DNA binding
Sequence
Protein Sequence    Length: 1581 aa
MPPEPLPWDR KDFFKERKHD RWREPTPHHH YTSSRWNPDY RSRATSGHGG KQGSYHMCPE  60
EPGHGFMPSR SNDKIVEDES NRPSRGDGGR YGRNSRENRS FGQRDWRGGH SWEAASPSGS  120
ARQNDATNDQ RSMDIAVPHS LSHPHSEHVN TCDQSHSREQ HNKSGSINGT ASVGQRFERE  180
SSLGSIEWRP LKWTRSGSLS SRGSLSHSGS SKSMGVDSNE TKPELQLGNS KAVKSLTGDA  240
TACVTSATPS EETSSRKKPR LGWGEGLAKY EKKKVEGPED NAVKVGASIS GDSAEPGHSQ  300
PLNLADRSPR VAVFPDCPSP ATPSSVACSS SPGLEDKQLV KATNIDQDVG NLCGSPSVVS  360
QYYSEGSGFN LENWDLAQIS NLNSSINELL LSEDPNSVDS GFMRSTAVNK LIVWKSDITK  420
ALEKTEVEID SLENELKTFI SGPENNQLVP SASCSPPKDC YANSQEDQGA TSNTASRPAP  480
LLVDIPDDLM GQEEADIHGN EPAEVKVEDI DSPGSATSKF VQLPSEKSVE PVVSMRHGGM  540
LISDDSMSRR LNVNMCSITE EKAKSRSSDL KLCNFNEEKA RDAIACGESS QPTANHSDSS  600
SNGSSNCGKD ALYNLIIAAN KDSAERAFEV FKNQLPASKC SFDFSRAVRG SSFQIDPAVK  660
ERFVKRKQFQ QFKEKIIALK FRVHQHLWKE DIRMLSVRKF RAKSQKKFDF SLRPVQIGHQ  720
KHRSTIRSRF SATVGSLSLV PSSEILNFAS RLLSELGAKV YRNTLRMPAL ILDKKERKMS  780
RFISKNSLVA DPCAVEEERG LINPWTPEER ENFIDKLAAF GKDFRKIASF LDHKTTADCI  840
EFYYKNHKSD CFERTRKKSE YSKQAKVCSA NTYLVASSGK RWNREEWKHL KVGLSTRLPR  900
TPEVTQRVDD ETCSDDSCGE MEPTDWTDEE KSTFVQAVSA YGKDFVMVSG CVGTRSRDQC  960
KIFFSKARKC LGLDKILPGS GNLDRLDMNG GSDPDACVME TKKSSLMLEN VSDLCMDAGI  1020
LKPDLTSSDD RDEAGELDSV DTELVSKNSV QVNCHVDKQE VDFNRDCEIQ IGVCIGSGQG  1080
DEDLITVSRE GVEIDGDASE IGLPYIPCEV STKPLGEEIR GVVSSPVHDL KNRKAEKTEV  1140
SRSNCSLEDR KPNMVLFGNN SRLAAARGGG LCPLNGSRNM TQLESDSECK LDVNYLESNI  1200
SFQRKQISEA SNADKLSELE LENVGDKQCE NATQSAEQPL SSTSRSAQVE SCQILGSYLL  1260
GESTLTENGD PGCRASAALQ EVQVGRNLQL DTFSTTCFLQ KCNGTNRGGC SVSDLVPNRE  1320
QTGSSSSVVE KPCRNGDVKL FGQILSKPCP KANPSSNAEP IDGSNQMLKV GSNSFSASHS  1380
LEGNSATAKF ERNNFLGSEN HPLRSFGFWD GSRIQTGFSS LPDSAILLAK YPAAFGSYGL  1440
SSTKMEQPSL HGVVKTTERN LNSPPVFAAR DSSSNSAVAG SDYQVYRNRD VQPFTIEMKQ  1500
RQDAVFSEMQ RRNGFDVVGI PQQARGVVVG RGGILQCSGV VSDPVAAIKM HYAKAEQFSG  1560
QAGSIMREDD SWRSKGDVSR *
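
The listing above is in a blocked layout (groups of ten residues, with a running position counter at the end of each full line and a trailing `*`). For downstream tools that expect a plain string, a minimal Python sketch such as the following could collapse it. The function name `parse_blocked_sequence` and the `keep_stop` parameter are hypothetical helpers introduced here for illustration; they are not part of PlantTFDB.

```python
def parse_blocked_sequence(text: str, keep_stop: bool = False) -> str:
    """Collapse a blocked, position-numbered protein listing into one string.

    Assumptions (not stated by the source page): tokens made only of digits
    are position counters, and a trailing '*' marks the end of the translation.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # skip the running position counter
            residues.append(token)
    seq = "".join(residues).upper()
    if not keep_stop:
        seq = seq.rstrip("*")  # drop the terminal stop symbol, if any
    return seq


# Example using the first two blocks and the last line of the listing.
sample = "MPPEPLPWDR KDFFKERKHD  20\nQAGSIMREDD SWRSKGDVSR *"
print(parse_blocked_sequence(sample))
print(len(parse_blocked_sequence(sample, keep_stop=True)))
```

In the full listing the counter reaches 1560 at the end of the penultimate line, and the final line adds 20 residues plus `*`. This suggests that the stated length of 1581 counts the terminal `*` symbol.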
3D Structure
Structure
PDB ID   Evalue   Query Start   Query End   Hit Start   Hit End   Description
4a69_C   4e-14    762           853         4           94        NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D   4e-14    762           853         4           94        NUCLEAR RECEPTOR COREPRESSOR 2
Annotation -- Nucleotide
Source    Hit ID     E-value   Description
GenBank   HG975516   0.0       HG975516.1 Solanum lycopersicum chromosome ch04, complete genome.
Annotation -- Protein
Source   Hit ID               E-value   Description
Refseq   XP_015073255.1       0.0       uncharacterized protein LOC107017599
TrEMBL   A0A3Q7G2Y3           0.0       A0A3Q7G2Y3_SOLLC; Uncharacterized protein
STRING   Solyc04g049120.2.1   0.0       (Solanum lycopersicum)
Orthologous Group
Lineage    Orthologous Group ID   Taxa Number   Gene Number
Asterids   OGEA5904               23            32
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT3G52250.1   1e-109    MYB family protein