PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopim03g114840.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family MIKC_MADS
Protein Properties Length: 247aa    MW: 28398.2 Da    PI: 8.8752
Description MIKC_MADS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopim03g114840.0.1genomeCSHLView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1SRF-TF99.81.1e-31959151
                        S---SHHHHHHHHHHHHHHHHHHHHHHHHHHT-EEEEEEE-TTSEEEEEE- CS
              SRF-TF  1 krienksnrqvtfskRrngilKKAeELSvLCdaevaviifsstgklyeyss 51
                        krienk+nrqvtf+kRrng+lKKA+ELSvLCdaeva+iifs++gklye++s
  Sopim03g114840.0.1  9 KRIENKINRQVTFAKRRNGLLKKAYELSVLCDAEVALIIFSNRGKLYEFCS 59
                        79***********************************************96 PP

2K-box1003.3e-33821739100
               K-box   9 leeakaeslqqelakLkkeienLqreqRhllGedLesLslkeLqqLeqqLekslkkiRskKnellleqieelqkkekelqeenkaLrkkle 99 
                         ++ +++++ ++e+ +Lk+++e Lqr+qR++lGedL++Ls k+L+qLe+qLe+slk+iRs+K++++l+q+ +lq+ke++l e n+ Lr+kle
  Sopim03g114840.0.1  82 QSVTDTQNNYHEYLRLKARVELLQRSQRNFLGEDLGTLSSKDLEQLENQLESSLKQIRSRKTQFMLDQLADLQQKEQMLAESNRLLRRKLE 172
                         567789***********************************************************************************98 PP

               K-box 100 e 100
                         e
  Sopim03g114840.0.1 173 E 173
                         7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM004321.3E-41160IPR002100Transcription factor, MADS-box
PROSITE profilePS5006633.529161IPR002100Transcription factor, MADS-box
CDDcd002658.13E-45278No hitNo description
SuperFamilySSF554555.36E-34282IPR002100Transcription factor, MADS-box
PRINTSPR004048.9E-33323IPR002100Transcription factor, MADS-box
PROSITE patternPS003500357IPR002100Transcription factor, MADS-box
PfamPF003192.3E-261057IPR002100Transcription factor, MADS-box
PRINTSPR004048.9E-332338IPR002100Transcription factor, MADS-box
PRINTSPR004048.9E-333859IPR002100Transcription factor, MADS-box
PfamPF014863.1E-2884171IPR002487Transcription factor, K-box
PROSITE profilePS5129715.93587177IPR002487Transcription factor, K-box
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0010076Biological Processmaintenance of floral meristem identity
GO:0048440Biological Processcarpel development
GO:0048441Biological Processpetal development
GO:0048442Biological Processsepal development
GO:0048443Biological Processstamen development
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0046983Molecular Functionprotein dimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 247 aa     Download sequence    Send to blast
MGRGRVELKR IENKINRQVT FAKRRNGLLK KAYELSVLCD AEVALIIFSN RGKLYEFCST  60
SSMVKTIEKY QRCSYATLEA NQSVTDTQNN YHEYLRLKAR VELLQRSQRN FLGEDLGTLS  120
SKDLEQLENQ LESSLKQIRS RKTQFMLDQL ADLQQKEQML AESNRLLRRK LEESVAGFPL  180
RLCWEDGGDH QLMHQQNRLP NTEGFFQPLG LHSSSPHFGY NPVNTDEVNA AATAHNMNGF  240
IHGWML*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4ox0_A5e-289217122101Developmental protein SEPALLATA 3
4ox0_B5e-289217122101Developmental protein SEPALLATA 3
4ox0_C5e-289217122101Developmental protein SEPALLATA 3
4ox0_D5e-289217122101Developmental protein SEPALLATA 3
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtMADS-box transcription factor that acts redundantly with J2 to control meristem maturation and inflorescence architecture. {ECO:0000269|PubMed:28528644}.
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00030SELEXTransfer from AT2G03710Download
Motif logo
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAY2943290.0AY294329.1 Lycopersicon esculentum MADS-box protein 1 mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqNP_001234380.10.0MADS-box protein 1
RefseqXP_015068142.10.0MADS-box protein EJ2
SwissprotQ7Y0400.0EJ2_SOLLC; MADS-box protein EJ2
TrEMBLM1CAG71e-179M1CAG7_SOLTU; Uncharacterized protein
STRINGSolyc03g114840.2.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA4024625
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G03710.11e-88MIKC_MADS family protein
Publications ? help Back to Top
  1. Tomato Genome Consortium
    The tomato genome sequence provides insights into fleshy fruit evolution.
    Nature, 2012. 485(7400): p. 635-41
    [PMID:22660326]
  2. Soyk S, et al.
    Bypassing Negative Epistasis on Yield in Tomato Imposed by a Domestication Gene.
    Cell, 2017. 169(6): p. 1142-1155.e12
    [PMID:28528644]