PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG034549t1
Common Name TCM_034549
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family MIKC_MADS
Protein Properties Length: 185aa    MW: 21512.8 Da    PI: 10.2364
Description MIKC_MADS family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG034549t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    SRF-TF  85.5   3e-27    10     59   2          51
                      ---SHHHHHHHHHHHHHHHHHHHHHHHHHHT-EEEEEEE-TTSEEEEEE- CS
            SRF-TF  2 rienksnrqvtfskRrngilKKAeELSvLCdaevaviifsstgklyeyss 51
                      ri+n++ rqvtfskRrng+lKKA+EL +LCdaev v+ifsstgkly+++s
  Thecc1EG034549t1 10 RIDNSTSRQVTFSKRRNGLLKKAKELAILCDAEVGVMIFSSTGKLYDFAS 59
                      8***********************************************86 PP

2    K-box   89.1   8.5e-30  77     170  5          98
             K-box   5 sgksleeakaeslqqelakLkkeienLqreqRhllGedLesLslkeLqqLeqqLekslkkiRskKnellleqieelqkkekelqeenkaLrkk 97 
                       ++  + +++ + +q+e+a L+++++nLq+++R+++Ge+L+ Ls+k+Lq+Le+qLe sl+ +R kK+++l+++i+el +k + +++en +L kk
  Thecc1EG034549t1  77 QQLGNPTSEVKFWQREAAILRQQLQNLQENHRQMMGEELSGLSVKDLQNLESQLEMSLRGVRMKKDQILMDEIQELNRKGNLIHQENVELYKK 169
                       3444778999*********************************************************************************99 PP

             K-box  98 l 98 
                       +
  Thecc1EG034549t1 170 V 170
                       7 PP
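
The middle row of each alignment block can be tallied to summarize how closely the query follows the domain model. The sketch below does this for the SRF-TF block. It assumes the usual HMMER-style reading of that row (a letter marks a residue identical to the model consensus, `+` marks a similar residue, a space marks neither); the record itself does not state this convention, and `tally_match_line` is a hypothetical helper name.

```python
# SRF-TF alignment rows copied from the record (HMM positions 2-51, query 10-59).
CONSENSUS = "rienksnrqvtfskRrngilKKAeELSvLCdaevaviifsstgklyeyss"
MATCH     = "ri+n++ rqvtfskRrng+lKKA+EL +LCdaev v+ifsstgkly+++s"
QUERY     = "RIDNSTSRQVTFSKRRNGLLKKAKELAILCDAEVGVMIFSSTGKLYDFAS"


def tally_match_line(match: str) -> dict:
    """Count match-line symbols (assumed convention: letter, '+', other)."""
    identical = sum(ch.isalpha() for ch in match)
    similar = match.count("+")
    other = len(match) - identical - similar
    return {"identical": identical, "similar": similar, "other": other}


# All three rows must cover the same number of alignment columns.
assert len(CONSENSUS) == len(MATCH) == len(QUERY)

counts = tally_match_line(MATCH)
print(counts)
print(f"identity over aligned columns: {counts['identical'] / len(MATCH):.0%}")
```

Under that assumed convention, the 50 aligned columns split into 37 identical, 10 similar, and 3 other positions.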

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
SMART            SM00432   1.4E-39   1      60   IPR002100    Transcription factor, MADS-box
PROSITE profile  PS50066   31.511    1      61   IPR002100    Transcription factor, MADS-box
CDD              cd00265   1.60E-44  2      78   No hit       No description
SuperFamily      SSF55455  2.88E-32  2      91   IPR002100    Transcription factor, MADS-box
PRINTS           PR00404   5.3E-28   3      23   IPR002100    Transcription factor, MADS-box
PROSITE pattern  PS00350   0         3      57   IPR002100    Transcription factor, MADS-box
Pfam             PF00319   4.9E-25   10     57   IPR002100    Transcription factor, MADS-box
PRINTS           PR00404   5.3E-28   23     38   IPR002100    Transcription factor, MADS-box
PRINTS           PR00404   5.3E-28   38     59   IPR002100    Transcription factor, MADS-box
Pfam             PF01486   5.0E-28   83     170  IPR002487    Transcription factor, K-box
PROSITE profile  PS51297   16.196    86     176  IPR002487    Transcription factor, K-box
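
Several member databases report overlapping hits for the same InterPro entry, so it can help to collapse the rows into one outer span per entry. The sketch below is illustrative only: the tuples are transcribed from the rows above, `outer_span_by_interpro` is a hypothetical helper name, and Start and End are assumed to be 1-based inclusive residue positions.

```python
# (database, entry ID, start, end, InterPro ID) as read from the feature rows.
# None stands for the "No hit" InterPro cell of the CDD row.
FEATURES = [
    ("SMART", "SM00432", 1, 60, "IPR002100"),
    ("PROSITE profile", "PS50066", 1, 61, "IPR002100"),
    ("CDD", "cd00265", 2, 78, None),
    ("SuperFamily", "SSF55455", 2, 91, "IPR002100"),
    ("PRINTS", "PR00404", 3, 23, "IPR002100"),
    ("PROSITE pattern", "PS00350", 3, 57, "IPR002100"),
    ("Pfam", "PF00319", 10, 57, "IPR002100"),
    ("PRINTS", "PR00404", 23, 38, "IPR002100"),
    ("PRINTS", "PR00404", 38, 59, "IPR002100"),
    ("Pfam", "PF01486", 83, 170, "IPR002487"),
    ("PROSITE profile", "PS51297", 86, 176, "IPR002487"),
]


def outer_span_by_interpro(features):
    """Return {InterPro ID: (smallest start, largest end)} over all hits."""
    spans = {}
    for _database, _entry, start, end, interpro in features:
        if interpro is None:  # skip rows without an InterPro assignment
            continue
        low, high = spans.get(interpro, (start, end))
        spans[interpro] = (min(low, start), max(high, end))
    return spans


spans = outer_span_by_interpro(FEATURES)
for interpro_id, (start, end) in sorted(spans.items()):
    print(interpro_id, start, end)
```

This reports the outermost boundaries only; it does not check whether the individual hits leave gaps inside a span.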
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0010440  Biological Process  stomatal lineage progression
GO:0048574  Biological Process  long-day photoperiodism, flowering
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0008134  Molecular Function  transcription factor binding
GO:0042803  Molecular Function  protein homodimerization activity
Sequence
Protein Sequence    Length: 185 aa
MGRGKIVIRR IDNSTSRQVT FSKRRNGLLK KAKELAILCD AEVGVMIFSS TGKLYDFAST  60
SMRSVIERYN KTKEEHQQLG NPTSEVKFWQ REAAILRQQL QNLQENHRQM MGEELSGLSV  120
KDLQNLESQL EMSLRGVRMK KDQILMDEIQ ELNRKGNLIH QENVELYKKV NLIRQENMEL  180
YKKV*
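
The numbered sequence block can be flattened into a plain residue string and checked against the Start and End coordinates listed under Signature Domain. The following is a minimal sketch, not a PlantTFDB tool: `parse_sequence_block` and `slice_region` are hypothetical helper names, coordinates are assumed to be 1-based and inclusive, and the trailing `*` is assumed to be a stop symbol rather than a residue.

```python
import re

# Sequence block copied from the record; trailing numbers are position counters.
SEQUENCE_BLOCK = """
MGRGKIVIRR IDNSTSRQVT FSKRRNGLLK KAKELAILCD AEVGVMIFSS TGKLYDFAST  60
SMRSVIERYN KTKEEHQQLG NPTSEVKFWQ REAAILRQQL QNLQENHRQM MGEELSGLSV  120
KDLQNLESQL EMSLRGVRMK KDQILMDEIQ ELNRKGNLIH QENVELYKKV NLIRQENMEL  180
YKKV*
"""


def parse_sequence_block(block: str) -> str:
    """Keep letters only: drops spaces, position counters and the '*'."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_region(seq: str, start: int, end: int) -> str:
    """Extract a region given 1-based inclusive coordinates (assumed)."""
    return seq[start - 1:end]


protein = parse_sequence_block(SEQUENCE_BLOCK)
srf_tf = slice_region(protein, 10, 59)   # SRF-TF domain coordinates
k_box = slice_region(protein, 77, 170)   # K-box domain coordinates

print(len(protein))
print(srf_tf)
print(k_box)
```

The two slices reproduce the query rows of the domain alignments, which supports the 1-based inclusive reading. With the `*` dropped the string holds 184 residues, so the stated length of 185 appears to count the stop symbol; that is an inference from this block, not something the record documents.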
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5f28_A  4e-22    1            81         1          81       MEF2C
5f28_B  4e-22    1            81         1          81       MEF2C
5f28_C  4e-22    1            81         1          81       MEF2C
5f28_D  4e-22    1            81         1          81       MEF2C
Functional Description
Source   Description
UniProt  Probable transcription factor.
Binding Motif
Motif ID  Method  Source                   Motif file
MP00410   DAP     Transfer from AT3G57230  Download
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  KC155659  0.0      KC155659.1 Gossypium hirsutum MADS box protein MADS64 mRNA, complete cds.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_017981395.1  1e-131   PREDICTED: MADS-box transcription factor 23 isoform X4
Swissprot  Q6EP49          1e-107   MAD27_ORYSJ; MADS-box transcription factor 27
TrEMBL     A0A061FF38      1e-131   A0A061FF38_THECC; AGAMOUS-like 16
STRING     EOY15523        1e-131   (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM53024105
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G57230.1  1e-105   AGAMOUS-like 16
Publications
  1. Puig J, et al.
    Analysis of the expression of the AGL17-like clade of MADS-box transcription factors in rice.
    Gene Expr. Patterns, 2013 Jun-Jul. 13(5-6): p. 160-70
    [PMID:23466806]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]
  3. Yu C, et al.
    The effects of fluctuations in the nutrient supply on the expression of five members of the AGL17 clade of MADS-box genes in rice.
    PLoS ONE, 2014. 9(8): p. e105597
    [PMID:25140876]