PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG001182t1
Common Name TCM_001182
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family MIKC_MADS
Protein Properties Length: 338aa    MW: 37831.4 Da    PI: 5.4033
Description MIKC_MADS family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG001182t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    SRF-TF  87.9   5.4e-28  9      58   1          50
                      S---SHHHHHHHHHHHHHHHHHHHHHHHHHHT-EEEEEEE-TTSEEEEEE CS
            SRF-TF  1 krienksnrqvtfskRrngilKKAeELSvLCdaevaviifsstgklyeys 50
                      krien++nrqvtfskRrng++KKA+ELS+LCd+++a+i+fs++g+l  +s
  Thecc1EG001182t1  9 KRIENNTNRQVTFSKRRNGLIKKAYELSILCDIDIALIMFSPSGRLSHFS 58
                      79********************************************9997 PP

2    K-box   28.6   5.7e-11  108    180  7          79
             K-box   7 ksleeakaeslqqelakLkkeienLqreqRhllGedLesLslkeLqqLeqqLekslkkiRskKnellleqiee 79 
                       + +++++ae+ ++e+++L+ ++++ ++++R +  +  +  s+ eL++ e++Le++l+++ ++Kn l+ +++++
  Thecc1EG001182t1 108 STTTNSNAEEIHREISNLQHQLQMAEEQLRVYEPDPWTLNSMAELESCEKNLEQALTRVNQRKNYLMSNHLST 180
                       33688999*************************************************************9987 PP
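
The middle row of each alignment block above can be tallied programmatically. The sketch below is illustrative only and assumes the usual HMMER3 convention for that row: a letter marks a residue identical to the model consensus, `+` marks a positive-scoring substitution, and a space marks any other column. The function name `tally_midline` is hypothetical; the string is copied from the SRF-TF block.

```python
from collections import Counter

# Middle row of the SRF-TF alignment block, copied from the record above.
SRF_TF_MIDLINE = "krien++nrqvtfskRrng++KKA+ELS+LCd+++a+i+fs++g+l  +s"


def tally_midline(midline: str) -> Counter:
    """Count identical, similar ('+'), and other columns in an alignment midline."""
    counts = Counter()
    for ch in midline:
        if ch.isalpha():
            counts["identical"] += 1   # letter: query matches the consensus residue
        elif ch == "+":
            counts["similar"] += 1     # plus: assumed positive-scoring substitution
        else:
            counts["other"] += 1       # space: neither identical nor similar
    return counts


counts = tally_midline(SRF_TF_MIDLINE)
aligned = sum(counts.values())
print(dict(counts))
print(f"{counts['identical'] / aligned:.0%} identical over {aligned} columns")
```

The number of columns should equal End - Start + 1 from the domain table (58 - 9 + 1 = 50) whenever the block contains no gaps.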

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50066   29.758    1      61   IPR002100    Transcription factor, MADS-box
SMART            SM00432   8.5E-38   1      60   IPR002100    Transcription factor, MADS-box
SuperFamily      SSF55455  6.67E-30  2      82   IPR002100    Transcription factor, MADS-box
CDD              cd00265   2.09E-37  2      78   No hit       No description
PRINTS           PR00404   4.0E-26   3      23   IPR002100    Transcription factor, MADS-box
PROSITE pattern  PS00350   0         3      57   IPR002100    Transcription factor, MADS-box
Pfam             PF00319   6.8E-26   10     57   IPR002100    Transcription factor, MADS-box
PRINTS           PR00404   4.0E-26   23     38   IPR002100    Transcription factor, MADS-box
PRINTS           PR00404   4.0E-26   38     59   IPR002100    Transcription factor, MADS-box
Pfam             PF01486   5.6E-8    111    180  IPR002487    Transcription factor, K-box
PROSITE profile  PS51297   6.913     115    199  IPR002487    Transcription factor, K-box
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0046983  Molecular Function  protein dimerization activity
Sequence
Protein Sequence    Length: 338 aa
MGRVKLQIKR IENNTNRQVT FSKRRNGLIK KAYELSILCD IDIALIMFSP SGRLSHFSGK  60
RRIEDVLSRY INLPDQDRGS LVRNKEFLLS TLKKLKDENE IALQLASSTT TNSNAEEIHR  120
EISNLQHQLQ MAEEQLRVYE PDPWTLNSMA ELESCEKNLE QALTRVNQRK NYLMSNHLST  180
FSDPSTVQMY LDSQEGEPSS FENEVVRWLP ENGQNPTQYC AGSESSCIPV RNQSSSTIYD  240
PMPHGANMTV DACDMGGCHV STSSNDGLSP WHHSYTSTEL LSAFMSPTSF PLMKDIAGPS  300
IPQVVVSQQQ VETASNCPQM PPTGEGANYE SNLPHLN*
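
The blocked sequence above can be turned into a plain string and sliced with the Start/End coordinates from the Signature Domain table. The sketch below is a minimal illustration, assuming those coordinates are 1-based and inclusive; the helper names `parse_blocked_sequence` and `domain_slice` are hypothetical and not part of any PlantTFDB tool.

```python
import re

# Sequence block copied from the record above, including the position
# counters at the line ends and the trailing '*' marker.
RAW = """\
MGRVKLQIKR IENNTNRQVT FSKRRNGLIK KAYELSILCD IDIALIMFSP SGRLSHFSGK  60
RRIEDVLSRY INLPDQDRGS LVRNKEFLLS TLKKLKDENE IALQLASSTT TNSNAEEIHR  120
EISNLQHQLQ MAEEQLRVYE PDPWTLNSMA ELESCEKNLE QALTRVNQRK NYLMSNHLST  180
FSDPSTVQMY LDSQEGEPSS FENEVVRWLP ENGQNPTQYC AGSESSCIPV RNQSSSTIYD  240
PMPHGANMTV DACDMGGCHV STSSNDGLSP WHHSYTSTEL LSAFMSPTSF PLMKDIAGPS  300
IPQVVVSQQQ VETASNCPQM PPTGEGANYE SNLPHLN*
"""


def parse_blocked_sequence(raw: str) -> str:
    """Keep only residue letters; drops spaces, counters, and the '*' marker."""
    return "".join(re.findall(r"[A-Za-z]", raw)).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = parse_blocked_sequence(RAW)
mads = domain_slice(seq, 9, 58)     # SRF-TF row of the Signature Domain table
kbox = domain_slice(seq, 108, 180)  # K-box row of the Signature Domain table

print(len(seq))
print(mads)
print(kbox)
```

The two slices should reproduce the Thecc1EG001182t1 rows of the alignment blocks. The letter count comes out one lower than the listed Length, which would be consistent with the listed figure counting the trailing `*`.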
3D Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
6bz1_A  3e-17    1            94         1          90       MEF2 CHIMERA
6bz1_B  3e-17    1            94         1          90       MEF2 CHIMERA
6bz1_C  3e-17    1            94         1          90       MEF2 CHIMERA
6bz1_D  3e-17    1            94         1          90       MEF2 CHIMERA
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017976545.1  0.0      PREDICTED: agamous-like MADS-box protein AGL104
Refseq  XP_017976550.1  0.0      PREDICTED: agamous-like MADS-box protein AGL104
TrEMBL  A0A061DIV8      0.0      A0A061DIV8_THECC; AGAMOUS-like 104, putative
STRING  EOX92192        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM2179              23           65
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G22130.1  1e-91    AGAMOUS-like 104
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]