PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG029951t1
Common Name TCM_029951
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family MYB_related
Protein Properties Length: 474 aa    MW: 52508.6 Da    pI: 10.4313
Description MYB_related family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG029951t1  genome  CGD     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  25     4.3e-08  8      41   2          36
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS- CS
   Myb_DNA-binding  2 grWTteEdellvdavkqlGggtWktIartmgkgRt 36
                      grW t+E++ll  av ++G+++W+++a++++  Rt
  Thecc1EG029951t1  8 GRWGTWEELLLGGAVLRHGTRDWNLVASELQ-ART 41
                      9*****************************9.888 PP
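
The Start and End columns of the signature-domain table can be checked against the protein sequence. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention). The helper `slice_1based` is a hypothetical name introduced here, and the 60 residues are copied from the first line of the Sequence section.

```python
# First 60 residues of the protein, copied from the Sequence section
# of this record (spaces and the position count removed).
N_TERMINAL = "MGTEMITGRWGTWEELLLGGAVLRHGTRDWNLVASELQARTISPFAFTPEVCKAKYEDLQ"

# Query row of the alignment shown above, gap character included.
ALIGNED_QUERY = "GRWGTWEELLLGGAVLRHGTRDWNLVASELQ-ART"


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end], reading both bounds as 1-based and inclusive.

    This convention is an assumption, not something the record documents.
    """
    return seq[start - 1:end]


# Start = 8, End = 41 are taken from the signature-domain table.
domain = slice_1based(N_TERMINAL, 8, 41)

# Removing the gap from the alignment row gives the ungapped query segment.
ungapped_query = ALIGNED_QUERY.replace("-", "")

print(domain)
print(domain == ungapped_query)
```

Under the 1-based reading, the slice equals the alignment's query row with its gap removed, which suggests the assumption holds for this table.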

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   8.4E-4    5      41   IPR009057    Homeodomain-like
Pfam             PF00249            7.1E-7    7      42   IPR001005    SANT/Myb domain
CDD              cd00167            0.00469   9      56   No hit       No description
SMART            SM00297            1.2E-17   88     371  IPR001487    Bromodomain
SuperFamily      SSF47370           2.49E-18  261    373  IPR001487    Bromodomain
Gene3D           G3DSA:1.20.920.10  6.6E-16   269    372  IPR001487    Bromodomain
Pfam             PF00439            6.8E-13   277    355  IPR001487    Bromodomain
PROSITE profile  PS50014            10.07     282    352  IPR001487    Bromodomain
CDD              cd04369            1.56E-15  286    362  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
GO:0005515  Molecular Function  protein binding
Sequence
Protein Sequence    Length: 474 aa
MGTEMITGRW GTWEELLLGG AVLRHGTRDW NLVASELQAR TISPFAFTPE VCKAKYEDLQ  60
QRYSGCKAWF EELRKQRMLE LRRALEKSED SIGSLESKLE SLKAEKRDDS RIDYDSSQTV  120
SAIPCLKSEG VEFSSKDTSK DDLSAGSFTR EAQTNWSPHC QIPAAVPAEE MDMKPGESLV  180
SEREKVSSID KLADTFCGGQ FQSIRKRRGK RKRKDCSRDA KEGSVGESEF LGPADVASAS  240
PCKETSASNS AQIARSSGIE DQSGGSSKEG IDAMMGIFSS VAENYCASVF RRRLDSQKRG  300
RYKKMILRHM DFDTIRSRIA SNSIMSVKEL FRDMLLVANN AMVFYSKNTR EYKSALLLRH  360
IVTATLRQHF KEYGSKVPIT TFTSSRPMHK LPAKPRSIRP GNRKLPGKAA NNGNAVVGVS  420
HANKKTANAD SPPSVESLPV TKKGSSQPRK VGRGRASQKS ESPMKGRKRA RAR*
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    206    214  RRGKRKRKD
2    465    471  GRKRARA
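
The record does not say whether the NLS Start and End values are 0-based or 1-based. The following is a minimal sketch that tests both readings against the sequence printed above. The names `clean_sequence` and `matching_convention` are hypothetical helpers introduced for this illustration, not part of any PlantTFDB tooling.

```python
# Sequence block as printed in this record: ten-residue groups, a running
# position count at the end of each full line, and a trailing "*".
RAW_SEQUENCE = """
MGTEMITGRW GTWEELLLGG AVLRHGTRDW NLVASELQAR TISPFAFTPE VCKAKYEDLQ  60
QRYSGCKAWF EELRKQRMLE LRRALEKSED SIGSLESKLE SLKAEKRDDS RIDYDSSQTV  120
SAIPCLKSEG VEFSSKDTSK DDLSAGSFTR EAQTNWSPHC QIPAAVPAEE MDMKPGESLV  180
SEREKVSSID KLADTFCGGQ FQSIRKRRGK RKRKDCSRDA KEGSVGESEF LGPADVASAS  240
PCKETSASNS AQIARSSGIE DQSGGSSKEG IDAMMGIFSS VAENYCASVF RRRLDSQKRG  300
RYKKMILRHM DFDTIRSRIA SNSIMSVKEL FRDMLLVANN AMVFYSKNTR EYKSALLLRH  360
IVTATLRQHF KEYGSKVPIT TFTSSRPMHK LPAKPRSIRP GNRKLPGKAA NNGNAVVGVS  420
HANKKTANAD SPPSVESLPV TKKGSSQPRK VGRGRASQKS ESPMKGRKRA RAR*
"""

# (Start, End, Sequence) rows copied from the NLS table.
NLS_ROWS = [(206, 214, "RRGKRKRKD"), (465, 471, "GRKRARA")]


def clean_sequence(raw: str) -> str:
    """Keep only letters, dropping spaces, position counts and the '*'."""
    return "".join(ch for ch in raw if ch.isalpha())


def matching_convention(seq: str, start: int, end: int, motif: str) -> str:
    """Report which inclusive coordinate convention reproduces the motif."""
    if seq[start:end + 1] == motif:
        return "0-based"
    if seq[start - 1:end] == motif:
        return "1-based"
    return "no match"


PROTEIN = clean_sequence(RAW_SEQUENCE)
conventions = [matching_convention(PROTEIN, s, e, m) for s, e, m in NLS_ROWS]

print(len(PROTEIN))
for (start, end, motif), convention in zip(NLS_ROWS, conventions):
    print(start, end, motif, convention)
```

For this record both motifs are reproduced only under the 0-based, inclusive reading. The cleaned string holds 473 letters, so the stated length of 474 appears to count the trailing `*`.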
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007025741.2  0.0      PREDICTED: uncharacterized protein LOC18596921
TrEMBL  A0A061GG17      0.0      A0A061GG17_THECC; Bromodomain 4, putative
STRING  EOY28363        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM7607915
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G42150.1  3e-34    DNA-binding bromodomain-containing protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]