PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG005021t2
Common Name TCM_005021
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family MYB_related
Protein Properties Length: 669 aa    MW: 72295.8 Da    PI: 6.6127
Description MYB_related family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG005021t2  genome  CGD     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  50.8   3.9e-16  24     68   1          47
                      TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
   Myb_DNA-binding  1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47
                      r rWT+eE+ ++++a k++G   W +I +++g ++t+ q++s+ qk+
  Thecc1EG005021t2 24 RERWTEEEHNRFLEALKLYGRA-WQRIEEHIG-TKTAVQIRSHAQKF 68
                      78******************88.*********.************98 PP
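
The alignment above can be checked against the coordinates printed on its query row (24 to 68): once the gap characters are removed, the row should contain 68 - 24 + 1 residues. The following is a minimal Python sketch of that check. The helper name `ungap` and the variable names are illustrative assumptions, not part of PlantTFDB or any library.

```python
# Query row of the alignment, copied from the record (gaps shown as "-").
query_row = "RERWTEEEHNRFLEALKLYGRA-WQRIEEHIG-TKTAVQIRSHAQKF"

# 1-based, inclusive coordinates printed at both ends of the query row.
start, end = 24, 68


def ungap(aligned: str) -> str:
    """Remove alignment gap characters ("-") from one aligned row."""
    return aligned.replace("-", "")


domain_seq = ungap(query_row)
expected_len = end - start + 1

# The ungapped row length should match the span implied by the coordinates.
print(len(domain_seq), expected_len)
```

If the two numbers differ, either the row was copied incorrectly or the coordinates belong to a different domain hit.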

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689          5.23E-17  18     74   IPR009057    Homeodomain-like
PROSITE profile  PS51294           21.091    19     73   IPR017930    Myb domain
TIGRFAMs         TIGR01557         7.1E-17   22     71   IPR006447    Myb domain, plants
SMART            SM00717           7.3E-13   23     71   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  1.1E-8    24     64   IPR009057    Homeodomain-like
Pfam             PF00249           2.7E-13   24     67   IPR001005    SANT/Myb domain
CDD              cd00167           1.22E-9   26     69   No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0009409  Biological Process  response to cold
GO:0009651  Biological Process  response to salt stress
GO:0009723  Biological Process  response to ethylene
GO:0009733  Biological Process  response to auxin
GO:0009737  Biological Process  response to abscisic acid
GO:0009739  Biological Process  response to gibberellin
GO:0009751  Biological Process  response to salicylic acid
GO:0009753  Biological Process  response to jasmonic acid
GO:0042754  Biological Process  negative regulation of circadian rhythm
GO:0043433  Biological Process  negative regulation of sequence-specific DNA binding transcription factor activity
GO:0043496  Biological Process  regulation of protein homodimerization activity
GO:0045892  Biological Process  negative regulation of transcription, DNA-templated
GO:0045893  Biological Process  positive regulation of transcription, DNA-templated
GO:0046686  Biological Process  response to cadmium ion
GO:0048574  Biological Process  long-day photoperiodism, flowering
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
GO:0044212  Molecular Function  transcription regulatory region DNA binding
Sequence
Protein Sequence    Length: 669 aa
MDTYSSGEEL VIKTRKPYTI TKQRERWTEE EHNRFLEALK LYGRAWQRIE EHIGTKTAVQ  60
IRSHAQKFFS KLEKEALAKG VPIGQALDIE IPPPRPKRKP SNPYPRKTGA ATTAQVGAKD  120
GKSETPLSSL RCKQVLDLEK EPLPERPNGD EKPINLKDNQ DDSCSEVVTL LHEANCSSVS  180
SVNKNSIPTS AALRNSCTFR EFVPSLKETI QDNGTSKASN LDNSCTSHEK AAQGQKKDDV  240
DGGLRADETQ ATQNYPRHVA VHVLDGSLGT AASATTEHQN NAPRSTHQNP AAHAAASFAA  300
TFWPYANVDS SADSPACSQG GFPSRQMNPA PSMAAIAAAT VAAATAWWAA HGLLPLCAPL  360
HTGFTCALAS AAAVPPMDNE QAPATKMERK DNNDQDLSMQ DQQLDPEYSE ALQAQHSASK  420
SPTSSSSDSE ACGDAKVNTG VKAADDEKAA AVTEPQDANK TKNRKQVDRS SCGSNTPSSS  480
EVETDVLEKY EKDKEDAKGA DANHPQVECC NRRGRSCSNP SDSWKEVSEG GRLAFQALFS  540
REVLPQSFSP PHDGKNKGQQ KDKVEDDKQN SDEKDGATSA LDLNSQTVRS CSYRQGVEKN  600
GLSRGEDIVG EGLLTIGLEH AKLKARRTGF KPYKRCSVEA KENKVMNAGS QGEEKGPKRI  660
RLEGEAST*
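
The sequence above is laid out in space-separated groups of ten residues with a running position count at the end of each line and a trailing `*` on the last line. The following is a minimal Python sketch, assuming that layout, of how such a block might be joined into a plain string and sliced with the 1-based coordinates used elsewhere in this record. The function names `parse_numbered_block` and `subsequence` are hypothetical helpers, and only the first two lines of the sequence are included to keep the example short.

```python
import re


def parse_numbered_block(text: str) -> str:
    """Join a numbered, space-grouped protein block into one string.

    Drops whitespace and the running position counts at line ends,
    then strips a trailing "*" stop marker if one is present.
    """
    residues = re.sub(r"[^A-Za-z*]", "", text)
    return residues.rstrip("*")


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


# First two lines of the record's protein sequence (positions 1 to 120).
block = """
MDTYSSGEEL VIKTRKPYTI TKQRERWTEE EHNRFLEALK LYGRAWQRIE EHIGTKTAVQ  60
IRSHAQKFFS KLEKEALAKG VPIGQALDIE IPPPRPKRKP SNPYPRKTGA ATTAQVGAKD  120
"""

seq = parse_numbered_block(block)

# Residues 24 to 68, the span printed on the Myb_DNA-binding query row.
myb = subsequence(seq, 24, 68)
print(len(seq), myb)
```

The slice uses `start - 1` because Python indexing is 0-based while the record's coordinates are 1-based and inclusive.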
Binding Motif
Motif ID  Method  Source                   Motif file
MP00119   DAP     Transfer from AT1G01060  Download
Motif logo
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017979100.1  0.0      PREDICTED: protein LHY isoform X6
TrEMBL  A0A061DT36      0.0      A0A061DT36_THECC; Homeodomain-like superfamily protein isoform 2
STRING  EOX95562        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G46830.1  1e-107   circadian clock associated 1
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]