PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG026383t1
Common Name TCM_026383
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family Trihelix
Protein Properties Length: 661 aa    MW: 74541.6 Da    pI: 6.5899
Description Trihelix family protein
Gene Model
Gene Model ID       Type      Source
Thecc1EG026383t1    genome    CGD
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  49.4   1.2e-15  126    188  2          69
          trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69 
                       W+++evlaL+++r+++e+++ +       We+vs+k++e gf+rs+++Ckek+e+ ++++  i+ +++
  Thecc1EG026383t1 126 WSNDEVLALLRIRSSIENWFPEF-----TWEHVSRKLAELGFKRSAEKCKEKFEEESRYFNSINCSKN 188
                       ********************998.....9*******************************99886655 PP

2    trihelix  96.4   2.6e-30  505    593  1          86
          trihelix   1 rWtkqevlaLiearremeerlrrgk....lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                       rW+++evlaLi++r ++ ++   +k     k+plWe++s+ m+e g++rs+k+Ckekwen+nk+++k+k+ +kkr s +s+tcpyf+ql 
  Thecc1EG026383t1 505 RWPRDEVLALINLRCSLYNNGDHDKegaaIKAPLWERISQGMSELGYKRSAKRCKEKWENINKYFRKTKDVNKKR-SLDSRTCPYFHQLS 593
                       8**************9999999764444499*******************************************8.9999********95 PP

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
SMART            SM00717   19        122    179  IPR001005    SANT/Myb domain
Pfam             PF13837   8.4E-11   124    187  No hit       No description
PROSITE profile  PS50090   6.179     125    177  IPR017877    Myb-like domain
SMART            SM00717   0.22      502    568  IPR001005    SANT/Myb domain
Pfam             PF13837   3.8E-19   504    593  No hit       No description
CDD              cd12203   3.55E-24  504    573  No hit       No description
PROSITE profile  PS50090   6.539     505    566  IPR017877    Myb-like domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0045893  Biological Process  positive regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0001158  Molecular Function  enhancer sequence-specific DNA binding
GO:0005516  Molecular Function  calmodulin binding
Sequence
Protein Sequence    Length: 661 aa
MFDGVPDQFH QFIASSAAAA AAAAVAAART TTLPLPLSFP PLHLANSSNG FTSFDTLYTS  60
NSHNQVPPQL QQQQPHFLHP LHPQHQTQKN EEKEENTGLV RMNMEIERER SMPESIDNHH  120
HHHHPWSNDE VLALLRIRSS IENWFPEFTW EHVSRKLAEL GFKRSAEKCK EKFEEESRYF  180
NSINCSKNYR LFSELEELCQ GENPPPPHHN QQVVGATEKN KNVEKSREDE DNMGQNLEDD  240
SRNIDEYQTT AGNNAPEDNE RVVENKADNK NSSNRKRKRQ KKFEMIKGFC EDIVNKLMNQ  300
QEEMHNKLLE DMVKRDEEKV AREEAWKKQE LDRINQELEL RAKEQAIAGD RQATIIKFLS  360
KFASTGSSKC FRRSNEALFK VPNDSNPPST SSSLVPAQNP NPIVNAQSQG DQVSSTTLST  420
MVLGHQNSGS CPTDNNQIKA TSMTENQAPE NPNPKTLTSS ALALAPKNPN PVNAQSNPSP  480
PTSSVTVNKA PLTPTSNDKE DLGKRWPRDE VLALINLRCS LYNNGDHDKE GAAIKAPLWE  540
RISQGMSELG YKRSAKRCKE KWENINKYFR KTKDVNKKRS LDSRTCPYFH QLSTLYNQGT  600
LIAPSEGLEN RPALPENHSA ALPESGNDNS SQRGPAKDST VHFSEGETNM VQVPAFEFEF  660
*
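
The sequence block above can be joined into one string so that the domain coordinates listed under Signature Domain are easy to look up. The following is a minimal sketch, not part of the database record: the function name `parse_sequence_block` is hypothetical, only the first four lines of the block are used to keep the example short, and the coordinates are assumed to be 1-based and inclusive, which is consistent with the first trihelix alignment (126 to 188).

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a position-numbered sequence block into one residue string."""
    residues = []
    for line in text.splitlines():
        # Keep only alphabetic groups; this drops the trailing position
        # counter (e.g. "60") and the terminal "*" marker.
        residues.extend(re.findall(r"[A-Za-z]+", line))
    return "".join(residues)


# First four lines of the protein sequence block (residues 1-240).
block = """\
MFDGVPDQFH QFIASSAAAA AAAAVAAART TTLPLPLSFP PLHLANSSNG FTSFDTLYTS  60
NSHNQVPPQL QQQQPHFLHP LHPQHQTQKN EEKEENTGLV RMNMEIERER SMPESIDNHH  120
HHHHPWSNDE VLALLRIRSS IENWFPEFTW EHVSRKLAEL GFKRSAEKCK EKFEEESRYF  180
NSINCSKNYR LFSELEELCQ GENPPPPHHN QQVVGATEKN KNVEKSREDE DNMGQNLEDD  240
"""

seq = parse_sequence_block(block)

# Assumption: Start/End are 1-based and inclusive, so the Python slice
# runs from start - 1 up to end.
start, end = 126, 188
domain = seq[start - 1:end]

print(len(seq))
print(domain)
```

The extracted slice begins with WSNDEVLALL and ends with NSINCSKN, matching the ungapped query row of the first trihelix alignment.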
Nuclear Localization Signal (NLS)
No.  Start  End  Sequence
1    274    281  RKRKRQKK
Binding Motif
Motif ID  Method  Source
MP00011   PBM     Transfer from AT5G28300
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX964202  2e-55    JX964202.1 Gossypium hirsutum clone NBRI_TRANS-471 microsatellite sequence.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007030607.2  0.0      PREDICTED: trihelix transcription factor GTL2
TrEMBL  A0A061F9D3      0.0      A0A061F9D3_THECC; Duplicated homeodomain-like superfamily protein, putative
STRING  EOY11109        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM8268              28           38
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G28300.1  1e-102   Trihelix family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]