PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG042663t1
Common NameTCM_042663
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family TALE
Protein Properties Length: 504aa    MW: 56953.5 Da    PI: 7.9174
Description TALE family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG042663t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox29.99.4e-103203571954
                       HHH..SSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHH CS
          Homeobox  19 Fek..nrypsaeereeLAkklgLterqVkvWFqNrRak 54 
                       Fe   ++yp+ +e+  LA+++gL+ +qV++WF N R +
  Thecc1EG042663t1 320 FEHflHPYPTDSEKLMLARQTGLSRTQVSNWFINARVR 357
                       665559******************************88 PP

2BELL85.18.5e-28187256372
              BELL   3 qelqkkkakLlslleeVdkrYkqyveqlqtvissFeavaglgsakpYtslAlkaiSrhFrcLkdaiaeqi 72 
                        +++ k ++L+ +l+eV+++Yk y++q+q v++sF++v glg+a+pY+ +A kai++hF cLk ai +qi
  Thecc1EG042663t1 187 IQHRWKNSRLVLMLDEVYRKYKLYCQQMQSVVASFKCVSGLGNAAPYVCFAFKAIAKHFSCLKSAILNQI 256
                       57889***************************************************************98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM005744.8E-31134254IPR006563POX domain
PfamPF075261.3E-29139252IPR006563POX domain
SMARTSM003891.1E-11301365IPR001356Homeobox domain
CDDcd000862.62E-12301362No hitNo description
PROSITE profilePS5007112.212302361IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.601.3E-27304365IPR009057Homeodomain-like
SuperFamilySSF466896.42E-18305368IPR009057Homeodomain-like
PfamPF059203.2E-18318357IPR008422Homeobox KN domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 504 aa     Download sequence    Send to blast
MEGSYNQPLH IPQQNRRNRL RVTIGTNQEE EQASQAPLLQ LNQPALLCPS SSQPTFMHTF  60
PQSLCSSTMQ NPRDVNYQFF DDQGLSLSLS FQHQDNMNLP LNLDAQKSNE NSILGGFLKQ  120
NCQMRSSVPL GPFTGYASVL KSSRFLTPAQ QILDDFCGVD YRVLDFPLES LGDGDVGKDP  180
ITCSDKIQHR WKNSRLVLML DEVYRKYKLY CQQMQSVVAS FKCVSGLGNA APYVCFAFKA  240
IAKHFSCLKS AILNQIRFTD KTADNAVVGK DNNVPSLWTS DQGISNQNPV QNVTFLQHPL  300
WRSQRGLPDQ AVAVLKTWLF EHFLHPYPTD SEKLMLARQT GLSRTQVSNW FINARVRLWK  360
PMVEEIHMLE LRQSQKPSSE ATNQDAKLPS ELLVDKLPHF IASQEVENIQ NKRPRNNIFY  420
PDEQSKLQKS ALAYTSLPSN HHLGVGTSNF CLALSLNQDN NGIDFSTPPM PMNLCHNFNF  480
KTDGELSLKA GFDVERQHHG KNF*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
3k2a_A1e-14307365563Homeobox protein Meis2
3k2a_B1e-14307365563Homeobox protein Meis2
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007009173.20.0PREDICTED: BEL1-like homeodomain protein 9
TrEMBLA0A061FM320.0A0A061FM32_THECC; POX family protein, putative
STRINGEOY179830.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM1761579
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G02030.17e-78TALE family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]