PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG038379t1
Common Name TCM_038379
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family CPP
Protein Properties Length: 941aa    MW: 101567 Da    PI: 5.774
Description CPP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG038379t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     48.1   2.3e-15  530    568  3          41
               TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                       +k+CnCkk+kClk+YC+Cfaag +C++ C+C++C+N+ e
  Thecc1EG038379t1 530 CKRCNCKKTKCLKLYCDCFAAGIYCADPCSCQGCFNRPE 568
                       89**********************************876 PP

2    TCR     48.6   1.6e-15  616    654  1          39
               TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                       ++k+gCnCk+s ClkkYCeC++a++ Cs  C+Ce+CkN 
  Thecc1EG038379t1 616 RHKRGCNCKRSMCLKKYCECYQANVGCSIGCRCEGCKNV 654
                       589***********************************6 PP

Protein Features
3D Structure
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   1.6E-16  528    569  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   34.793   529    656  IPR005172    CRC domain
Pfam             PF03638   2.4E-11  531    566  IPR005172    CRC domain
SMART            SM01114   1.9E-17  616    657  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   3.1E-11  618    653  IPR005172    CRC domain
Sequence
Protein Sequence    Length: 941 aa
MDSPEPSKAP ISSSSAAASI SASSPVQESP FSNYISSLSP IKHDKVPHVA QGFLGLNSPP  60
LVFTSPHINT LRRPQSSSVE VSQNGEGDKK NIDGPGSLER SVSELQQGLI TDIKKEDDTK  120
DSVSVQPSSS SGCVDEYLAD PVEADCANSE YFINLNCKES KNAFQSSVNG LLETKNLKFA  180
GKNDVGREID AAQLLSGQSE EGLERKLTSH VKPVKIEDEQ HAGQVKSDEC PEFGSDMFDL  240
SSQGKECKNL DAQKVVEDHE DRCDGFLQLL PGSLQRVQEY EDFAENFEGV AEVTVDSMTN  300
DLEASEHQRG MSRRCLQFGD AQPEATANCS SSSLANDMIT SRSVATTSET EGLGLSHVDL  360
SVISRKRQLV NLSQLAINMI PQHYGEKSSL TVSKPSGIGL HLNSIVNAIP MGRGGTASMK  420
LAVDSMGIQG IKSASVMSCQ SMENMQSCSD AFEKVLAAPQ DGTLEAKACV IPGSAASESL  480
CTMESIDCQT TLHRKRELSS EHGDSNEMFN QQSPKKKRKK SSNSTDGEGC KRCNCKKTKC  540
LKLYCDCFAA GIYCADPCSC QGCFNRPEYE DTVLETRQQI ESRNPLAFAP KIVQPVTEFP  600
VTSREDGNWK TPSSARHKRG CNCKRSMCLK KYCECYQANV GCSIGCRCEG CKNVFGKKED  660
YCVTEEIVNR GGGEISESTV AAKKDFLNSD LCDPHYLTPL TPSFQCSDHG KNAPKSRLLS  720
RRCLPSPESD LTVLAKSPRS PRTSDSNDML LETSKENLDV GSYCEGINYN NADVLGDGCH  780
HTPLPNHPSI ILGSTSSKAR ELTSLSRFPL GPRSGCLSSG GSLRWRSSPI TPMSSLDGTK  840
NLQGLDSDGL SDILEDDTPE ILKDTSTPNK SVKTSSPNGK RVSPPHNLLQ LGSSSSGPLR  900
SGRKFILKAV PSFPPLTPCI DLKGSSNQNR SSCQENSSND *
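The coordinates in the Signature Domain table can be checked against this sequence block. The following is a minimal Python sketch, not part of the database record. It assumes the block keeps the layout shown here (groups of ten residues, a running position at the end of each full line) and that domain coordinates are 1-based and inclusive. The helper names `parse_block` and `region` are hypothetical, and only three of the lines above (residues 481 to 660) are copied in to keep the example short.

```python
import re

# Three lines copied from the sequence block above (residues 481-660).
BLOCK = """\
CTMESIDCQT TLHRKRELSS EHGDSNEMFN QQSPKKKRKK SSNSTDGEGC KRCNCKKTKC  540
LKLYCDCFAA GIYCADPCSC QGCFNRPEYE DTVLETRQQI ESRNPLAFAP KIVQPVTEFP  600
VTSREDGNWK TPSSARHKRG CNCKRSMCLK KYCECYQANV GCSIGCRCEG CKNVFGKKED  660
"""


def parse_block(block):
    """Return (position of first residue, residue string) for a numbered block."""
    parts = []
    first = None
    for line in block.splitlines():
        # Keep letters only: drops spaces, running numbers and a trailing '*'.
        letters = re.sub(r"[^A-Za-z]", "", line)
        number = re.search(r"(\d+)\s*$", line)
        if first is None and number:
            # The running number is the position of the last residue on the line.
            seen = sum(len(p) for p in parts) + len(letters)
            first = int(number.group(1)) - seen + 1
        parts.append(letters)
    return first, "".join(parts)


def region(first, seq, start, end):
    """Slice a 1-based, inclusive [start, end] range out of seq."""
    return seq[start - first : end - first + 1]


first, seq = parse_block(BLOCK)
domain_1 = region(first, seq, 530, 568)  # Signature Domain row 1
domain_2 = region(first, seq, 616, 654)  # Signature Domain row 2

print(first)     # 481
print(domain_1)  # CKRCNCKKTKCLKLYCDCFAAGIYCADPCSCQGCFNRPE
print(domain_2)  # RHKRGCNCKRSMCLKKYCECYQANVGCSIGCRCEGCKNV
```

Both slices reproduce the query lines of the two TCR alignments in the Signature Domain section, which supports reading those Start and End columns as 1-based, inclusive positions.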
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  8e-18    531          654        12         121      Protein lin-54 homolog
5fd3_B  8e-18    531          654        12         121      Protein lin-54 homolog
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    515    519  KKRKK
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007013826.2  0.0      PREDICTED: CRC domain-containing protein TSO1
Refseq  XP_007013827.2  0.0      PREDICTED: CRC domain-containing protein TSO1
Refseq  XP_017983279.1  0.0      PREDICTED: CRC domain-containing protein TSO1
TrEMBL  A0A061GNJ0      0.0      A0A061GNJ0_THECC; Tesmin/TSO1-like CXC domain-containing protein, putative isoform 1
STRING  EOY31445        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM151961015
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  3e-60    TESMIN/TSO1-like CXC 2
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]