PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG028353t1
Common NameTCM_028353
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family CPP
Protein Properties Length: 647aa    MW: 71573.1 Da    PI: 4.7394
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG028353t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR47.63.4e-15337375341
               TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                        k+CnC+ s+Clk+YCeCfaag +C ++C Ce+C N+++
  Thecc1EG028353t1 337 FKRCNCQMSRCLKLYCECFAAGLYCVDSCACENCYNRTD 375
                       699*********************************875 PP

2TCR475e-15422461140
               TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                       ++k+gC+Ck+skClkkYCeC+ a++ Cs  C+Ce+C N+ 
  Thecc1EG028353t1 422 RHKRGCKCKRSKCLKKYCECYRAKVGCSGGCRCEGCDNSF 461
                       589***********************************85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011141.4E-12335376IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163432.371336462IPR005172CRC domain
PfamPF036383.4E-11338373IPR005172CRC domain
SMARTSM011145.9E-14422463IPR033467Tesmin/TSO1-like CXC domain
PfamPF036387.1E-11424460IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 647 aa     Download sequence    Send to blast
MESPKSDGNA VEAFPASALK SPFSKFLNNL SPIESANAAR YIERLSESSL PTTPSVFGSP  60
HLDLQPETGF LESEEIAAST SNAHGQLYSP SSSALIPCIQ KQFQSCNPSE CVDDFLADPL  120
EVDSAQHAET FLQSANVVPL LLPSCFTASQ VTTKNDDYTN DWVVEVGAQT LPHLSKKHLL  180
SGSLILLEGS GDQSIDESFD KILKFSSENI CNYVEPDELL EHQEASQHQR GIRRHLRFEA  240
TLDCKDNAAF NCHTSSRVRE VVDHVDSDHQ SSEAVVVDNE GHLPDNIEPP WDTQQVSCFD  300
QQTLPCGGQR VEVLSQYAES SKGKRNRETY TYESGDFKRC NCQMSRCLKL YCECFAAGLY  360
CVDSCACENC YNRTDYEDWV EDSREQIELR NPLAFAPTIV EQANDSPILA DDGNWTTPSS  420
ARHKRGCKCK RSKCLKKYCE CYRAKVGCSG GCRCEGCDNS FGKKSESIFQ REEEWKNLLN  480
MEELMSDQKG GTANQFSPTW EELGNTSHLT PLSHQVPSLI LSKIWDFPYI SQAQPQDGSG  540
LQLSPGQLHW YSSALASVNA PCEIMGDGSP HIHNDNSNPA NKLQSGSPNQ ERVFPPQQIQ  600
SDRLGSSSTA GLQSGRKLSS QAVSSFPPLS PHRNSKDRMN QIEDEQ*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A1e-1533848112133Protein lin-54 homolog
5fd3_B1e-1533848112133Protein lin-54 homolog
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021298702.10.0protein tesmin/TSO1-like CXC 3
TrEMBLA0A061G9J50.0A0A061G9J5_THECC; Tesmin/TSO1-like CXC domain-containing protein isoform 1
STRINGEOY265550.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM15696813
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22760.16e-56Tesmin/TSO1-like CXC domain-containing protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]