PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG040388t1
Common NameTCM_040388
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family C3H
Protein Properties Length: 300aa    MW: 31311.5 Da    PI: 9.5946
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG040388t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH20.39.8e-073962225
                      -S---SGGGGTS--TTTTT-SS-S CS
           zf-CCCH  2 ktelCrffartGtCkyGdrCkFaH 25
                      k ++C  f+ t  C++G++C+F H
  Thecc1EG040388t1 39 KSKPCTKFFSTSGCPFGESCHFLH 62
                      7789******************** PP

2zf-CCCH27.74.7e-09104129227
                       -S---SGGGGTS--TTTTT-SS-SSS CS
           zf-CCCH   2 ktelCrffartGtCkyGdrCkFaHgp 27 
                       kt+lC+ f     Ck+Gd+C+FaHg+
  Thecc1EG040388t1 104 KTRLCNKFNTPEGCKFGDKCHFAHGE 129
                       8***********************96 PP

3zf-CCCH37.25e-12266290126
                       --S---SGGGGTS--TTTTT-SS-SS CS
           zf-CCCH   1 yktelCrffartGtCkyGdrCkFaHg 26 
                       +kt+lC +f++ G C +GdrC+FaHg
  Thecc1EG040388t1 266 FKTKLCENFSK-GSCTFGDRCHFAHG 290
                       69*********.*************9 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5010312.7093765IPR000571Zinc finger, CCCH-type
SMARTSM003560.33764IPR000571Zinc finger, CCCH-type
SuperFamilySSF902295.1E-63863IPR000571Zinc finger, CCCH-type
Gene3DG3DSA:4.10.1000.106.6E-83962IPR000571Zinc finger, CCCH-type
PfamPF006428.8E-53962IPR000571Zinc finger, CCCH-type
Gene3DG3DSA:4.10.1000.102.1E-14100134IPR000571Zinc finger, CCCH-type
SuperFamilySSF902293.27E-8101133IPR000571Zinc finger, CCCH-type
SMARTSM003560.0056102129IPR000571Zinc finger, CCCH-type
PROSITE profilePS5010313.629102130IPR000571Zinc finger, CCCH-type
PfamPF006422.2E-6104129IPR000571Zinc finger, CCCH-type
SuperFamilySSF547915.99E-15166243IPR004088K Homology domain, type 1
SMARTSM003223.5E-12172242IPR004087K Homology domain
Gene3DG3DSA:3.30.1370.104.9E-16173242IPR004088K Homology domain, type 1
PROSITE profilePS5008414.508173237IPR004088K Homology domain, type 1
PfamPF000133.2E-11175239IPR004088K Homology domain, type 1
CDDcd001051.97E-14177237No hitNo description
Gene3DG3DSA:4.10.1000.102.5E-16264297IPR000571Zinc finger, CCCH-type
SuperFamilySSF902299.42E-9265295IPR000571Zinc finger, CCCH-type
SMARTSM003564.5E-7265291IPR000571Zinc finger, CCCH-type
PROSITE profilePS5010317.075265292IPR000571Zinc finger, CCCH-type
PfamPF006421.6E-9266290IPR000571Zinc finger, CCCH-type
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003723Molecular FunctionRNA binding
GO:0046872Molecular Functionmetal ion binding
Sequence ? help Back to Top
Protein Sequence    Length: 300 aa     Download sequence    Send to blast
MEFGGGRKRG RHEAALNGNG GLKKSKQEME SFSTGIGSKS KPCTKFFSTS GCPFGESCHF  60
LHYVPGGIKA VSQMLGSNPA LPAASRSSAV LPSFPDGSSP PAVKTRLCNK FNTPEGCKFG  120
DKCHFAHGEW ELGKPTGPAY EDPRAMGPMP GRMAGRMEPP SQGLGAAASF GASATAKISI  180
DASLAGAIIG KNGVNSKHIC RVTGAKLSIR ENESDPSSRN IELEGTFDQI KQASAMVREL  240
ILNVGSASGT SMKNPAMSGS GAANNFKTKL CENFSKGSCT FGDRCHFAHG TEELRKPGM*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017983881.10.0PREDICTED: zinc finger CCCH domain-containing protein 44
SwissprotQ7F8R01e-114C3H14_ORYSJ; Zinc finger CCCH domain-containing protein 14
TrEMBLA0A061GZ050.0A0A061GZ05_THECC; KH domain-containing protein / zinc finger family protein isoform 1
STRINGEOY324490.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM23362875
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G06770.13e-47C3H family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]