PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG021166t1
Common Name TCM_021166
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family C3H
Protein Properties Length: 473 aa    MW: 49470.5 Da    PI: 7.7527
Description C3H family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG021166t1  genome  CGD     View CDS
Signature Domain
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  37.4   4.1e-12  44     68   3          27
                      S---SGGGGTS--TTTTT-SS-SSS CS
           zf-CCCH  3 telCrffartGtCkyGdrCkFaHgp 27
                      ++ C +++rtG+C yG+rC+F+H++
  Thecc1EG021166t1 44 EADCIYYLRTGFCGYGSRCRFNHPR 68
                      789********************96 PP

2    zf-CCCH  38.8   1.6e-12  90     114  3          27
                       S---SGGGGTS--TTTTT-SS-SSS CS
           zf-CCCH   3 telCrffartGtCkyGdrCkFaHgp 27 
                       ++ C++++rtGtCk+G +Ck++H++
  Thecc1EG021166t1  90 QPVCQYYMRTGTCKFGVSCKYHHPK 114
                       689********************96 PP

3    zf-CCCH  35.1   2.3e-11  135    159  2          26
                       -S---SGGGGTS--TTTTT-SS-SS CS
           zf-CCCH   2 ktelCrffartGtCkyGdrCkFaHg 26 
                       ++++C+++ +tG+Ck+G++CkF+H+
  Thecc1EG021166t1 135 GEKECSYYVKTGQCKFGATCKFHHP 159
                       7899********************8 PP

4    zf-CCCH  40     6.5e-13  295    321  1          27
                       --S---SGGGGTS--TTTTT-SS-SSS CS
           zf-CCCH   1 yktelCrffartGtCkyGdrCkFaHgp 27 
                       +++++C+++++tG CkyG++C+++H+p
  Thecc1EG021166t1 295 PGQPECQYYMKTGDCKYGSSCRYHHPP 321
                       5789*********************97 PP

5    zf-CCCH  34     5e-11    343    366  3          26
                       S---SGGGGTS--TTTTT-SS-SS CS
           zf-CCCH   3 telCrffartGtCkyGdrCkFaHg 26 
                        ++C+++++ G Ck+G+ CkF H+
  Thecc1EG021166t1 343 APPCSHYSQRGVCKFGAACKFDHP 366
                       689********************9 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50103             14.051    41     69   IPR000571    Zinc finger, CCCH-type
SMART            SM00356             2.3E-5    41     68   IPR000571    Zinc finger, CCCH-type
SuperFamily      SSF90229            1.19E-6   43     70   IPR000571    Zinc finger, CCCH-type
Pfam             PF00642             1.4E-9    44     68   IPR000571    Zinc finger, CCCH-type
Gene3D           G3DSA:4.10.1000.10  9.9E-6    47     68   IPR000571    Zinc finger, CCCH-type
PROSITE profile  PS50103             15.263    87     115  IPR000571    Zinc finger, CCCH-type
SMART            SM00356             8.3E-7    87     114  IPR000571    Zinc finger, CCCH-type
Pfam             PF00642             3.7E-10   90     114  IPR000571    Zinc finger, CCCH-type
SuperFamily      SSF90229            4.71E-6   90     114  IPR000571    Zinc finger, CCCH-type
Gene3D           G3DSA:4.10.1000.10  5.4E-5    92     113  IPR000571    Zinc finger, CCCH-type
PROSITE profile  PS50103             14.718    133    161  IPR000571    Zinc finger, CCCH-type
SMART            SM00356             1.4E-5    133    160  IPR000571    Zinc finger, CCCH-type
Pfam             PF00642             7.4E-9    135    159  IPR000571    Zinc finger, CCCH-type
SuperFamily      SSF90229            4.06E-6   135    161  IPR000571    Zinc finger, CCCH-type
Gene3D           G3DSA:4.10.1000.10  3.4E-4    139    158  IPR000571    Zinc finger, CCCH-type
PROSITE profile  PS50103             16.389    294    322  IPR000571    Zinc finger, CCCH-type
SMART            SM00356             6.8E-7    294    321  IPR000571    Zinc finger, CCCH-type
Pfam             PF00642             4.2E-10   296    321  IPR000571    Zinc finger, CCCH-type
SuperFamily      SSF90229            5.1E-6    298    322  IPR000571    Zinc finger, CCCH-type
Gene3D           G3DSA:4.10.1000.10  2.7E-5    300    321  IPR000571    Zinc finger, CCCH-type
PROSITE profile  PS50103             14.596    340    368  IPR000571    Zinc finger, CCCH-type
SMART            SM00356             6.6E-6    340    367  IPR000571    Zinc finger, CCCH-type
Pfam             PF00642             3.0E-8    343    366  IPR000571    Zinc finger, CCCH-type
SuperFamily      SSF90229            3.92E-5   344    370  IPR000571    Zinc finger, CCCH-type
Gene3D           G3DSA:4.10.1000.10  1.9E-4    345    365  IPR000571    Zinc finger, CCCH-type
Gene Ontology
GO Term     GO Category         GO Description
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 473 aa
MERGPESDPQ PEWTAPGPET GLEEPVWRLG LGGGPESYPE RPEEADCIYY LRTGFCGYGS  60
RCRFNHPRDR AAVMGAGRGG VGEYPERVGQ PVCQYYMRTG TCKFGVSCKY HHPKQGGGSV  120
SSVLLNYYGY PLRPGEKECS YYVKTGQCKF GATCKFHHPA PPAQVPAPSP APPVASVPTP  180
VPAPAIYSTV QSPSGPSSQQ YGVVMARPPL MPGSYMQGHY GPLLLSPGMV SVPSWNPYMA  240
PVSPGTQPTV GSSSIFGVTP LSPSAPAYTG PYLPVPSSVG PSSSSQKEQS FPERPGQPEC  300
QYYMKTGDCK YGSSCRYHHP PEVIAPKADV MLGPLGLPLR PGAPPCSHYS QRGVCKFGAA  360
CKFDHPTGTL SYSPSASSLA DMPVAPYPVG STIGTLAPSS SSSELRPDLI SGSSKDTATA  420
IMSSSVSTLS ESVGSVFSEG APIPQSSIQQ SSQSTAPSTG SGSSSTEGRT SS*
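
The TrEMBL annotation for this entry describes the protein as a "C-x8-C-x5-C-x3-H type" zinc finger, and the record lists five zf-CCCH domains. As a rough cross-check, the sketch below scans the sequence above for that literal spacing with a regular expression. The names `SEQUENCE`, `CCCH_PATTERN`, and `find_ccch_motifs` are illustrative and not part of PlantTFDB. The pattern is a simplification: it covers only this one spacing, whereas the database calls domains with profile HMMs, so the two approaches need not agree on other proteins.

```python
import re

# Protein sequence copied from the record above, with spacing, position
# counters and the trailing "*" (stop symbol) removed.
SEQUENCE = (
    "MERGPESDPQPEWTAPGPETGLEEPVWRLGLGGGPESYPERPEEADCIYYLRTGFCGYGS"
    "RCRFNHPRDRAAVMGAGRGGVGEYPERVGQPVCQYYMRTGTCKFGVSCKYHHPKQGGGSV"
    "SSVLLNYYGYPLRPGEKECSYYVKTGQCKFGATCKFHHPAPPAQVPAPSPAPPVASVPTP"
    "VPAPAIYSTVQSPSGPSSQQYGVVMARPPLMPGSYMQGHYGPLLLSPGMVSVPSWNPYMA"
    "PVSPGTQPTVGSSSIFGVTPLSPSAPAYTGPYLPVPSSVGPSSSSQKEQSFPERPGQPEC"
    "QYYMKTGDCKYGSSCRYHHPPEVIAPKADVMLGPLGLPLRPGAPPCSHYSQRGVCKFGAA"
    "CKFDHPTGTLSYSPSASSLADMPVAPYPVGSTIGTLAPSSSSSELRPDLISGSSKDTATA"
    "IMSSSVSTLSESVGSVFSEGAPIPQSSIQQSSQSTAPSTGSGSSSTEGRTSS"
)

# Literal C-x8-C-x5-C-x3-H spacing; "." stands for any residue.
# Assumption: only this exact spacing is searched for.
CCCH_PATTERN = re.compile(r"C.{8}C.{5}C.{3}H")


def find_ccch_motifs(sequence):
    """Return (start, end, text) tuples with 1-based inclusive coordinates.

    re.finditer reports non-overlapping matches from left to right.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in CCCH_PATTERN.finditer(sequence)
    ]


motifs = find_ccch_motifs(SEQUENCE)
for start, end, text in motifs:
    print(f"{start:>3}-{end:<3} {text}")
```

Each reported span starts at the first cysteine and ends at the histidine, so it is expected to sit inside the corresponding domain range in the Signature Domain table rather than match it exactly.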
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_017975652.1  0.0      PREDICTED: zinc finger CCCH domain-containing protein 34
Swissprot  Q6NPN3          1e-163   C3H58_ARATH; Zinc finger CCCH domain-containing protein 58
TrEMBL     A0A061EVZ7      0.0      A0A061EVZ7_THECC; Zinc finger C-x8-C-x5-C-x3-H type family protein
STRING     EOY06449        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1779              27           86
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G47850.3  1e-131   C3H family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]