PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG001774t1
Common Name TCM_001774
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family C3H
Protein Properties Length: 410 aa    MW: 45717.3 Da    pI: 7.8981
Description C3H family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG001774t1  genome  CGD     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  20.9   6.1e-07  71     92   4          26
                      ---SGGGGTS--TTTTT-SS-SS CS
           zf-CCCH  4 elCrffartGtCkyGdrCkFaHg 26
                      +lC++++  G+C +Gd+C + H+
  Thecc1EG001774t1 71 KLCKYWMS-GYCARGDKCWYLHS 92
                      69******.*********99996 PP
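
The query segment in the alignment above (residues 71-92) can be checked for CCCH-style cysteine and histidine spacing. The following is a minimal Python sketch, not part of PlantTFDB: the helper name `ccch_spacing` is hypothetical, and the regular expression assumes the commonly cited consensus C-X(6-14)-C-X(4-5)-C-X(3)-H, which may be narrower or wider than the Pfam zf-CCCH model actually used for the hit.

```python
import re

# Query segment from the zf-CCCH alignment (residues 71-92), gap character removed.
SEGMENT = "KLCKYWMSGYCARGDKCWYLHS"

# Assumed consensus C-X(6-14)-C-X(4-5)-C-X(3)-H; illustrative, not taken from PlantTFDB.
CCCH_PATTERN = re.compile(r"C(.{6,14})C(.{4,5})C(.{3})H")


def ccch_spacing(segment):
    """Return the three spacer lengths (C..C, C..C, C..H), or None if no match."""
    match = CCCH_PATTERN.search(segment)
    if match is None:
        return None
    return tuple(len(group) for group in match.groups())


print(ccch_spacing(SEGMENT))  # (7, 5, 3)
```

For this segment the spacing is C-X7-C-X5-C-X3-H, using C73, C81, C87 and H91 in the numbering of the full protein.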

Protein Features
Database         Entry ID            E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50103             13.807    67     94   IPR000571    Zinc finger, CCCH-type
SuperFamily      SSF90229            6.8E-8    68     92   IPR000571    Zinc finger, CCCH-type
SMART            SM00356             5.3E-6    69     93   IPR000571    Zinc finger, CCCH-type
Pfam             PF00642             2.9E-5    71     92   IPR000571    Zinc finger, CCCH-type
Gene3D           G3DSA:4.10.1000.10  2.6E-6    71     91   IPR000571    Zinc finger, CCCH-type
Gene3D           G3DSA:2.130.10.10   4.0E-47   92     399  IPR015943    WD40/YVTN repeat-like-containing domain
SMART            SM00320             1.9E-4    98     137  IPR001680    WD40 repeat
SuperFamily      SSF50978            1.74E-44  99     400  IPR017986    WD40-repeat-containing domain
Pfam             PF00400             0.0038    102    137  IPR001680    WD40 repeat
PROSITE profile  PS50082             13.182    105    146  IPR001680    WD40 repeat
PROSITE profile  PS50294             20.25     105    307  IPR017986    WD40-repeat-containing domain
PROSITE pattern  PS00678             0         124    138  IPR019775    WD40 repeat, conserved site
PRINTS           PR00320             6.7E-6    124    138  IPR020472    G-protein beta WD-40 repeat
SMART            SM00320             22        177    214  IPR001680    WD40 repeat
SMART            SM00320             7.4E-6    221    258  IPR001680    WD40 repeat
Pfam             PF00400             0.001     223    258  IPR001680    WD40 repeat
PROSITE profile  PS50082             12.38     228    267  IPR001680    WD40 repeat
PROSITE pattern  PS00678             0         245    259  IPR019775    WD40 repeat, conserved site
PRINTS           PR00320             6.7E-6    245    259  IPR020472    G-protein beta WD-40 repeat
SMART            SM00320             4.8E-6    261    298  IPR001680    WD40 repeat
Pfam             PF00400             0.0011    263    297  IPR001680    WD40 repeat
PROSITE profile  PS50082             8.537     268    297  IPR001680    WD40 repeat
PRINTS           PR00320             6.7E-6    285    299  IPR020472    G-protein beta WD-40 repeat
SMART            SM00320             11        321    359  IPR001680    WD40 repeat
SMART            SM00320             49        362    399  IPR001680    WD40 repeat
Gene Ontology
GO Term     GO Category         GO Description
GO:0016021  Cellular Component  integral component of membrane
GO:0005515  Molecular Function  protein binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 410 aa
MGIRRAARRY DHDSVERLSV RRQGEAANIN DYRRNAKPYH SISEDLLAKS SSHHPKNSYV  60
STIKAREQEN KLCKYWMSGY CARGDKCWYL HSWYCGDGFT MLAKLEGHKK AVHGIALPLE  120
SEKLYSGSSD GTVRTWNCHS GKCVRLSNLG DEVGSMITEG PWVFIGMKGV IKLWNIQTVD  180
ELSLKGPVGQ VYSMVVANNM LFAGAQNGVI FAWKGSSEAS PFQLVASMEA HSGAVLCLTV  240
GEKKLYSGSV DHTIRVWDMD TLQCIKTLNG HEDAVMSLLY CNGCLFSCSL DCTIKVWFAT  300
EGENWEVIYT HKEENCVYYL TLISVFTIMG VLALCGMNDA ETKPVLFCSC NDNTVRLYDL  360
PSFTERGRLY SKHEVRVIQR GPFPLFFTGD GNGSLTVWKW LQKPGGGAP*
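
The numbered sequence block above can be converted to a plain string before further analysis. The following is a minimal Python sketch; the helper name `clean_sequence` is hypothetical, and it assumes that the 10-residue grouping and right-hand position counters are display formatting only and that the trailing `*` is a stop marker rather than a residue.

```python
# Sequence block as displayed, including spacing and position counters.
RAW_BLOCK = """
MGIRRAARRY DHDSVERLSV RRQGEAANIN DYRRNAKPYH SISEDLLAKS SSHHPKNSYV  60
STIKAREQEN KLCKYWMSGY CARGDKCWYL HSWYCGDGFT MLAKLEGHKK AVHGIALPLE  120
SEKLYSGSSD GTVRTWNCHS GKCVRLSNLG DEVGSMITEG PWVFIGMKGV IKLWNIQTVD  180
ELSLKGPVGQ VYSMVVANNM LFAGAQNGVI FAWKGSSEAS PFQLVASMEA HSGAVLCLTV  240
GEKKLYSGSV DHTIRVWDMD TLQCIKTLNG HEDAVMSLLY CNGCLFSCSL DCTIKVWFAT  300
EGENWEVIYT HKEENCVYYL TLISVFTIMG VLALCGMNDA ETKPVLFCSC NDNTVRLYDL  360
PSFTERGRLY SKHEVRVIQR GPFPLFFTGD GNGSLTVWKW LQKPGGGAP*
"""


def clean_sequence(block):
    """Keep residue letters only: drop whitespace, counters and the stop marker."""
    letters = [ch for ch in block if ch.isalpha() or ch == "*"]
    return "".join(letters).rstrip("*")


protein = clean_sequence(RAW_BLOCK)
print(len(protein))    # 409 residues once the stop marker is removed
print(protein[70:92])  # 1-based positions 71-92: KLCKYWMSGYCARGDKCWYLHS
```

Under these assumptions the reported length of 410 appears to count the trailing `*`, and the slice for positions 71-92 reproduces the query segment of the zf-CCCH hit.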
3D Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
6g6m_A  2e-24    102          362        8          323      Tako8
6g6n_A  2e-24    102          362        8          323      Tako8
6g6n_B  2e-24    102          362        8          323      Tako8
6g6n_C  2e-24    102          362        8          323      Tako8
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_017983223.1  0.0      PREDICTED: zinc finger CCCH domain-containing protein 48
Swissprot  Q9FNZ2          1e-130   C3H48_ARATH; Zinc finger CCCH domain-containing protein 48
TrEMBL     A0A061DKF8      0.0      A0A061DKF8_THECC; Zinc finger WD40 repeat protein 1, putative
STRING     EOX92912        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1839              28           69
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G51980.1  1e-132   C3H family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]