PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG015076t1
Common Name TCM_015076
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family C3H
Protein Properties Length: 448 aa    MW: 49109.6 Da    pI: 7.6697
Description C3H family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG015076t1  genome  CGD     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  27.7   4.7e-09  108    129  4          26
                       ---SGGGGTS--TTTTT-SS-SS CS
           zf-CCCH   4 elCrffartGtCkyGdrCkFaHg 26 
                       + C+++ + G C+yGd+C+F H+
  Thecc1EG015076t1 108 KVCNYWVQ-GNCNYGDKCRFLHS 129
                       68******.*************7 PP

Protein Features
Database         Entry ID            E-value  Start  End  InterPro ID  Description
SMART            SM00356             1.5      25     50   IPR000571    Zinc finger, CCCH-type
PROSITE profile  PS50103             9.883    25     51   IPR000571    Zinc finger, CCCH-type
PROSITE profile  PS50103             15.591   104    131  IPR000571    Zinc finger, CCCH-type
SuperFamily      SSF90229            2.62E-7  105    129  IPR000571    Zinc finger, CCCH-type
SMART            SM00356             7.4E-7   106    130  IPR000571    Zinc finger, CCCH-type
Pfam             PF00642             1.2E-6   107    129  IPR000571    Zinc finger, CCCH-type
Gene3D           G3DSA:4.10.1000.10  5.0E-8   108    128  IPR000571    Zinc finger, CCCH-type
Gene3D           G3DSA:2.130.10.10   3.1E-15  129    243  IPR015943    WD40/YVTN repeat-like-containing domain
SMART            SM00320             0.034    135    174  IPR001680    WD40 repeat
SuperFamily      SSF50978            1.1E-39  136    440  IPR017986    WD40-repeat-containing domain
Pfam             PF00400             0.05     139    174  IPR001680    WD40 repeat
PROSITE profile  PS50294             18.668   142    344  IPR017986    WD40-repeat-containing domain
PROSITE profile  PS50082             12.346   142    183  IPR001680    WD40 repeat
PRINTS           PR00320             1.5E-5   161    175  IPR020472    G-protein beta WD-40 repeat
PROSITE pattern  PS00678             0        161    175  IPR019775    WD40 repeat, conserved site
SMART            SM00320             3.8      214    251  IPR001680    WD40 repeat
Gene3D           G3DSA:2.130.10.10   5.8E-26  244    440  IPR015943    WD40/YVTN repeat-like-containing domain
SMART            SM00320             0.01     258    295  IPR001680    WD40 repeat
PROSITE profile  PS50082             11.511   265    304  IPR001680    WD40 repeat
Pfam             PF00400             0.2      279    295  IPR001680    WD40 repeat
PRINTS           PR00320             1.5E-5   282    296  IPR020472    G-protein beta WD-40 repeat
SMART            SM00320             0.0089   298    335  IPR001680    WD40 repeat
Pfam             PF00400             0.0091   300    334  IPR001680    WD40 repeat
PRINTS           PR00320             1.5E-5   322    336  IPR020472    G-protein beta WD-40 repeat
SMART            SM00320             390      358    400  IPR001680    WD40 repeat
SMART            SM00320             320      403    439  IPR001680    WD40 repeat
Gene Ontology
GO Term     GO Category         GO Description
GO:0005515  Molecular Function  protein binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 448 aa
MDLDGGNRRV FNRLGGPSTA PTDSSKHQKV CYHWRAGKCN RFPCPFLHRE LPAPGPAATA  60
NGSGAPKRFA DDSGFSGPAA RRGPNFNNNH HNSWGRMGAN KVVRKTEKVC NYWVQGNCNY  120
GDKCRFLHSW SLGEGFTMLS HLDGHQKVVS GIALPAGLDK LYTGSKDETV RAWDTNSGQC  180
TCVINLGGEV GCMISEGPWL FVGIPNVVKA WNTQTNYELS LTGPVGQVYA MVVGNDLLFA  240
GTQDGTILAW KFNAITNSFE AAASLKSHTL AVVSLVVGAN RLYSGSMDHS IRVWSLETLQ  300
CLQTLTEHHN VVMSLLCWEQ FLLSCSLDQT IKVWVATENG NLEVTYTHNE EHFMLVIRGG  360
SDWRMEGEAV GLLNLRGMHD SESKPVLLCT CNDNSVRLYD LPSFSERGKI FAKQEIRAIE  420
VGPGGLFFTG DGTGYRVWKW AQPIATS*
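
The coordinates reported for the zf-CCCH signature domain (residues 108 to 129) can be checked directly against the sequence block. The following is a minimal Python sketch, not part of the PlantTFDB record: the helper names `clean_sequence` and `domain_slice` are hypothetical, and it assumes the block keeps the layout shown here (groups of ten residues, a running position count at the end of each full line, and `*` marking the stop).

```python
import re

# Sequence block copied from the record as displayed: ten-residue groups,
# a position counter at the end of each full line, '*' as the stop symbol.
RAW_BLOCK = """
MDLDGGNRRV FNRLGGPSTA PTDSSKHQKV CYHWRAGKCN RFPCPFLHRE LPAPGPAATA  60
NGSGAPKRFA DDSGFSGPAA RRGPNFNNNH HNSWGRMGAN KVVRKTEKVC NYWVQGNCNY  120
GDKCRFLHSW SLGEGFTMLS HLDGHQKVVS GIALPAGLDK LYTGSKDETV RAWDTNSGQC  180
TCVINLGGEV GCMISEGPWL FVGIPNVVKA WNTQTNYELS LTGPVGQVYA MVVGNDLLFA  240
GTQDGTILAW KFNAITNSFE AAASLKSHTL AVVSLVVGAN RLYSGSMDHS IRVWSLETLQ  300
CLQTLTEHHN VVMSLLCWEQ FLLSCSLDQT IKVWVATENG NLEVTYTHNE EHFMLVIRGG  360
SDWRMEGEAV GLLNLRGMHD SESKPVLLCT CNDNSVRLYD LPSFSERGKI FAKQEIRAIE  420
VGPGGLFFTG DGTGYRVWKW AQPIATS*
"""


def clean_sequence(block: str) -> str:
    """Keep letters only: drops spaces, newlines, position counters and '*'."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


seq = clean_sequence(RAW_BLOCK)
zf_ccch = domain_slice(seq, 108, 129)

print(len(seq))   # 447 residues once the stop symbol is removed
print(zf_ccch)    # KVCNYWVQGNCNYGDKCRFLHS
```

The extracted slice equals the query line of the zf-CCCH alignment with its gap character removed. The cleaned sequence has 447 residues, so the stated length of 448 presumably counts the terminal `*`.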
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
6g6m_A  4e-17   130          403        8          323      Tako8
6g6n_A  4e-17   130          403        8          323      Tako8
6g6n_B  4e-17   130          403        8          323      Tako8
6g6n_C  4e-17   130          403        8          323      Tako8
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX614587  7e-88    JX614587.1 Gossypium hirsutum clone NBRI_GE58084 microsatellite sequence.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_017972995.1  0.0      PREDICTED: zinc finger CCCH domain-containing protein 63
Swissprot  Q9FNZ2          0.0      C3H48_ARATH; Zinc finger CCCH domain-containing protein 48
TrEMBL     A0A061G1I4      0.0      A0A061G1I4_THECC; Zinc finger WD40 repeat protein 1 isoform 1
STRING     EOY23072        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1839              28           69
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G51980.1  0.0      C3H family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]