PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG018714t1
Common Name TCM_018714
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family C2H2
Protein Properties Length: 535 aa    MW: 59429.7 Da    pI: 7.557
Description C2H2 family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG018714t1  genome  CGD     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  17.2   1.4e-05  45     67   1          23
                      EEETTTTEEESSHHHHHHHHHHT CS
           zf-C2H2  1 ykCpdCgksFsrksnLkrHirtH 23
                      ++C+ Cgk F +   L  H+r H
  Thecc1EG018714t1 45 FQCKVCGKDFESMKALFGHMRHH 67
                      89*******************99 PP

2    zf-C2H2  16.2   2.9e-05  77     97   3          23
                      ETTTTEEESSHHHHHHHHHHT CS
           zf-C2H2  3 CpdCgksFsrksnLkrHirtH 23
                      C+ Cg+ F +   L+ H+r+H
  Thecc1EG018714t1 77 CQECGRKFQSLKGLTAHMRLH 97
                      *******************99 PP

3    zf-C2H2  14.6   9.8e-05  369    391  1          23
                       EEETTTTEEESSHHHHHHHHHHT CS
           zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                       ykC +C+k+F+++  L  H+  H
  Thecc1EG018714t1 369 YKCRICDKTFKSHQALGGHQTFH 391
                       9***************9999877 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SuperFamily      SSF57667           2.23E-10  45     97   No hit       No description
PROSITE profile  PS50157            12.653    45     72   IPR007087    Zinc finger, C2H2
Gene3D           G3DSA:3.30.160.60  9.3E-5    45     65   IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
SMART            SM00355            0.0017    45     67   IPR015880    Zinc finger, C2H2-like
Pfam             PF13912            6.6E-8    45     69   IPR007087    Zinc finger, C2H2
PROSITE pattern  PS00028            0         47     67   IPR007087    Zinc finger, C2H2
Gene3D           G3DSA:3.30.160.60  1.9E-5    66     97   IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE profile  PS50157            10.845    75     97   IPR007087    Zinc finger, C2H2
SMART            SM00355            0.0093    75     97   IPR015880    Zinc finger, C2H2-like
Pfam             PF13912            2.0E-5    76     97   IPR007087    Zinc finger, C2H2
PROSITE pattern  PS00028            0         77     97   IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           7.94E-9   368    391  No hit       No description
Pfam             PF13912            3.3E-6    368    393  IPR007087    Zinc finger, C2H2
PROSITE profile  PS50157            10.72     369    396  IPR007087    Zinc finger, C2H2
SMART            SM00355            0.024     369    391  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0         371    391  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           7.94E-9   446    473  No hit       No description
Pfam             PF13912            5.8E-9    450    473  IPR007087    Zinc finger, C2H2
PROSITE profile  PS50157            9.452     451    478  IPR007087    Zinc finger, C2H2
SMART            SM00355            0.74      451    473  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0         453    473  IPR007087    Zinc finger, C2H2
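
The PROSITE pattern coordinates can be cross-checked against the aligned fragments shown under Signature Domain. The following is a minimal sketch, not the database's own method. It assumes the PS00028 consensus is C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H, and the names C2H2_PATTERN, FRAGMENTS and pattern_hits are hypothetical helpers introduced only for illustration.

```python
import re

# Assumed PROSITE-style C2H2 consensus, rewritten as a regular expression:
# C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H
C2H2_PATTERN = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

# Aligned query fragments copied from the Signature Domain alignments,
# keyed by the 1-based protein position of their first residue.
FRAGMENTS = {
    45: "FQCKVCGKDFESMKALFGHMRHH",
    77: "CQECGRKFQSLKGLTAHMRLH",
    369: "YKCRICDKTFKSHQALGGHQTFH",
}


def pattern_hits(fragments):
    """Return (start, end) protein coordinates, 1-based inclusive, per match."""
    hits = []
    for offset, fragment in sorted(fragments.items()):
        for match in C2H2_PATTERN.finditer(fragment):
            # match.end() is exclusive, so subtract 1 for an inclusive end.
            hits.append((offset + match.start(), offset + match.end() - 1))
    return hits


print(pattern_hits(FRAGMENTS))
# [(47, 67), (77, 97), (371, 391)]
```

Under that assumption the three fragments reproduce the PROSITE pattern rows at 47-67, 77-97 and 371-391. The fourth row (453-473) has no aligned fragment in the Signature Domain section, so it is not covered here.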
Gene Ontology
GO Term     GO Category         GO Description
GO:0003676  Molecular Function  nucleic acid binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 535 aa
MEKNSGKGKA KENMDFQGYG LRDNPKKSWK SLSFTDGSSS SMLKFQCKVC GKDFESMKAL  60
FGHMRHHSGR ERKRVNCQEC GRKFQSLKGL TAHMRLHPVK LRVSGEPGPG GPRQDLVLES  120
ITVRKKRSKR MRYSNAPNSS PSSLNESSDV FEIDQEVEDV ALCLIMLSWG VRNWSEFNSS  180
RESSDNSVIK SFHQSKEIIQ NEIGIPFGDG DESFQMKKPR VDKSNPDVSV SMNVFYEKKI  240
SECKELDSGI VTDKEEKIGS EAPNDMFCRD VEFRVSTVED ESGFELYATE IEERNSGEKM  300
TFRSIEVESG QDLMEGLDLA GLGSTKLSSC KDAMFDACDA EPGGNSSNKQ ICTPLNSEMS  360
DDSKKKNRYK CRICDKTFKS HQALGGHQTF HRKSNSCAIE QIENCEKNTQ SSSSPKTEAS  420
PKFRRVENVE NSVEQEINGV TSNGTSRCKV HKCGICFKVF ASGQALGGHK RSHILKESGT  480
RDKQPPMQIG FISDVLDLNL PALHNEEANG DVGFKSCRVG SDCKSEPLVS LVAN*
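
The sequence block above mixes residues with spacing, running position counts and a trailing stop marker, so it needs cleaning before any coordinate lookup. The sketch below shows one way to do that and to slice out the three zf-C2H2 regions listed under Signature Domain. The names RAW, clean_sequence and domain are hypothetical, and coordinates are assumed to be 1-based and inclusive, as the alignments suggest.

```python
import re

# Sequence block as displayed in the record: groups of 10 residues,
# a running position count on full lines, and a trailing "*".
RAW = """
MEKNSGKGKA KENMDFQGYG LRDNPKKSWK SLSFTDGSSS SMLKFQCKVC GKDFESMKAL  60
FGHMRHHSGR ERKRVNCQEC GRKFQSLKGL TAHMRLHPVK LRVSGEPGPG GPRQDLVLES  120
ITVRKKRSKR MRYSNAPNSS PSSLNESSDV FEIDQEVEDV ALCLIMLSWG VRNWSEFNSS  180
RESSDNSVIK SFHQSKEIIQ NEIGIPFGDG DESFQMKKPR VDKSNPDVSV SMNVFYEKKI  240
SECKELDSGI VTDKEEKIGS EAPNDMFCRD VEFRVSTVED ESGFELYATE IEERNSGEKM  300
TFRSIEVESG QDLMEGLDLA GLGSTKLSSC KDAMFDACDA EPGGNSSNKQ ICTPLNSEMS  360
DDSKKKNRYK CRICDKTFKS HQALGGHQTF HRKSNSCAIE QIENCEKNTQ SSSSPKTEAS  420
PKFRRVENVE NSVEQEINGV TSNGTSRCKV HKCGICFKVF ASGQALGGHK RSHILKESGT  480
RDKQPPMQIG FISDVLDLNL PALHNEEANG DVGFKSCRVG SDCKSEPLVS LVAN*
"""


def clean_sequence(raw):
    """Keep residue letters only; drop digits, whitespace and the '*' marker."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def domain(seq, start, end):
    """Slice a region given 1-based inclusive coordinates (an assumption)."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW)

# Start/End values taken from the Signature Domain table.
for start, end in [(45, 67), (77, 97), (369, 391)]:
    print(start, end, domain(SEQ, start, end))
# 45 67 FQCKVCGKDFESMKALFGHMRHH
# 77 97 CQECGRKFQSLKGLTAHMRLH
# 369 391 YKCRICDKTFKSHQALGGHQTFH
```

The slices agree with the query lines of the three alignments. The cleaned string holds 534 residue letters, one fewer than the stated length of 535, which may reflect the trailing `*` being counted.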
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017974426.1  0.0      PREDICTED: uncharacterized protein LOC18601620
TrEMBL  A0A061EGU6      0.0      A0A061EGU6_THECC; C2H2-like zinc finger protein, putative
STRING  EOY03617        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM16241913
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G45120.1  8e-19    C2H2-like zinc finger protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]