PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID fgenesh1_pm.chr_6_#_302
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; Mamiellales; Bathycoccaceae; Ostreococcus
Family C3H
Protein Properties Length: 1010aa    MW: 110522 Da    PI: 4.9413
Description C3H family protein
Gene Model
Gene Model ID              Type      Source    Coding Sequence
fgenesh1_pm.chr_6_#_302    genome    JGI       View CDS
Signature Domain
No.    Domain     Score    E-value    Start    End    HMM Start    HMM End
1      zf-CCCH    23.2     1.2e-07    914      935    4            26
                              ---SGGGGTS--TTTTT-SS-SS CS
                  zf-CCCH   4 elCrffartGtCkyGdrCkFaHg 26 
                              + C  f r G+C +G++CkFaH+
  fgenesh1_pm.chr_6_#_302 914 PVCYAFER-GECDRGNSCKFAHP 935
                              67999***.*************8 PP

2      zf-CCCH    19.9     1.3e-06    954      974    6            27
                              -SGGGGTS--TTTTT-SS-SSS CS
                  zf-CCCH   6 CrffartGtCkyGdrCkFaHgp 27 
                              C  f+  G C+yGd+C+F+H+p
  fgenesh1_pm.chr_6_#_302 954 CYAFRD-GNCSYGDSCRFSHDP 974
                              666666.*************86 PP

3      zf-CCCH    18.8     3e-06      990      1009   6            26
                               -SGGGGTS--TTTTT-SS-SS CS
                  zf-CCCH    6 CrffartGtCkyGdrCkFaHg 26  
                               C  fa+ G C +Gd+C+F+H+
  fgenesh1_pm.chr_6_#_302  990 CYAFAE-GNCTRGDSCRFSHD 1009
                               999***.*************5 PP
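
The three zf-CCCH hits above can be cross-checked with a simple pattern scan. The sketch below is illustrative only and is not how PlantTFDB assigns domains (the table reports profile HMM hits). It assumes the commonly cited C-x(7,8)-C-x(5)-C-x(3)-H spacing for CCCH fingers, and the names `TAIL`, `TAIL_OFFSET`, and `find_ccch` are hypothetical. The input is the C-terminal part of the protein sequence listed on this page (residues 901 to 1009).

```python
import re

# Residues 901-1009 of the protein sequence shown on this page,
# with spaces, position numbers and the trailing '*' removed.
TAIL_OFFSET = 901
TAIL = (
    "CKKAKKARFGEDAPVCYAFERGECDRGNSCKFAHPGTDHKGTGGGGGGGGAGICYAFRDG"
    "NCSYGDSCRFSHDPNASDSRPSSRGPTGKCYAFAEGNCTRGDSCRFSHD"
)

# Assumed motif spacing: C-x(7,8)-C-x(5)-C-x(3)-H.
CCCH = re.compile(r"C.{7,8}C.{5}C.{3}H")


def find_ccch(seq, offset=1):
    """Return (start, end, motif) tuples using 1-based protein coordinates."""
    return [
        (m.start() + offset, m.end() + offset - 1, m.group())
        for m in CCCH.finditer(seq)
    ]


hits = find_ccch(TAIL, TAIL_OFFSET)
for start, end, motif in hits:
    print(start, end, motif)
# 916 934 CYAFERGECDRGNSCKFAH
# 954 972 CYAFRDGNCSYGDSCRFSH
# 990 1008 CYAFAEGNCTRGDSCRFSH
```

Each match falls inside one of the reported domain ranges (914-935, 954-974, 990-1009); the HMM hits extend a few residues beyond the strict C...H core.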

Protein Features
3D Structure
Database           Entry ID             E-value     Start    End     InterPro ID    Description
TIGRFAMs           TIGR00585            1.1E-83     20       338     IPR002099      DNA mismatch repair protein family
Gene3D             G3DSA:3.30.565.10    1.0E-63     20       230     IPR003594      Histidine kinase-like ATPase, C-terminal domain
SuperFamily        SSF55874             2.75E-42    35       221     IPR003594      Histidine kinase-like ATPase, C-terminal domain
CDD                cd00075              1.21E-8     41       143     No hit         No description
Pfam               PF13589              3.7E-9      43       146     No hit         No description
PROSITE pattern    PS00058              0           111      117     IPR014762      DNA mismatch repair, conserved site
SuperFamily        SSF54211             9.1E-30     221      358     IPR020568      Ribosomal protein S5 domain 2-type fold
CDD                cd03484              1.95E-50    231      358     No hit         No description
Gene3D             G3DSA:3.30.230.10    4.0E-38     231      357     IPR014721      Ribosomal protein S5 domain 2-type fold, subgroup
SMART              SM01340              1.6E-39     236      358     IPR013507      DNA mismatch repair protein, C-terminal
Pfam               PF01119              7.3E-24     238      357     IPR013507      DNA mismatch repair protein, C-terminal
SuperFamily        SSF118116            5.89E-23    632      765     No hit         No description
Pfam               PF08676              1.4E-22     635      816     IPR014790      MutL, C-terminal, dimerisation
SMART              SM00853              1.8E-30     635      817     IPR014790      MutL, C-terminal, dimerisation
SuperFamily        SSF118116            5.89E-23    798      848     No hit         No description
Pfam               PF13451              4.0E-20     864      908     IPR025306      Probable zinc-binding domain
PROSITE profile    PS50103              13.713      910      937     IPR000571      Zinc finger, CCCH-type
SMART              SM00356              4.5E-4      910      936     IPR000571      Zinc finger, CCCH-type
Pfam               PF00642              8.8E-6      914      935     IPR000571      Zinc finger, CCCH-type
Gene3D             G3DSA:4.10.1000.10   8.6E-4      915      934     IPR000571      Zinc finger, CCCH-type
PROSITE profile    PS50103              14.164      948      975     IPR000571      Zinc finger, CCCH-type
SMART              SM00356              0.0023      950      974     IPR000571      Zinc finger, CCCH-type
Gene3D             G3DSA:4.10.1000.10   1.3E-4      953      974     IPR000571      Zinc finger, CCCH-type
Pfam               PF00642              2.8E-4      954      974     IPR000571      Zinc finger, CCCH-type
PROSITE profile    PS50103              13.835      984      1009    IPR000571      Zinc finger, CCCH-type
SMART              SM00356              2.3         986      1009    IPR000571      Zinc finger, CCCH-type
Pfam               PF00642              2.7E-4      989      1009    IPR000571      Zinc finger, CCCH-type
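
Several member databases report overlapping hits for the same InterPro entry. As a rough illustration, the sketch below merges the overlapping IPR000571 (Zinc finger, CCCH-type) ranges from the table above into distinct regions. The coordinates are as read from those rows, and the helper name `merge_intervals` is hypothetical; this is not part of PlantTFDB or InterProScan.

```python
def merge_intervals(hits):
    """Merge overlapping (start, end) ranges; coordinates are inclusive."""
    merged = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# Start/end pairs of the IPR000571 rows (PROSITE, SMART, Pfam, Gene3D).
ccch_hits = [
    (910, 937), (910, 936), (914, 935), (915, 934),
    (948, 975), (950, 974), (953, 974), (954, 974),
    (984, 1009), (986, 1009), (989, 1009),
]

regions = merge_intervals(ccch_hits)
print(regions)
# [(910, 937), (948, 975), (984, 1009)]
```

The three merged regions correspond to the three zf-CCCH signature domains reported for this protein.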
Gene Ontology
GO Term       GO Category           GO Description
GO:0006298    Biological Process    mismatch repair
GO:0006310    Biological Process    DNA recombination
GO:0009555    Biological Process    pollen development
GO:0048316    Biological Process    seed development
GO:0005524    Molecular Function    ATP binding
GO:0030983    Molecular Function    mismatched DNA binding
GO:0046872    Molecular Function    metal ion binding
Sequence
Protein Sequence    Length: 1010 aa
MPPDDDDDGN GDDVLARPRA IRPIDAAAVH RICSGQVVLD LASCVKELVE NALDAGATNV  60
EIRLKDHGTD VVEVSDNGSG VDGASYGMLT KKYATSKLRA FEDLETTTTF GFRGEALSSM  120
CGICGEFTVT TRTLDDECGT KIEYDAEGNV VSTSAVPRSA GTTATARRLF EPLAVRRKEF  180
LRNAKREYGK ALAVVQAYAL MSKSVRILCT HQSGKYGRAN VLHTRGGSEA TVRENVVTVF  240
GAKMMACMRE VDFELGSSTG CRVVGFVSKV DPGCGRVGTD RQFFYVNGRP VDLPKMSKAL  300
NETYRSFNPN QVPMAVLDFR LPTDSYDVNV TPDKRKVMLH CEQEILMRMK EALAETFAPS  360
RYTFAVGDAP PSAGRRSLSS ARRSSDVDDV DDDENDENDD DENWTPIKYG EEETFDAALP  420
IEDFARARAA KRAAPVREVE TQKNLETFGF TREVTNVAIG GGWTMATSTE NAAPETSPSE  480
APLATPREVE IKEETEDVEV CEEEEPKRPR LEEPMCEDDD EDVEAPTRPY VGHEEVTFED  540
VAVTEMEQET PPPSRRESLD VGKIAFSMES MLLRRQNAKK SALPAPTAKE ASFESSRIPS  600
ETESTVDAAS TQTAAKATNE LERVFDKADF AKMRIVGQFN LGFILATLGD DLFIIDQHAS  660
DEIYNFERLQ RTTSLTKQPL IQPISLDLTA SEEQTVLQNM PVFLANGFGF CDIAESVPGA  720
DINNSSVDPR CRTLRLNAVP FLKNVQFDKS DVQELVAMLD QGQHSLPSKS QLSIGLARPS  780
SGAASDAASA RLLRPSKTRA ALAMRACRSS IMIGDALDRR SMRRVLNHLT SLEAPWNCPH  840
GRPTMRHVRR WRMSGKVSTG TYGDITLNCR DCNAEFIFTI GEQEFYATKG WTNQPSRCEP  900
CKKAKKARFG EDAPVCYAFE RGECDRGNSC KFAHPGTDHK GTGGGGGGGG AGICYAFRDG  960
NCSYGDSCRF SHDPNASDSR PSSRGPTGKC YAFAEGNCTR GDSCRFSHD*
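
The listing above is blocked in groups of 10 residues with a running position at the end of each full line. A minimal sketch for turning such a listing into a plain string follows; the function name `parse_blocked_sequence` is hypothetical, and only the last two lines of the listing are used as sample input.

```python
import re


def parse_blocked_sequence(text):
    """Join a blocked sequence listing into one continuous string."""
    parts = []
    for line in text.splitlines():
        # Drop the trailing running position (e.g. "  960") if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces between 10-residue blocks.
        parts.append("".join(line.split()))
    return "".join(parts)


tail_block = """\
CKKAKKARFG EDAPVCYAFE RGECDRGNSC KFAHPGTDHK GTGGGGGGGG AGICYAFRDG  960
NCSYGDSCRF SHDPNASDSR PSSRGPTGKC YAFAEGNCTR GDSCRFSHD*
"""

tail = parse_blocked_sequence(tail_block)
protein_tail = tail.rstrip("*")  # '*' marks the translation stop
print(len(tail), len(protein_tail))
# 110 109
```

Counted this way, the full listing holds 16 lines of 60 residues plus 49 residues and a `*` on the last line, which suggests the stated length of 1010 includes the stop symbol.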
3D Structure
Structure
PDB ID    Evalue    Query Start    Query End    Hit Start    Hit End    Description
1h7s_A    3e-91     19             357          13           363        PMS1 PROTEIN HOMOLOG 2
1h7s_B    3e-91     19             357          13           363        PMS1 PROTEIN HOMOLOG 2
1h7u_A    3e-91     19             357          13           363        MISMATCH REPAIR ENDONUCLEASE PMS2
1h7u_B    3e-91     19             357          13           363        MISMATCH REPAIR ENDONUCLEASE PMS2
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Protein
Source    Hit ID            E-value    Description
Refseq    XP_001418550.1    0.0        predicted protein, partial
TrEMBL    A4RZC5            0.0        A4RZC5_OSTLU; Uncharacterized protein (Fragment)
STRING    ABO96843          0.0        (Ostreococcus 'lucimarinus')
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G25900.2    2e-06      C3H family protein