PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PCP012497.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family C3H
Protein Properties Length: 913aa    MW: 101882 Da    PI: 8.7107
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PCP012497.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH25.71.9e-08174193525
                  --SGGGGTS--TTTTT-SS-S CS
      zf-CCCH   5 lCrffartGtCkyGdrCkFaH 25 
                  lC++f+  G C++G+ C+F H
  PCP012497.1 174 LCKDFMT-GRCRRGSDCQFLH 193
                  9******.************* PP

2zf-CCCH21.73.6e-07234254526
                  --SGGGGTS--TTTTT-SS-SS CS
      zf-CCCH   5 lCrffartGtCkyGdrCkFaHg 26 
                   C++fa+ G C++G++CkF H 
  PCP012497.1 234 YCSDFAK-GKCQRGSTCKFDHH 254
                  6******.*************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 913 aa     Download sequence    
MSRSNRKRSS KWDLVEGPQF EDANMQDNGW MGKAGRAFHH KESGRDWLSP ETNDLHRPKH  60
DLDLPSREPL AGSRGSHKNE SINKGCNRYM NDSMVWDGDG NCSTRMSPGL DEWREHRSRS  120
PKSGWRRSLR GRSRSRSRSR SKSQSWSRSP DRGYRRESLF LDRNRGRPGI SAQLCKDFMT  180
GRCRRGSDCQ FLHEGNSNYD DSWESRHRKR DASRYSTPPD TKYYPLKSER YSVYCSDFAK  240
GKCQRGSTCK FDHHRASDGF SKGSTNENTR ERENERRNRD TSTERGAERV PHRSSDIPCK  300
FFAAGNCRNK KNCRFSHHIQ ARASPERKSQ DGRWGPGHSL NDANPAWSGP KWSDTGTLSD  360
AAMLTVDNRN IGVPEVRSSA WSVDDNRWGC DQNNENKNCA DRSVSHEAVE RNEKDANLWN  420
EDSVGAHVDL PKLRDTEKWL GDMSPDWNYT VQSSNHVGKQ EHSRITRGSE PSTQVHGAAS  480
IIEPMVAERS DFLQNKDVRV DGVISVPYDN RTAIEEPSSF RNNLNVTANI MARQSFDHSG  540
QSSSAFPFSG LSTIGQSKNL IPCGGVVKSP QDTLSPESKS VTKLDIGDAK TSLVDGIPQV  600
PNLVGGKELT ELTNLSASLA QLLGNQQQLP QIYAALNSHN APLLPKSEGS TEQLLAAAIQ  660
RDPTVVSHKP YDPMCDSIEH RISNNQMCLL PNSAGNTSID GKVENLSNVV SPSSLPSGAN  720
VNNYQQTNNP LEEPTLKDHQ LSQHGAKSEV VKGNGALGAE ESKSAQEENN SPENGPMEVT  780
GGKKLKEVKG SRAFRFALVE NVKELLKPSW KEGQVSKDAY KTIVKKVVDK VTSTMQGANI  840
PQTQEKIDHY LSFSKPKLTK LVQVASPVDE SIALSIFERI CGHLFHPFRS LASFYLPHHF  900
ESLVSIVPDD MIS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1130138RGRSRSRSR
2130140RGRSRSRSRSR
3130142RGRSRSRSRSRSK
4132140RGRSRSRSR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33835.12e-09C3H family protein