PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PHU20051.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family C3H
Protein Properties Length: 867aa    MW: 97094.6 Da    PI: 9.0603
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PHU20051.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH21.25e-07169189526
                 --SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   5 lCrffartGtCkyGdrCkFaHg 26 
                  Cr+f + G C++G+ C+F+H+
  PHU20051.1 169 VCRDFPA-GKCRRGSDCRFIHP 189
                 5999999.*************9 PP

2zf-CCCH21.25.1e-07252274326
                 S---SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   3 telCrffartGtCkyGdrCkFaHg 26 
                 t  Cr++ + GtC+ G++CkF+H+
  PHU20051.1 252 TITCRNYVK-GTCRWGASCKFSHD 274
                 668******.*************5 PP

Sequence ? help Back to Top
Protein Sequence    Length: 867 aa     Download sequence    
MGESEKRRKS LWDAEEPNST NYEEWGAPKA DNLWQSKSRS DWLSGDNVTG TEELRKENYH  60
YKSMSPAFER RGRRSNSHSP DNGRTQPRRY SGRARSRSRS RSRSRGRGEG RSRSRSRGRD  120
RFRTSSRSRS RDRGSMRVQN RSRSPLHNYR RDSYVSGDRR TGLRTSSQVC RDFPAGKCRR  180
GSDCRFIHPD AANHRGGGHS EDNVAERLGS RPERGHISRY TDIEGPGYQS RDRLPDVHHL  240
EDELHRNRSR GTITCRNYVK GTCRWGASCK FSHDGASGDN YDKGTRSASF DHGQDNPASR  300
SGKSLCEYFA AGKCYKDNCK FSHDASSRNH EIRHSDNTGG HRFNDKNNWL NAPKWDDEGR  360
TSDLVKASGW DETAPKWDDE GRPSDLVKAS GWDETAVRKD TAVSVLTDRT NERPGHSFEN  420
ENRAWGSTEP QFMNSERQRA ASPHRGSAGH LNTLNISESS VIQNFTNAQD IHLTSQASDL  480
NMERTSAHVL GHKSNQGSSG IILSTIDTQP YGSGGSFVET QGLTDDSIAR TLGSNVVNEF  540
MNSRDSVHHV GLPGQSFSGT GIGMKSEHSA VLNGAHQEPN VFLPIPSTGH NKREAPGTPE  600
MLEFKVPPNL SGTVTGEQVH QMETSPTSMI KKFEEGLREA QLQSVLNPSG PSGMLPSNPI  660
SSLVHALYGQ TNPEMRVPDN YHPPDGFELN TSGNLPNNSF HLDRDSNKAP MEQMNQSYAV  720
DPELGNNDQI DEVKQQENKL VEVNGKDKLA LEESKDGLEN NHPGAMHMHA KIEEGSCNKD  780
EKAMRLFKNA LIEFVKEILK PTWKEGKMSR EVHKTVVKKA VDKVTSAMQA EQVPKTQEKI  840
DQYLSYSKPK IAKLVQAYVE RLLKNEA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
193101RARSRSRSR
293103RARSRSRSRSR
395103RARSRSRSR
495105RARSRSRSRSR
597105RARSRSRSR
697107RARSRSRSRSR
799107RARSRSRSR
8111119RARSRSRSR
9111121RARSRSRSRSR
10113123RARSRSRSRSR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33835.17e-15C3H family protein