PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PHT50480.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family C3H
Protein Properties Length: 845aa    MW: 94670 Da    PI: 9.3368
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PHT50480.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH21.34.9e-07169189526
                 --SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   5 lCrffartGtCkyGdrCkFaHg 26 
                  Cr+f + G C++G+ C+F+H+
  PHT50480.1 169 VCRDFPA-GKCRRGSDCRFIHP 189
                 5999999.*************9 PP

2zf-CCCH21.24.9e-07252274326
                 S---SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   3 telCrffartGtCkyGdrCkFaHg 26 
                 t  Cr++ + GtC+ G++CkF+H+
  PHT50480.1 252 TITCRNYVK-GTCRWGASCKFSHD 274
                 668******.*************5 PP

Sequence ? help Back to Top
Protein Sequence    Length: 845 aa     Download sequence    
MGESEKRRKS LWDAEEPNST NYEEWGAPKA DNLWQSKSRS DWLSGDNVTG TDELRKENYH  60
YKSMSPAFER RGRRSNSHSP DNGRTQPRRY SGRARSRSRS RSRGRGEGRS RSRSRGRDRF  120
RTSSRSRSRS RDRGSMRVQN RSRSPLHNYR RDSYASGDRR TGLRTSSQVC RDFPAGKCRR  180
GSDCRFIHPD AANHRGGGHS EDNVAERLGS RPERGHISRY TDSEGPGYQS RDRLPDVHHL  240
EDELHRNRSR GTITCRNYVK GTCRWGASCK FSHDGASGDN YDKGTRSASF DHGQDNPASR  300
SGKSLCEYFA AGKCYKDNCK FSHDASSRNH EIRHSDNTGG HRFNDKNNWL NAPKWDDEGR  360
TSDLVKASGW DETAPKKDTA VSVLTDRTNE RPGHSFENEN RAWGSTEPQF MNSDRQRAAS  420
PHRGSAGHLN TLNILESSVI QNFTNAQDIH LTSQASDLNM ERTSAHVLGH KSNQGSSGII  480
LSTIDTQPYG SGGSFVETQG LTDDSIARTL GSNVVNEFMN SRDSVHHVGL PGQSFSGTGI  540
GMKSEHSAVL NGAHQEPNVF LPIPSTGHNK REAPGTPEML EFKVPQNLSG TVTGEQVHQM  600
ETSPTSMIKK FEEGLREAQL QSVLNPSGPS GMLPSNPISS LVHALYGQTN PEMRVPDNYH  660
PPDGFELNTS VNLPNNSFHL DRDSNKAPME QMNQSYAVDP ELGNNDQIDE VKQQENKLVE  720
VNGKDKLALE ESKDGLENNH PGTMHMHAKI EEGSCNKDEK AMRLFKNALI EFVKEILKPT  780
WKEGKMSREV HKTVVKKAVD KVTSAMQAEQ VPKTQEKIDQ YLSYSKPKIA KLVQAYVERL  840
LKNEA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
193101RARSRSRSR
293103RARSRSRSRSR
395103RARSRSRSR
495105RARSRSRSRSR
597105RARSRSRSR
6109117RARSRSRSR
7109119RARSRSRSRSR
8111121RARSRSRSRSR
9125133RARSRSRSR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33835.16e-15C3H family protein