PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG79841.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family C3H
Protein Properties Length: 790aa    MW: 86162.8 Da    PI: 5.0782
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG79841.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH18.82.9e-06458478526
                 --SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   5 lCrffartGtCkyGdrCkFaHg 26 
                  Crff++ G C++ ++C+F+Hg
  GBG79841.1 458 TCRFFLQ-GNCRFDNNCRFSHG 478
                 6******.*************9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 790 aa     Download sequence    
MPDEDPKLGP HLEMLGGKGR AENRCVCRGK ACEYVAVGSG NIAWEEKEAP TGLAQQEGGE  60
SGGSGGGGEG GERGGGGGGG RGGRGERGGG GGEGGGGGGG GGGGGYRVTG AGDGHVCRVW  120
VLLPSLKNSK QTNGREDDVE EEATPEQQQQ QKKKKKKRKR KRKKEKWKKN EKNGKQQQQQ  180
QQQQQQKKKK KNKKKKRRRE KEEEEEGEGE EGKEEEEEEE EEEEEKEKRK RSMEEGRGGV  240
GGGGGTTGLH EEDSEDTEGS LERELEVQLV EHKDSLAALT AAIQVEPSEE LLQVQAELEE  300
ACRSAEQSLL HLKRARLLKE CAEMTEGVAG ADDEPADAAG GSSNAANPKD IHTKGHATET  360
LPLSSEVGNA DREAAADRIE AKIDKAETVC RARNFTRGSK CQFLHADGRW YAGEVLWSGE  420
VTAEALRDIG HGSAGNSEPG SSFARVRFLH PSKESVQTCR FFLQGNCRFD NNCRFSHGTL  480
IPANSLREFG SSTGSLQDLR LGKSVAGSDD GDCNSYDDGS EEEEDDDDEE EEEEEEEEEW  540
EEGEENGEVD EEWERNTGLG NLFEEAAAMA EAGPQNEVVN FVDWEQHTRG IASKMMAKMG  600
YQEGMGLGKL GQGIQVPLPV RVLPAKRGLD YVASKLPEAK EAKRKKRKRG GEKVQRRKAA  660
LAAKQRELEQ ERSDGDMFAL INRQLHSSSA TVSTQDAAAN DSKGAAAAKT GKAVDRKSLL  720
LKVDQVRELK ETIGKLEEML ERNKNEKPVY VAVQKKLEHA RCALAQANAA HNSVRDILNE  780
KDAHKKWLKF
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
17381RGGGGGGGR
28290GGRGERGGG
3151159QKKKKKKRK
4151169QKKKKKKRKRKRKKEKWKK
5152158KKKKKKR
6152160KKKKKKRKR
7152164KKKKKKRKRKRKK
8152172KKKKKKRKRKRKKEKWKKNEK
9152196KKKKKKRKRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKKNKKKK
10153159KKKKKKR
11153161KKKKKKRKR
12153165KKKKKKRKRKRKK
13153173KKKKKKRKRKRKKEKWKKNEK
14153197KKKKKKRKRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKKNKKKK
15154160KKKKKKR
16154162KKKKKKRKR
17154198KKKKKKRKRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKKNKKKK
18154159KKKKRK
19155161KKKKKKR
20155163KKKKKKRKR
21155199KKKKKKRKRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKKNKKKK
22156162KKKKKKR
23156164KKKKKKRKR
24156200KKKKKKRKRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKKNKKKK
25156161KKRKRK
26157163KKKKKKR
27157201KKKKKKRKRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKKNKKKK
28158164KKKKKKR
29158202KKKKKKRKRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKKNKKKK
30158163RKRKRK
31158165RKRKRKKE
32158166RKRKRKKEK
33159203KKKKKKRKRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKKNKKKK
34159191KRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKK
35160204KKKKKKRKRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKKNKKKK
36160192KRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKK
37160164RKRKK
38160165RKRKKE
39160168RKRKRKKEK
40161205KKKKKKRKRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKKNKKKK
41161193KRKRKKEKWKKNEKNGKQQQQQQQQQQQKKKKK
42187199KKKKKKRKRKRKK
43187199KKKKKNKKKKRRR
44188200KKKKKNKKKKRRR
45190198KKKKKKRKR
46193199KKKKKKR
47193198KKKKRK
48193199KKKKRRR
49194200KKKKRRR
50194199KKKRRR
51643649KKKKKKR
52643658KRKKRKRGGEKVQRRK
53644649RKKRKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G24830.15e-30C3H family protein