PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG77208.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family C3H
Protein Properties Length: 891 aa    MW: 86002.7 Da    pI: 8.3036
Description C3H family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
GBG77208.1     genome  NCBI    View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  20.8   6.7e-07  237    262  1          26
                 --S---SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   1 yktelCrffartGtCkyGdrCkFaHg 26 
                 +kt+lC  f+    C+y d C+FaHg
  GBG77208.1 237 FKTRLCAKFMSDRMCPYSDGCSFAHG 262
                 69***********************8 PP

2    zf-CCCH  28     3.7e-09  322    346  2          26
                 -S---SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   2 ktelCrffartGtCkyGdrCkFaHg 26 
                 k + C+ f++ G C+yG+rC+F H+
  GBG77208.1 322 KLRACMKFMKEGSCPYGERCCFWHD 346
                 5689******************986 PP

3    zf-CCCH  29     1.8e-09  374    400  1          27
                 --S---SGGGGTS--TTTTT-SS-SSS CS
     zf-CCCH   1 yktelCrffartGtCkyGdrCkFaHgp 27 
                 +kt+lC+ + ++G C + drC+FaHg+
  GBG77208.1 374 WKTRLCNKWEANGACFFADRCHFAHGE 400
                 8************************96 PP
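
The three zf-CCCH hits above show the same cysteine/histidine spacing in the query segments, and this can be checked directly. The sketch below is illustrative only: the C-X8-C-X5-C-X3-H pattern is an assumption read off these three segments, not taken from the HMM itself, and the names `DOMAINS`, `CCCH_PATTERN` and `check_domains` are hypothetical.

```python
import re

# Query segments copied from the three alignments above, keyed by (start, end).
DOMAINS = {
    (237, 262): "FKTRLCAKFMSDRMCPYSDGCSFAHG",
    (322, 346): "KLRACMKFMKEGSCPYGERCCFWHD",
    (374, 400): "WKTRLCNKWEANGACFFADRCHFAHGE",
}

# Assumed spacing C-X8-C-X5-C-X3-H, inferred from these segments only.
CCCH_PATTERN = re.compile(r"C.{8}C.{5}C.{3}H")


def check_domains(domains):
    """Return {(start, end): bool}.

    A segment passes when its length agrees with the 1-based inclusive
    coordinates and the assumed CCCH spacing is found in it.
    """
    results = {}
    for (start, end), segment in domains.items():
        length_ok = len(segment) == end - start + 1
        motif_ok = CCCH_PATTERN.search(segment) is not None
        results[(start, end)] = length_ok and motif_ok
    return results


if __name__ == "__main__":
    for coords, ok in check_domains(DOMAINS).items():
        print(coords, "consistent" if ok else "inconsistent")
```

A stricter check would score the segments against the profile HMM; the regular expression here only confirms the spacing of the four zinc-coordinating residues.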

Sequence
Protein Sequence    Length: 891 aa
MGGGVDGGGW GWKEEGKGVG KGRGGGRGEG SAGAVAEGGR GAECGSGGVF GTSSTTSNYS  60
SGGRPAGAGG ALDTVGGASS ASAAYKGGTR GVGESGKSSS GSSSHSHLQR HPAVSSSLPI  120
NGRIVSSSSN GEGGREGGGR EGCMGAKKGG GGGGGGGGGG GGGEEPPWKR LRTADESGPA  180
GASDVCGGTG GELTAAPAGG DLQKGGNCTN TSAYSSPGSQ SGPQSTGGSL RYRPWLFKTR  240
LCAKFMSDRM CPYSDGCSFA HGNHELQTPP PPFVSDMAGP GVLGEGGGGG GGGGGVAGGG  300
GGGGGGGDGG GGGQYGSGNP RKLRACMKFM KEGSCPYGER CCFWHDDGTS KPPGGVQGMS  360
ASSFTKLGIK PPNWKTRLCN KWEANGACFF ADRCHFAHGE AELQPFNGPR QPYHSPEQLF  420
YGFPPFNGPP PQQQQQSNLA SPLPPLSPPN APFSSPVPRR ASSLGLGSAS AGLLSADPSL  480
PSGVRPSSSP GVSLAAGSSA SGSGPVVGGS SSSNLSTPMV SPASSSANVE GKQSQTLSSL  540
PATRTVPDLR AGTAATGVDS PAAPLAGHRF VAQQQTTSSR REGGLGGGAG GSGGGGPASR  600
VTVEMSVALQ TDSQLAHGCL EQSQSQSRRP VGGLGRGYSE GGFNMFAYGD CPFDEAEDDG  660
LQQVYAQHVI APGPSSASFH DPGAVRGSAV VPMPMPSAIP EEQCHASAAV AGSGQLPEET  720
WSPPGIVRGR GGGGGGGGGG GGSGGGSGGG EGGAGTGVEP QVEERIGTSA IGSDSKGGGL  780
AAAKASTSAS GSHLVYQQGE QLEAGVDQRA GSSVAGAPAT VASGSRGQGG GGGGGGASVG  840
RVQKKAEKEG FGRGGGRGGG GGGGGIGTQQ GVLTVPFFRF SSPTGAAGRG K
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    133    139  GGREGGG
2    854    860  GGREGGG
3    855    861  GGREGGG
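
Start and End are 1-based, inclusive positions in the protein sequence, so a region is recovered with the slice [start - 1:end]. A minimal sketch, using only residues 121-180 copied from the sequence listing; the helper `slice_region` and its `offset` parameter are hypothetical names introduced for illustration.

```python
# Residues 121-180 of GBG77208.1, copied from the sequence listing (spaces removed).
BLOCK_121_180 = (
    "NGRIVSSSSN" "GEGGREGGGR" "EGCMGAKKGG"
    "GGGGGGGGGG" "GGGEEPPWKR" "LRTADESGPA"
)


def slice_region(fragment, start, end, offset=1):
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position `offset` in the full protein."""
    if start < offset or end >= offset + len(fragment) or start > end:
        raise ValueError("region lies outside the fragment")
    return fragment[start - offset:end - offset + 1]


if __name__ == "__main__":
    # Row 1 of the table: positions 133-139.
    print(slice_region(BLOCK_121_180, 133, 139, offset=121))
```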
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G32360.1  1e-19    C3H family protein