PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG86070.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family C2H2
Protein Properties Length: 941aa    MW: 103636 Da    PI: 4.5106
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG86070.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H211.90.00068914936323
                 ETTTTEEESSHHHHHHHHHH..T CS
     zf-C2H2   3 CpdCgksFsrksnLkrHirt..H 23 
                 C  Cg++F ++ +L +Hi +  H
  GBG86070.1 914 CATCGQVFDSRNQLFQHITKsgH 936
                 99***************987666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 941 aa     Download sequence    
MPQCLYEVLE IPRTATADDI KNAYRRLALQ WHPDKNQNRL EEANVRFKEI QNAYDVLRDK  60
QERAWYDSHR EAIVRGGDNY YGSESGVSGY KSSKPENEIN LWPYFSSSAY SGYGSTGKGF  120
YSVYEEVFKH LHRQEQESAA FQGLRIPDPP PLGRQDSPYA EVSAFYNYWL SFATCRDFAW  180
CDEYVVASAP NRKYRRLMEE ENKKLRKKEK KEFNETVRAL AAFVKKRDKR VLERQLEQRR  240
IRAEKEEEEK RRRQEMMKER WEKAQSYEEP EWARVNEDDQ EEFDGDEDDG IADPNSGSYD  300
WARSGTAATA ASGRVGGGGE TETSKATTAG ANVHNELFCI VCSKRFKSEK QWQNHEKSRK  360
HLERVSELKA VLLEEEEATE QATTSSSPAE EALPSSRTAA ARLAAESSSA DPALPKEEDE  420
KGEESASEEE VAAEANERCS HRGNIPPRSA SQSQTSDVSD QQSKEADGRD ADDNATGAAD  480
NNGDNGGGSD DDDACGADGA TVRGGGGGGG GGGGGGGGGS RKGGETISSS ASHSKIHGNI  540
NAGGRGGEQQ KQQEEDDEDD EEEEVGGESE LKKSDKTGSH FEQEEEEEGK EEEEEEEEEE  600
EEEEDHDEDA VLERMLRSSS QRSSEKKAST PLSRNQEEDD EDEETEGREE EEDEGDHDED  660
AVLEWMLRSS SQRSSQKKAS TRFGHDREEA HEDEETAETE EKAGSAEREP GNSEAVTEKE  720
DANPVATEVQ EGNEQREERE EAKKEEGEGE EIGSRSQGKK AKKGRRAVKE KGGGVGKAVS  780
DAAETDRRST PGVEAAPSRT EEGQEAGKEA SQAEGTNRGI ENGKEGIVEN AKTDGADVQK  840
SSRKEKKQKK KDVGISAGGG GMGGNKVLSD GMALQQLLQQ QQQLLLQKGV GGKKGAKAKK  900
ARKEVVEAVI GRLCATCGQV FDSRNQLFQH ITKSGHAAPK G
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1192229RKYRRLMEEENKKLRKKEKKEFNETVRALAAFVKKRDK
2203211KKLRKKEKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G74250.13e-59C2H2 family protein