PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG74118.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family C2H2
Protein Properties Length: 1648aa    MW: 168823 Da    PI: 6.8063
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG74118.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H215.45.5e-05452474123
                 EEETTTTEEESSHHHHHHHHHHT CS
     zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                 ++C++C+k F r  nL+ H r H
  GBG74118.1 452 FVCEICNKGFQRDQNLQLHRRGH 474
                 89******************988 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1648 aa     Download sequence    
MDLSLRHRLE GSELCDMDIG DYLFRGGQSL SRAGANGDCA VGASAVSLCY HPTAAGLHPL  60
SSSAAASTRG IPSPAVFSSS SPAQQSPFAL ADCFKLNNNS SSSSSSSTGI VSSCGFRVVK  120
PFAEVRSLYA SSSSSPSPSP RSPRSLTTEI PSFVFDPRSQ SHAEIPIPTN DILAKEATES  180
STLSRHHRGE RREDCGHQRP PTSSESGSRR RKVGDGEGEQ CRAAETAAAR SRRQEDEMGV  240
GGGLVAAPRA GGGEEARTTP TLPTFAFPAR RDCETTAAEP RPGDPPQDHR HDDDERDDGL  300
LLTTTSSSSV SSGSDSVEET MTLVSAGTGA RGDSRGSGDD NAVLQGGGGG GGGGGDGPTT  360
GMLICDGSSE ERGAVGGEGG EGGRERRMIC DGSFEDRGRG GGGGGGGGGG GGGNQSQPGG  420
EMKRKRNLPG TPDPQAEVVA LSPKTLMATN RFVCEICNKG FQRDQNLQLH RRGHNLPWKL  480
KQRLSKELRK RVYVCPEPSC VHHDPTRALG DLTGIKKHYC RKHGEKKFEC DKCSKKYAVY  540
SDWKAHTKSC GSREYRCDCG IVFPRRDSYI THRAFCEAHT GEDAGGAGGA FGTGGLNPPP  600
SEVMSPQRRS PGIGIRGFHP DQSEFRAAVD GSAAGLACWL SCEDEMMRES GGVGVGVGGG  660
GGRASAEAAS AMSVSLRQHL LENGFCHPPP GLEILPIELA NPWVVGSSQH SFEAFPPTRS  720
SSAAAAGGGG GGGGGGGGRG EAGGQAPQGR SGSASLSSSS LALWMGPHRT PQDHQSIGDL  780
LPSDPGGNPD SSSAVPLLLP HHPLHAGDHP RTCLGQSSST LGAEHSGSVV GGPSSCCLLS  840
NLHSQENHKL ASPVNYNFHH STSLNYSQIP PAPRQAAGSV FASLFASGTA GQIPVKSLLT  900
NGGGAGTAAG PSCSDRVGTR SGSVAAAGEE GGGGGGGGGG GSGGGHAGQI VMKGGEVERL  960
PLSSRMSSDQ HNSDMMMMRG VGNDVLRRHD GYHHHVNRGI GGEFESPPTP SGKLSPIPGA  1020
HQSSAAYRNY TEVEAAASRY PGRHCETAGG SDALNGGPGS SMLLHSGEPR SCSSLGVSDG  1080
GGGIGRMPWT PLATMGHYSE RNREQERGQQ QQPRAHHRQQ GRAKGGEAEE GSGECGGASF  1140
AAEGQGGARH HEGDDNGDHE DGNRSSCLAV RETMTAATEV EGEGEGEGGE GGGGDSSRGS  1200
AEAEGGPDAS SRTDALLRAR CCGALGRGRG EESAGLFHHQ LAVPGSGFGL TWLGGSPQRG  1260
GSSGSRERAI AAAQEHLDSG SVIESRSRGR VGGMGGGGGG RGGGGGGRDG ETAAAHLDQQ  1320
IHNNIEEIFG RLGDENDCRC VGKNSCGGRD TDQTVAVISR PHVSMQEAGD GIMQSRETQL  1380
ERTSGSLPPH HEDLASESRP TSGFTPTAAA AAGGQVSIIP SSLLPSSAAS GTMSSALTGA  1440
APATDISKIA ITATNRSPAS LQRSDQDAFP GDNNIVLLRG KVAREGRGGG GGGVEGGGGG  1500
AVLLSTSSDK AQLGAVTRDF LGIGRNDKRV FPPLSPTQLA DLAVIESIRG GFNRSRSREG  1560
GNGSDPRAVF MMTAAGVPTT SVMTQLGREN IELRPTPVSI IILSLLPATT PIMAGLYRIP  1620
GLHKPCSLFL IDTCHSGGSS DSQEHGKK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
112971307GGGGRGGGGGG
212981308GGGGRGGGGGG
312991309GGGGRGGGGGG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G03840.11e-93C2H2 family protein