PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG79953.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family C3H
Protein Properties Length: 1988aa    MW: 196867 Da    PI: 7.088
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG79953.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH44.72.2e-1417911817127
                  --S---SGGGGTS--TTTTT-SS-SSS CS
     zf-CCCH    1 yktelCrffartGtCkyGdrCkFaHgp 27  
                  yktelCr + +tGtC+yG++C+FaHg+
  GBG79953.1 1791 YKTELCRSWEETGTCRYGGKCQFAHGK 1817
                  9************************96 PP

2zf-CCCH36.11.1e-1118291853125
                  --S---SGGGGTS--TTTTT-SS-S CS
     zf-CCCH    1 yktelCrffartGtCkyGdrCkFaH 25  
                  ykte+Cr+f+ +GtC+yG rC+F+H
  GBG79953.1 1829 YKTEICRTFSSNGTCPYGTRCRFIH 1853
                  9************************ PP

Sequence ? help Back to Top
Protein Sequence    Length: 1988 aa     Download sequence    
MAGGSLDKIL TAGLDKMSSS LDKMPKGNGG KQADGGNARK EEAENVPAEE STCKRGEEKK  60
STNGGGGDPA SIGEGANASP SSTRTFPPSN EGPAMKSSSL GERMSQKLTK QLSVIVRGEG  120
GGAMDWFHSL KQPFLPRASE KLQMESGNQE LQVESGKQEE ELQTESARNE DELQTESRNG  180
EEELQTESRN EQSKDRKKYL IGSYNPEDVE LVLQPFPWGK GEEKDCLGEE LEAEPSVGQS  240
VGQSVGITMV NVFDSADLSV LPEANEEGSG EEGMCLADDE LNKMVDCLTS DVNNTNGSEI  300
NSGNVSFGNI ENFKEKDGGN VGVGNTEKEK NGGNVGVGNT EKAENGGNVG GGNVGGGNIE  360
KEVNGTRKTG DKTNEKEING INSSSGNGVV ILRDGLGIAA GTAEVTEERA SALVLVGSGS  420
VAATSTSTSL VSVLPGSAAA VAAASAGTSM VSAGSGSVAA AAAAAAPTAA SASASVLKGA  480
TVRPHHVAQR RRDEKGPEAG ACVGGRGGGG GGENITAAAG STFSATEVCT SAKNSTRGSS  540
SSTSSNSAIT TLLYPCDRQH RVLYPPACGF NPIRINCGKR ITPAQDKSVS QDKESSSFWV  600
SSSTIKALSS PANGAAAGGK GGGSYHALPS RQNSTAVPPV STPIPGSHPH SSPSSSSSSA  660
FTASTASSIS HPHSSPSSSS SSAFTASTAS SISLDSKDKS DMDGLQGGTS SSSREERLGV  720
ENRKQLAVAG SQRGERTGVE DGAVTGGVAD MTRSSQGGIL KQHQLGEGEV KRRAFPSPPS  780
ILTPSPPPPP QPLLGQRFEP KRGQKGGGPG LLGGGRGNNA PGGSRMMGAA AAAAGLTSTP  840
GPLSPCPARG YAAAVGRSLV RPPPYQGKDK FDEVSPSYPG RNKQSNYDLN EVTVENRAER  900
VGVRRSEEGC ASAAGAGDDA LGNETGLPLL VEEMVQDLWE ENEDPSVPIR TGEAAASRDE  960
DDPTVVAAAS LDTKRSAGGG VGITRRVSAV GSGGGMKRIG GEGGIRDGGG GGGGGTRQAL  1020
GVRTPAVSFA AAAEGQRSLR RDEPGSLSPP SVTTHQSHIP LPLGYRRGLP GMIAEEEGAV  1080
ATPAGGGGGG GGDSASSIAE GVLAMAGRPG GHVPFQQIRQ AGPRGVSAGG GGGLGSLGAA  1140
AGGVAAGGAA GGGGGGGGGY NVGTSRATGL SAAASQSLTV VPHRHSPTFR QAGGPGSGSG  1200
YPVGSGLHAS PYLSPSLQAS LTPILGGGGM GMAAMAAQFG RRPSSPTNTV EVAEAAAAGP  1260
AMGGRAGGAS TGTVMTGPAA VSPRADSGPG TAAGASMAKT GLSLDTVGGA ADGSCGTGPT  1320
VPAGAATGSG IGSIAPVMLM PQGIICQSPL MRGQAGHHAH ASRMKPGGVV GGGGGGGGGG  1380
GSGGLFPALS PSPSPLGIFG GSGGAMGGVG GGGGGGDGSL LDLQEMQQEL LYGAAGFGPI  1440
RGAPSPTGIT AEQYGKFQHQ LQQQHHMAAA QSQHQLLQAA AAAAAAAAAG GGGGGGAGFI  1500
PQAPPSPSSY QLFLQQQQQQ QQQQQVDRRS AMKLTMAGPG GPVAIGRGSN SGPGTAFQPG  1560
TGNSGPGVAG PAGHLGIINT SGNAAGFNGS MSSPLGGGGG GGGGVIGGGR MGPQSRGPHN  1620
ALTLAGGGNG ISMANPLGGG VSAAGSRSPH RTNGGPGSGS GPSGGSTVIS GPRSMMLRLP  1680
VLQIPGAYDR HAVSGQESTS RELTTPSGIG TGDGTPSAPG LMHVGSCRIL PMQQLSACSA  1740
GVGMGGGAGG GGVPGVVISP PLSPSDAGAV ASSGALTATT PTTPQSPHSL YKTELCRSWE  1800
ETGTCRYGGK CQFAHGKEEL RPVARHPKYK TEICRTFSSN GTCPYGTRCR FIHYRTALQP  1860
DPPSPVSSPC ARPASCAAGA AKLIASSPLK EDDGNMRRLP IFQRLAPGSS ESPQFEMTGL  1920
ASGDHLGQTR FDQQIRATKP GSVDWSARGP APVGGSPVGG SGNVVSFLRV SAVSEEPLLV  1980
TWDLEGAM
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1504512GGRGGGGGG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G66810.13e-25C3H family protein