PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG74540.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family C3H
Protein Properties Length: 5290aa    MW: 565196 Da    PI: 5.4013
Description C3H family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG74540.1       genome    NCBI      View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-CCCH  19.3   2.1e-06  5039   5060  5          26
                  --SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH    5 lCrffartGtCkyGdrCkFaHg 26  
                  +C+ fa+tG+C+ G++Ck +H+
  GBG74540.1 5039 ECSVFAETGECPDGSSCKLHHS 5060
                  6*****************9998 PP
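
The aligned query segment can be checked against the spacing commonly used to describe CCCH zinc fingers. The sketch below is illustrative only: the pattern C-X6-14-C-X4-5-C-X3-H is an assumed, simplified regular expression, not the zf-CCCH HMM that produced the score above, and the variable names are hypothetical.

```python
import re

# Query residues 5039-5060, copied from the alignment above.
DOMAIN_START = 5039
query = "ECSVFAETGECPDGSSCKLHHS"

# Assumed spacing pattern for a CCCH finger: C-X6-14-C-X4-5-C-X3-H.
# This is a simplification for illustration, not the profile HMM itself.
CCCH = re.compile(r"C.{6,14}C.{4,5}C.{3}H")

m = CCCH.search(query)
if m is not None:
    motif_start = DOMAIN_START + m.start()   # 1-based start in the protein
    motif_end = DOMAIN_START + m.end() - 1   # 1-based inclusive end
    print(m.group(0), motif_start, motif_end)
    # prints: CSVFAETGECPDGSSCKLHH 5040 5059
```

Under this assumed pattern the three cysteines and the histidine fall at C-X8-C-X5-C-X3-H, inside the 5039-5060 region reported in the table.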

Sequence
Protein Sequence    Length: 5290 aa
MPNELCGRDW ANGGHAEGGP LPLGLQSHHQ QQLPGSARGP GRHHVAVSKT GGGVCWGGRY  60
SHEERRSNLE EEGQSDAEEQ GNHHLQSRRG EAIDVSSSGR WGDDCDDREG NHGGSWRGGE  120
GGISQHRHFR GHYRHRLSPP IRPERILSER GFSDRREEER REEERREEGG GGSSTIDDTG  180
GGAADIDRWE GGPAADGTGR CLLLGQSPVR HHLHPPTSPP TSPCSPTPLR GEEERNVMGE  240
AASPRGRRSN WASSATGGFL PSVIVGNKEV MGSGSRSRSS TSVARLLPVE GGSGYDGRRR  300
GGWEIAEHEL YPRRTAALMT TWTTRRKKTM TTRTRTTAMA KRLLKMDTES AGWAARAYGI  360
SLESLMGMVA VKGEEEEEEE EKGVEEEEEE VEEEEEEVEE ERRVHSVLMS GNECDYLWRG  420
GGSSRPMGVN RVGEHYLGGG GGGGDHDGQV RDSRRMRSEM SAGGGAEREK GGGAAAVRAD  480
LVSLGDDGCA SAGVAEEGEL VESEDGEEEE VAEGEDAWGE DTTSKHFVRR KSAGDDRLGT  540
SLSLERTSPS SPTTLQRREL NKNCNDRQNR ARMPIERHLN DPWGAEGGGG GGGGGGGVVE  600
EDASNEAHRD WRRFGGKEFG LNGHVLGERR WSMLNAWRRF GGKECKWGEE GSMMNEGRVG  660
GGVLAGKSAN GVGDFLAGKL SAGGGVDARE EGEEWRRRQK SAGTVAAAES PALISGRHVS  720
SSRARRTEMD SDWQTGIGMA NLAGVIGGRG AGKGGVLVSG RMPAVSAAEV MDALPAEERA  780
SFMNTWLSLA TRQEENLVDR ARGGALTRRR PSLDRDRDRD DDVDIAARGI GGGGEGGGEV  840
IHHHEDDEST VRLPTPRMGE SRFVQFQREV LTREGRLEQP GEEEGIQEIM ARDEVVIDTA  900
HEKLLSLRRG RRDLMRAAAD GLRKKGGGVG RQRREIGLER RSAEMSRFME WVDERGALER  960
GEEMGYEGRG GRGGGDEEER DSTPIGWQTL AIPPPGAPSS REGRAYGNDD DDDDDDDEHC  1020
HSFPVSSRQR GMSSSEGFWE IGERPLARAP AVDVDGGNSH TFDSAARVGI RTRSAPDLCR  1080
GGRGVGSVGV SRMMPTKAGG EEVVAVRGRE DGCRQGGSGG RGGGEGERER DFVRCGSRGR  1140
LAQNAAQPSG GAFSLAQNAA QPSGGAFSRS SMACVEDVRG GKGVGTTSSW FQYQYEDDDD  1200
YGTTMVSRSG FPSLVPSDDG RKGAAAISSE AEGIETRDFG SSRKRRREYS GGVEENTKFA  1260
GAAQREWSLG GEKSLALSED SLENTLSREE RWAAADAGGG GGGWSNGRAV PLIERLGVGG  1320
EGGGGEGGGE GGGEGGGGGG GGEALPLSSS NWLVNSGITT AALFGRRLRS EGDVSSPSPP  1380
VPSVYPVAEF PEGVADGFDN VGERSMPGSR QYCVEREGLT EVCGREEEAA EEELEEGEAE  1440
EDGEEEEEEE EEEEAEEEEV EARRNSIADQ GDGILSAYSM SAMTVPIDGR DKAAMEAQDE  1500
SEVKPGGPTS TSCGEEGDAL AFLREMAKLV SSGYGLDKCI KHMEGEAAPP YSPPPPLPPS  1560
DVDHDPESLL ATPLQRPFLG ATGSEDMAVR YGGQSTANQR PFPPATASGN SATLRTRRRK  1620
KGDEEGIAVL EDISNDDYDG FDEHDEYGAD RGGDGGRCGV QHDEGEGEHK GGRKLVHMSD  1680
PCFAWKGERA PSATALNRAK FAIEGGGSHQ SKRTLPSGFK VGSGREGGGQ IEDAPSMLGQ  1740
SGVVPEWGFA RDGYGEGNNY GATRMKGIEE GKDMMMGEGE EEEEEEEEEG EIAYREKMEN  1800
GKGVMMMEEG EEEEEEEEEG EIAYRGNREN GRGVMMMMEE GGEEEEEGEI VAREPQRCRM  1860
PKMDVHSLDS NARLRFCRTN EAARARGQAK LSKAAVDLLG MENEGEEGDV RGDEERENMK  1920
ENGRCGRDGS RMCGALAERG VRPPSEKGGI VIPSEKLGQL RGNVSDENDL LDESHVFGAA  1980
DGERWRVEKD VMVSLEKVPL LTGREATGLR EHGSITNEAA SAVELPFPSS RRVEHFEEEV  2040
VAMDSPVAIT SDHNEIDSEQ TERANDVASG GGQGDGLEDQ EGEGEEDDDN LYEVSGGKGQ  2100
SCRIRLGEVE VAATDMTDGY LGNIDADAND YHGDSGHDDG EEDVEEDEEE NEEEDEEEED  2160
EDMILLLPDN DPHFVREEKP MLCVSTWQDL HSSADSASMQ GVGGGAYSPW IGSTLQNPGF  2220
TGRTVSALPS CAAGRVFGLG GPGAGGSSRA TLLRGGGSIM TPMTMMMTPR HAIPSITPHK  2280
RLWNQWVASP SATCQRTQQQ GHGAPGAAGE GEASPMGEGE VCPTGEGEAT ALNTTDEEGP  2340
PSVHVGGEGE AMEMLRGEGG NRTMGNLIGE MTVENATDKM EMELAEAVGE VATHESNDPV  2400
EEEGESCGPM ETSNKRRRCS MVEVGVLKYG QEREQQQQQT ELRSAGCEQE EEGLQSLHSK  2460
SMELVGEGPP GPLGGLVSGS NEEASCMDEE GYELGRRKDA DCGYHDEGSV PGQREEGEDR  2520
YGELYASLDG GKLAGHGHGE SGAHQEQQQQ QQQYQEEKDK ERKKNKEKER KKKSPAELNA  2580
TLKALLSKFV NQGEFVMDAS PERERGQSSS SAVSGTKVGE GTNDEEQKTD GLSPEVLTRS  2640
VGRGRTAAGA AGAAAAMAIY SSGDRSRGGA SATVVSTRRI AQPEEQQQLL QRGVSTAVGA  2700
TKNSGHDVVV ADVASRIAHT DTGPGNAGGV SSRVRELVGR EHSSSLRYLQ QQQEGDVREV  2760
RVSAAAVLGG TARSSASASS SLQCSLVPTM PSGVMVGSLQ KRSPVAKAAA VPSAVRVSPV  2820
SSVQQHQLLS RMPVGGRGAV GVPSLLNPTS LPRPRVWSRK WCREDCVAPG AGSANVPSGA  2880
ATEGAGVAAH HTTTAGKMMG RPSGSSSPEQ YAASASSKTA SSPLLGIKPT AGPSSPVELA  2940
PSKRICTKST FVVGGQHESM LVSSSVAEGR QAGSAAVTSA SGLSGSLLST SLPPSRLSLQ  3000
ISSSAVTGSP GGSSSSALHP PAPSLRPSQL TKCTSGGGTP MTSPSSLLQH LRNRVSSGLN  3060
SSPSLLSHGA AVESPMAKKR IVSASALGGG ECEYVRIGNS LVRSPSMASK VMTGSSSMLG  3120
VSAMGHIMSG NGGSNAPCPL PMRSSTSAAS LTSAAGATAV RSSPLGCHVA VGQLGQRKVI  3180
TSQATTSASV VTRVIPGTPG AAGRLQQQQQ QQVQQQIKQQ VQQQQHPKHQ QRQQQEQREQ  3240
QRQQQQQQQK QQQKQQQKQQ QNQQQEQQQR QKQQKQQQQH QQPKEKQQRK QQHQQQQKEK  3300
QEQNQWQEEK RQLPPQQQVH QQQEQEQEQH QPQQQRQQQQ QEQQQQQHEQ EQQQHARQQQ  3360
QQEQQQQQHA RQQQQQEQQQ QQHEQEQQQH SRRQQQHARQ QQQQQQQHEQ LQQQHEQQHQ  3420
QHERQEKQNE QREQEAQVVL RSHSIGGSDG NDRVPRHACS ISEMNLQDST ATPAAHHHEV  3480
LPAKVDATRV TDSAVNLQDP GVISAAHQQE LPAGKGDDTQ GRAREGGVSS VTVSLESPFS  3540
DTAVNLHDSA VISAAHHEEL PTVKGDDEAQ VTARESGVSV ATISLVPPSS MVTDSAVNLQ  3600
DSAVVPVAHH QELPIGEGDD DTQAGASEGG VSAVTVSLES TRGGQGAGAS MTAAVCRPSL  3660
NDKLELAAGT VPVAANDGGL PIGKGNDTQV TVTEGGVSAV TMSLVSSRGK EDADASKTAV  3720
VCRHLQNDKL GLAGGTVPVA VNDGGLVNKK RSEKIKAVGK EQVNELAVTV STTVNNGRLV  3780
NEENPKGMRV VPKGLAHGLN DLGNDPSPSI PPAPEGALEI IVRSCSTAGP GDARKKEGSE  3840
SAAAAVLWGV QPAAEPGYAS AAVNVPSQVC ASQSVVVEME LPRQAMVGRT DGEEQASGDT  3900
GLGETGIEQG AMEEGMVFLQ QTRSSPPQNM ISVSGIERSA RSDEHDDGGT SIDDRCDFQA  3960
AGTLSTSGAE TTLPTSSSDG LADADVSMGG PCDLRVGTIC ASGDGTPEVD AGPLMSRGGG  4020
DAVPGPMSRN PLCSATALTV PAGARQVNGD QNLARDAGVA CSNNEMQNAE VKEGCILVVP  4080
MPLSSSHANM VDGCEISHGV QAAAKQRLPV ASTLQSGVRK TVGVVRCHRF ASTAVEPMVA  4140
ARPQVPQCQV LSPSTVNHIA GGPEGEAVMG EREEGIEGGP EPAGQAAIEA SETGIRGEGG  4200
TGVCTGTDLT KASKEDGMED MLLSLPSSSN RDCTGCGPDD DNGCCRKDGD DNGTCVCTGT  4260
DTTKASKEDG MEDVLSSLPS SSNRDCTGCX PDDDNGCCRK DGDDDADDGV GLEVRDRLLS  4320
NRNGLSSSAS SSTKGGKNHS LGPGAASGNR GAPNGNAMAR LADAETNAGM EGASGRYDGV  4380
CLSGNVPERQ VVGVNGTCIA GNEISTSREV DVVGHGGKQG ERVTETEALQ KGGRRETRDK  4440
VAPVHPLHES ASSCSGSSNC GRAGEQVVVV PIIAPKTSAG QRPWVGACLQ NSASMSKQKV  4500
LLGPPVNTVV ASAREQTKGA GSMATCVAVT TPISLGRVWM GNDERAGRAV MGRGIRSILT  4560
KQLSRSGSSF TRLVKKNGKI VFDSPAKYVR LRANQLVAVN ASASHPLSAG KGIQLRNTYV  4620
KSKVNQLVRS GVSCGVSMST APGMVGLGVR TGAVLSSPSR KVSPLSSSRA RQRLRGLLKV  4680
KAVITNRQAK QRSTVFPWRR GMNTRELGWR QTKRMSGHYS FWKDVDHLPL EMEKGSLLGL  4740
LRKQLQSKRM YEPVYTVSAN GLSMRRTGVR SNRSGANLKW TKCIGNRAIE ANEVATKAVA  4800
AVEKRNRERK EAVKKAKFDA AKDRARRKAA RRAAVRKSRE RIIFVGLAQY RMDPVGRTLH  4860
RIPEPPGISS KPASAFALLA PKRTSINGIM YVRVGNGSKL VRDPKAAIHA IASHRVKWSL  4920
HNARFSRGQK SQQYCLFYTR FGTCNKKDGK CPYIHDPEKV AVCTKFLVGK CDKENCLLTH  4980
KVIPERMPDC FYFLKGECTK AKCAYRHAEK IDPTGPICAA FRKGYCRDGY QCTKRHTYEC  5040
SVFAETGECP DGSSCKLHHS KKLRQVGQKR RRFGQKDTRT VFTDEDFGEG GLALHVPNAK  5100
PWYATEREAT NGVMTLRKLK LAEEGLRIGF WVEGSPFRKR PRVARGLEEE VVGRVLAGSR  5160
LHTVVTPSSI TEYRGADASD VMEGVGSMPL KETCSRSQLV GPGALVVNRR NCMDQVLWTT  5220
TAWTWTTLLD CGRMEHNSLF DRGWMDKNSL IRRDLLTRTV SQKRYDGRKL FRQASMDVNC  5280
MRSKLLGAEL
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1243   1247  RKRRR
2    2560   2572  KERKKNKEKERKK
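
The Start and End columns are 1-based, inclusive positions in the protein sequence. As a minimal sketch of how they map onto the formatted sequence block, the snippet below parses single lines copied from that block and slices out both NLS regions. The helper names `parse_line` and `region` are hypothetical, and the code assumes each line ends with the running position of its last residue, which holds for every line of the block except the final, shorter one.

```python
def parse_line(line: str) -> tuple[int, str]:
    """Split one formatted sequence line into (1-based start, residues).

    Assumes the line ends with the position of its last residue.
    """
    *blocks, end_pos = line.split()
    residues = "".join(blocks)
    return int(end_pos) - len(residues) + 1, residues


def region(line: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a single line."""
    line_start, residues = parse_line(line)
    return residues[start - line_start : end - line_start + 1]


# Lines copied verbatim from the protein sequence block.
line_1260 = "YGTTMVSRSG FPSLVPSDDG RKGAAAISSE AEGIETRDFG SSRKRRREYS GGVEENTKFA  1260"
line_2580 = "YGELYASLDG GKLAGHGHGE SGAHQEQQQQ QQQYQEEKDK ERKKNKEKER KKKSPAELNA  2580"

nls1 = region(line_1260, 1243, 1247)
nls2 = region(line_2580, 2560, 2572)
print(nls1)  # RKRRR
print(nls2)  # KERKKNKEKERKK
```

Both slices reproduce the sequences listed in the table, so the coordinates are consistent with the sequence block.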