PlantRegMap/PlantTFDB v5.0
Plant Transcription
Factor Database
|
Home TFext BLAST Prediction Download Help About Links PlantRegMap |
Transcription Factor Information
Basic Information? help Back to Top | |||||||||
---|---|---|---|---|---|---|---|---|---|
TF ID | GBG74540.1 | ||||||||
Organism | |||||||||
Taxonomic ID | |||||||||
Taxonomic Lineage |
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
|
||||||||
Family | C3H | ||||||||
Protein Properties | Length: 5290aa MW: 565196 Da PI: 5.4013 | ||||||||
Description | C3H family protein | ||||||||
Gene Model |
|
Signature Domain? help Back to Top | |||||||
---|---|---|---|---|---|---|---|
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
1 | zf-CCCH | 19.3 | 2.1e-06 | 5039 | 5060 | 5 | 26 |
--SGGGGTS--TTTTT-SS-SS CS zf-CCCH 5 lCrffartGtCkyGdrCkFaHg 26 +C+ fa+tG+C+ G++Ck +H+ GBG74540.1 5039 ECSVFAETGECPDGSSCKLHHS 5060 6*****************9998 PP |
Sequence ? help Back to Top |
---|
Protein Sequence Length: 5290 aa Download sequence |
MPNELCGRDW ANGGHAEGGP LPLGLQSHHQ QQLPGSARGP GRHHVAVSKT GGGVCWGGRY 60 SHEERRSNLE EEGQSDAEEQ GNHHLQSRRG EAIDVSSSGR WGDDCDDREG NHGGSWRGGE 120 GGISQHRHFR GHYRHRLSPP IRPERILSER GFSDRREEER REEERREEGG GGSSTIDDTG 180 GGAADIDRWE GGPAADGTGR CLLLGQSPVR HHLHPPTSPP TSPCSPTPLR GEEERNVMGE 240 AASPRGRRSN WASSATGGFL PSVIVGNKEV MGSGSRSRSS TSVARLLPVE GGSGYDGRRR 300 GGWEIAEHEL YPRRTAALMT TWTTRRKKTM TTRTRTTAMA KRLLKMDTES AGWAARAYGI 360 SLESLMGMVA VKGEEEEEEE EKGVEEEEEE VEEEEEEVEE ERRVHSVLMS GNECDYLWRG 420 GGSSRPMGVN RVGEHYLGGG GGGGDHDGQV RDSRRMRSEM SAGGGAEREK GGGAAAVRAD 480 LVSLGDDGCA SAGVAEEGEL VESEDGEEEE VAEGEDAWGE DTTSKHFVRR KSAGDDRLGT 540 SLSLERTSPS SPTTLQRREL NKNCNDRQNR ARMPIERHLN DPWGAEGGGG GGGGGGGVVE 600 EDASNEAHRD WRRFGGKEFG LNGHVLGERR WSMLNAWRRF GGKECKWGEE GSMMNEGRVG 660 GGVLAGKSAN GVGDFLAGKL SAGGGVDARE EGEEWRRRQK SAGTVAAAES PALISGRHVS 720 SSRARRTEMD SDWQTGIGMA NLAGVIGGRG AGKGGVLVSG RMPAVSAAEV MDALPAEERA 780 SFMNTWLSLA TRQEENLVDR ARGGALTRRR PSLDRDRDRD DDVDIAARGI GGGGEGGGEV 840 IHHHEDDEST VRLPTPRMGE SRFVQFQREV LTREGRLEQP GEEEGIQEIM ARDEVVIDTA 900 HEKLLSLRRG RRDLMRAAAD GLRKKGGGVG RQRREIGLER RSAEMSRFME WVDERGALER 960 GEEMGYEGRG GRGGGDEEER DSTPIGWQTL AIPPPGAPSS REGRAYGNDD DDDDDDDEHC 1020 HSFPVSSRQR GMSSSEGFWE IGERPLARAP AVDVDGGNSH TFDSAARVGI RTRSAPDLCR 1080 GGRGVGSVGV SRMMPTKAGG EEVVAVRGRE DGCRQGGSGG RGGGEGERER DFVRCGSRGR 1140 LAQNAAQPSG GAFSLAQNAA QPSGGAFSRS SMACVEDVRG GKGVGTTSSW FQYQYEDDDD 1200 YGTTMVSRSG FPSLVPSDDG RKGAAAISSE AEGIETRDFG SSRKRRREYS GGVEENTKFA 1260 GAAQREWSLG GEKSLALSED SLENTLSREE RWAAADAGGG GGGWSNGRAV PLIERLGVGG 1320 EGGGGEGGGE GGGEGGGGGG GGEALPLSSS NWLVNSGITT AALFGRRLRS EGDVSSPSPP 1380 VPSVYPVAEF PEGVADGFDN VGERSMPGSR QYCVEREGLT EVCGREEEAA EEELEEGEAE 1440 EDGEEEEEEE EEEEAEEEEV EARRNSIADQ GDGILSAYSM SAMTVPIDGR DKAAMEAQDE 1500 SEVKPGGPTS TSCGEEGDAL AFLREMAKLV SSGYGLDKCI KHMEGEAAPP YSPPPPLPPS 1560 DVDHDPESLL ATPLQRPFLG ATGSEDMAVR YGGQSTANQR PFPPATASGN SATLRTRRRK 1620 KGDEEGIAVL EDISNDDYDG FDEHDEYGAD RGGDGGRCGV QHDEGEGEHK GGRKLVHMSD 1680 PCFAWKGERA PSATALNRAK FAIEGGGSHQ SKRTLPSGFK VGSGREGGGQ IEDAPSMLGQ 1740 SGVVPEWGFA RDGYGEGNNY GATRMKGIEE GKDMMMGEGE EEEEEEEEEG EIAYREKMEN 1800 GKGVMMMEEG EEEEEEEEEG EIAYRGNREN GRGVMMMMEE GGEEEEEGEI VAREPQRCRM 1860 PKMDVHSLDS NARLRFCRTN EAARARGQAK LSKAAVDLLG MENEGEEGDV RGDEERENMK 1920 ENGRCGRDGS RMCGALAERG VRPPSEKGGI VIPSEKLGQL RGNVSDENDL LDESHVFGAA 1980 DGERWRVEKD VMVSLEKVPL LTGREATGLR EHGSITNEAA SAVELPFPSS RRVEHFEEEV 2040 VAMDSPVAIT SDHNEIDSEQ TERANDVASG GGQGDGLEDQ EGEGEEDDDN LYEVSGGKGQ 2100 SCRIRLGEVE VAATDMTDGY LGNIDADAND YHGDSGHDDG EEDVEEDEEE NEEEDEEEED 2160 EDMILLLPDN DPHFVREEKP MLCVSTWQDL HSSADSASMQ GVGGGAYSPW IGSTLQNPGF 2220 TGRTVSALPS CAAGRVFGLG GPGAGGSSRA TLLRGGGSIM TPMTMMMTPR HAIPSITPHK 2280 RLWNQWVASP SATCQRTQQQ GHGAPGAAGE GEASPMGEGE VCPTGEGEAT ALNTTDEEGP 2340 PSVHVGGEGE AMEMLRGEGG NRTMGNLIGE MTVENATDKM EMELAEAVGE VATHESNDPV 2400 EEEGESCGPM ETSNKRRRCS MVEVGVLKYG QEREQQQQQT ELRSAGCEQE EEGLQSLHSK 2460 SMELVGEGPP GPLGGLVSGS NEEASCMDEE GYELGRRKDA DCGYHDEGSV PGQREEGEDR 2520 YGELYASLDG GKLAGHGHGE SGAHQEQQQQ QQQYQEEKDK ERKKNKEKER KKKSPAELNA 2580 TLKALLSKFV NQGEFVMDAS PERERGQSSS SAVSGTKVGE GTNDEEQKTD GLSPEVLTRS 2640 VGRGRTAAGA AGAAAAMAIY SSGDRSRGGA SATVVSTRRI AQPEEQQQLL QRGVSTAVGA 2700 TKNSGHDVVV ADVASRIAHT DTGPGNAGGV SSRVRELVGR EHSSSLRYLQ QQQEGDVREV 2760 RVSAAAVLGG TARSSASASS SLQCSLVPTM PSGVMVGSLQ KRSPVAKAAA VPSAVRVSPV 2820 SSVQQHQLLS RMPVGGRGAV GVPSLLNPTS LPRPRVWSRK WCREDCVAPG AGSANVPSGA 2880 ATEGAGVAAH HTTTAGKMMG RPSGSSSPEQ YAASASSKTA SSPLLGIKPT AGPSSPVELA 2940 PSKRICTKST FVVGGQHESM LVSSSVAEGR QAGSAAVTSA SGLSGSLLST SLPPSRLSLQ 3000 ISSSAVTGSP GGSSSSALHP PAPSLRPSQL TKCTSGGGTP MTSPSSLLQH LRNRVSSGLN 3060 SSPSLLSHGA AVESPMAKKR IVSASALGGG ECEYVRIGNS LVRSPSMASK VMTGSSSMLG 3120 VSAMGHIMSG NGGSNAPCPL PMRSSTSAAS LTSAAGATAV RSSPLGCHVA VGQLGQRKVI 3180 TSQATTSASV VTRVIPGTPG AAGRLQQQQQ QQVQQQIKQQ VQQQQHPKHQ QRQQQEQREQ 3240 QRQQQQQQQK QQQKQQQKQQ QNQQQEQQQR QKQQKQQQQH QQPKEKQQRK QQHQQQQKEK 3300 QEQNQWQEEK RQLPPQQQVH QQQEQEQEQH QPQQQRQQQQ QEQQQQQHEQ EQQQHARQQQ 3360 QQEQQQQQHA RQQQQQEQQQ QQHEQEQQQH SRRQQQHARQ QQQQQQQHEQ LQQQHEQQHQ 3420 QHERQEKQNE QREQEAQVVL RSHSIGGSDG NDRVPRHACS ISEMNLQDST ATPAAHHHEV 3480 LPAKVDATRV TDSAVNLQDP GVISAAHQQE LPAGKGDDTQ GRAREGGVSS VTVSLESPFS 3540 DTAVNLHDSA VISAAHHEEL PTVKGDDEAQ VTARESGVSV ATISLVPPSS MVTDSAVNLQ 3600 DSAVVPVAHH QELPIGEGDD DTQAGASEGG VSAVTVSLES TRGGQGAGAS MTAAVCRPSL 3660 NDKLELAAGT VPVAANDGGL PIGKGNDTQV TVTEGGVSAV TMSLVSSRGK EDADASKTAV 3720 VCRHLQNDKL GLAGGTVPVA VNDGGLVNKK RSEKIKAVGK EQVNELAVTV STTVNNGRLV 3780 NEENPKGMRV VPKGLAHGLN DLGNDPSPSI PPAPEGALEI IVRSCSTAGP GDARKKEGSE 3840 SAAAAVLWGV QPAAEPGYAS AAVNVPSQVC ASQSVVVEME LPRQAMVGRT DGEEQASGDT 3900 GLGETGIEQG AMEEGMVFLQ QTRSSPPQNM ISVSGIERSA RSDEHDDGGT SIDDRCDFQA 3960 AGTLSTSGAE TTLPTSSSDG LADADVSMGG PCDLRVGTIC ASGDGTPEVD AGPLMSRGGG 4020 DAVPGPMSRN PLCSATALTV PAGARQVNGD QNLARDAGVA CSNNEMQNAE VKEGCILVVP 4080 MPLSSSHANM VDGCEISHGV QAAAKQRLPV ASTLQSGVRK TVGVVRCHRF ASTAVEPMVA 4140 ARPQVPQCQV LSPSTVNHIA GGPEGEAVMG EREEGIEGGP EPAGQAAIEA SETGIRGEGG 4200 TGVCTGTDLT KASKEDGMED MLLSLPSSSN RDCTGCGPDD DNGCCRKDGD DNGTCVCTGT 4260 DTTKASKEDG MEDVLSSLPS SSNRDCTGCX PDDDNGCCRK DGDDDADDGV GLEVRDRLLS 4320 NRNGLSSSAS SSTKGGKNHS LGPGAASGNR GAPNGNAMAR LADAETNAGM EGASGRYDGV 4380 CLSGNVPERQ VVGVNGTCIA GNEISTSREV DVVGHGGKQG ERVTETEALQ KGGRRETRDK 4440 VAPVHPLHES ASSCSGSSNC GRAGEQVVVV PIIAPKTSAG QRPWVGACLQ NSASMSKQKV 4500 LLGPPVNTVV ASAREQTKGA GSMATCVAVT TPISLGRVWM GNDERAGRAV MGRGIRSILT 4560 KQLSRSGSSF TRLVKKNGKI VFDSPAKYVR LRANQLVAVN ASASHPLSAG KGIQLRNTYV 4620 KSKVNQLVRS GVSCGVSMST APGMVGLGVR TGAVLSSPSR KVSPLSSSRA RQRLRGLLKV 4680 KAVITNRQAK QRSTVFPWRR GMNTRELGWR QTKRMSGHYS FWKDVDHLPL EMEKGSLLGL 4740 LRKQLQSKRM YEPVYTVSAN GLSMRRTGVR SNRSGANLKW TKCIGNRAIE ANEVATKAVA 4800 AVEKRNRERK EAVKKAKFDA AKDRARRKAA RRAAVRKSRE RIIFVGLAQY RMDPVGRTLH 4860 RIPEPPGISS KPASAFALLA PKRTSINGIM YVRVGNGSKL VRDPKAAIHA IASHRVKWSL 4920 HNARFSRGQK SQQYCLFYTR FGTCNKKDGK CPYIHDPEKV AVCTKFLVGK CDKENCLLTH 4980 KVIPERMPDC FYFLKGECTK AKCAYRHAEK IDPTGPICAA FRKGYCRDGY QCTKRHTYEC 5040 SVFAETGECP DGSSCKLHHS KKLRQVGQKR RRFGQKDTRT VFTDEDFGEG GLALHVPNAK 5100 PWYATEREAT NGVMTLRKLK LAEEGLRIGF WVEGSPFRKR PRVARGLEEE VVGRVLAGSR 5160 LHTVVTPSSI TEYRGADASD VMEGVGSMPL KETCSRSQLV GPGALVVNRR NCMDQVLWTT 5220 TAWTWTTLLD CGRMEHNSLF DRGWMDKNSL IRRDLLTRTV SQKRYDGRKL FRQASMDVNC 5280 MRSKLLGAEL |
Nucleic Localization Signal ? help Back to Top | |||
---|---|---|---|
No. | Start | End | Sequence |
1 | 1243 | 1247 | RKRRR |
2 | 2560 | 2572 | KERKKNKEKERKK |