PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG64146.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family CAMTA
Protein Properties Length: 3575aa    MW: 404381 Da    PI: 6.6479
Description CAMTA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG64146.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1CG-1117.38e-371456153535116
        CG-1   35 ksgsliLynrkkvryfrkDGyswkkkkdgktvrEdhekLKvggvevlycyYahseenptfqrrcywlLeeelekivlvhyle 116 
                  + g++ L+++k  r+frkDG++w+kk+dgktv+E hekLKvg+ evl+cyYa +++n  fqrr+ywlLeee  +ivlvhyle
  GBG64146.1 1456 PGGRFYLFDKKTCRFFRKDGHCWQKKRDGKTVKESHEKLKVGSAEVLHCYYATGQDNGRFQRRSYWLLEEE--RIVLVHYLE 1535
                  468999****************************************************************9..*******97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 3575 aa     Download sequence    
MASTQSETFL QKIIEECKVR WLRPAEVQFI LENYKQLNLD VRVQPVKEPE KEEEKLKEVD  60
KEEEEDETPL QRKRGQHGGS KDEEMEKRIS EWVANLSLGE DEEVTMYIPK DEQEAAMKKW  120
EEEEDVLNRQ AMEDETRMVW KLAMMREKKR RVEAASEAVK ELEEVQKLTL RLSPQVDLQQ  180
KVDIIAQSVE RLARVQEQQY EFSRSQDIAV CSMRMGFRDF ARELVGAVGA KVNHRLEKTE  240
RFCVGAIEGV KVAAPKEGEP RPRREPVKVS PDQHVLIAIH ALRDKAASFA RSLVRAANCN  300
DDVVAYSSFT PLAEFMKLLH ERFADVARSV KASDKLQTIH ARKWKSARAL KSTMEELVAV  360
PDHGVTDTQL VALFYRAMPE AFRGHFFAKS EDPATTYDSL SREVVAFEAK SVSVSIFWHK  420
DLDKGKQWEG RTISGQVKTK DSLVLTLDEG SVDEIPYDQI EWGLEEEDSG VGQGRTYAAV  480
VAGGRPQRGG RGQGQGGRAS GSRFQGDQGV GGRGGNRQAG GRCQGGCLDS FPETACVRVS  540
LDTSLTGVAE ELKERRPAWA KMKKENKGQL ILVDTKIFRG RVGALVDSGA TRNYISKKAL  600
QKLKLGLKVQ KLADPIVSIL ADNRTMRVED YAEGVQAYFR LEQDGKVEKV LHSLTLLVED  660
SLPFDIVLGM DWGEAAGATL HLKEHECRLP SPSGEAKTAR LFHVSGVENP LAHCCLSAPA  720
FARLVKKEKL EEQVFVAYVR PVTEPTEEKS VDPAIAKLLE EFKDLTEPPT GMVPRPIQHR  780
IEIKPGSKTP KGVVYRMTRN GHYEFIVMPF GLTNAPATFQ RCMNDLFRPW LDRFVVVYLD  840
DILVFSKILQ EHQGHLRQVL EKLREANFKI NAKKCDWAKT QVLYPGHVLD GDGVKPEDCK  900
IAAIRDWSTP RTLTELRSFL GLANYYRKFV RNFSTIAAPL RRLLRKETIW KWDKDCTSAV  960
KKLKQAMIEY PVLKVADPSL PFVVTTDASQ YGIGAVLQQD DGNGYRPVEF MPARMPSEKV  1020
ATSTYERELY ALRQALEHWK HYLLGRHFKV YSDHETLRWL KTKAKMTPKL TRWAAEIDQY  1080
DFELKPVKGK YNVVADALSR RADYFGAIVH YLDIGKDLQQ KIREAYAQDP IYSDLLKKVK  1140
EAPETEPNYR TTEGLLFEKT NIFDHLCIPS SEEFSPLTSL QRELREIQQV EMLRRKLEKD  1200
LKDATDRENA MKTRAARLES LEADKAELEG LDESALTDPL KVLKKNMLSL HAHVDSKLDF  1260
MQSTLDQILD ALTRPGFRPP AQSPLPLSAM SGPFPVQAGT QPSGTSAAPA QTVASSSSGP  1320
AVVATPPQQS VPAQGQQQGQ WYPKTPMKPP LAFSVERKDE ELNTWLRTVP IWVKAKRTLP  1380
EDEVVTAASY IEGKAAKWLD GVVIKAGYGR RMADWAKFWP TGPLHVGVVI QGLNRKSPFM  1440
FMSSAVSAAH KLRLEPGGRF YLFDKKTCRF FRKDGHCWQK KRDGKTVKES HEKLKVGSAE  1500
VLHCYYATGQ DNGRFQRRSY WLLEEERIVL VHYLEKPEAA ERAVATINVS IPGTVTEEAT  1560
DLTPSTSQGL TMPQTPDFSG APLPAGDARG DPFDETNISD VDSDADIGYF HYLALSPKPA  1620
QEQHPHLEGR VPSTSYMLND MVTGSTNPAT IIGSEHSNQQ EWGHVSCSIP FSTSLSWGSG  1680
EEQMDGQICD PDAQSDDWGH HSLLQIHKES HPPLQAGGQQ WEEEQADVSF KGGDLVWSPS  1740
ESEELQYQRN DGKPVIWNDH MDPGPSSGDE WPSTILPLPE DHKDDQLGVL RSHPPKITLA  1800
DGPDQGINWL LLDGGDLDEP SPSSLLVGPQ DRDEGFSGAS DGEHAAPNEM TEVEESFFQS  1860
GSKSSLSNRS HQVLGTIAGE ARHNTTWGTY TQAPKIGDGE NCKGPISATV AQGEGTGFLP  1920
GLHQHTVYRQ QRPQPSSQQQ PPMEQSSTSD VLYNVQPPHD YQLNSHEHFL SSNLQQQQQQ  1980
QQQLYQQEQH QQQYQQEQHQ HQYQQEQHQQ QYQQHQRFQQ QYQEQQQDRY LQKQQQYPYH  2040
EQQHLPPQPS SSASRASEDE RGSEEDAFAD ELDDVESLIP MPFSEVEVPH RLEQHGKPEF  2100
PECQRVMPGY LMEGKEKKIP SQQARYLDKV ALRIFPERAG VAHSPMESTG SVSVSTTPMP  2160
KLFGHVVDMM RRDAVKTGID DEVDRQLTER LDRLLLSEDA ASSIQFAKLT ISAAHHRYII  2220
EDFAPQWVFT GDHQAKVIVI GHVQDHSVRS WFCKFGDVEV EADMVRDGIL RCKPPVRQKE  2280
EKVLFCVTCG DGNCCSNVRV FDYRGQGTGA ENVTQMGLLI RIVKTVLSDE LDNQLADEAV  2340
RTGTDRNPGA LGEWEWLLRG EQSLAYVQEH LLQMLLKRRL SEWLHKQCQM SYRGQDGNPC  2400
TAVDRLDNTG MGIVHIAAAL GYGWAVSILR AAQASLDSCD LGGRTPLHWA AAFGRDLTVS  2460
TLLAAGALPG ILLKDTWTAA ELAGMTGHQA ISAMLSLSTL EQSEDMLSSE LSRTGIMASI  2520
QAHEAGQDAV ARAMLYEDAH EEYLDSTPQD EEEAKAAVYR AARAASRIQA VYRQYYARKK  2580
ESEREVRKLF AARKIQRAYR GFQKKKLKKR TEAATKIQKV YRGWRRRREF VQKRKKIVKL  2640
QALWRGRRQR TEYRKMRNAL RVIEVAVVQW RNRRRKSLEA QLLSAAQEEK DKDTLRSNRQ  2700
GEEWRNVGGV HHQAALEQAV LRVQAIYRSK REQVNQFRKM MEEKQAQKEQ HQAAAEAARL  2760
QAEAEAAAEK QRLQAAADTD AQVRCKEAQD LLQQHEAASV DKLKFWHFEP SEGHDDVAPE  2820
EQLKEFLSKL VTRLVYTCNH LQSKLGNLRR AVRNHKDLYE DATMALLSRV QDLEQAAPGP  2880
DAGEPSNAAS TRQLEQRVDH VVAMLDDVST FAALATISKQ LDTLKIDVRQ LHQPPDNDSS  2940
TSASRPYKMR TFRIKKFDDY THQDPVVWWQ GFTTEIGIYE VPNHLYISAR FLNAKGGCQI  3000
WLSHMATIQG VQVSDQISWD DMTNEWKKRF IVDDAPALAI NRLFAMTQGN TPTRDWLTKW  3060
QKIVATSDLE LPFSHLCWEF YNRSCAALSL ALGDREYDTH EKSVWQPAYV EKGTFGPRLQ  3120
PVAAVQLDNL VEDPAATPAS REGDQVAVVQ LRSNNSRGKG EAKTASPVGN GQPVPWVKSN  3180
LTEAEYNQDF MVRAGLGPRV RRKSRPAQVT LVDHHMHKSI DWYIDSVLVY LATHASEIVS  3240
FDILDTKFDM ILAMSWLRSE DHPVNFYSRT VHIRDWNGVP VPCTVLPPRP SINCHVVSAA  3300
SIRASITRDD IEEMGVCYLH ALPPHDISST DPRITELLDA YGDVFEAPYR VVPDRLIRQE  3360
VVLEAGVVPP RGCIYLMSEE ELSVLWAQLD DLLEKGWIRP SSSPYDAPVL FILKKNKDLR  3420
LCIDYRKLNA QTVKNVNPLP CIDDLLKRLG DAKFFSKLDL KLGYDQLEIR QEDRYKTAFK  3480
TRYGHFEWLV MPFGLMNAPA TFQAAMTTEF GHMLDRFVLI YLDDILVYKS EFGRACGAHA  3540
HRFGTATTSQ DQYDERQGAR QRLTFSGYHH NRFHY
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
126052639KKLKKRTEAATKIQKVYRGWRRRREFVQKRKKIVK
226082628KKRTEAATKIQKVYRGWRRRR
326252636RRRREFVQKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G16150.11e-42CAMTA family protein