PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG88391.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family AP2
Protein Properties Length: 2240aa    MW: 238578 Da    PI: 8.4368
Description AP2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG88391.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP244.63.6e-1413251374155
         AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55  
                  s+ykGV+++k++grW+++I++        k+++lg f ta+ Aa+ +++a +k++g
  GBG88391.1 1325 SDYKGVTFCKRKGRWESYIWV------DaKQVYLGGFTTADAAARVYDRAVLKFRG 1374
                  78*******************......44************************998 PP

2AP233.11.4e-1017291778155
         AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                  s y+GV  ++++g W + I d       k+++lg f ta+ Aa+a+++a +k++g
  GBG88391.1 1729 SNYRGVAICRRTGGWTSCISDG-----GKQVYLGGFTTADAAARAYDRAVLKFRG 1778
                  57****************9993.....4************************998 PP

3AP230.21.1e-0918211863150
         AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaar 50  
                  s+y+GV+ +   grW+A+ r       +k ++lg f  +e+Aa+a+++a 
  GBG88391.1 1821 SRYRGVTLHT-GGRWEARMRR------KKLIYLGLFTEEEDAARAYDRAV 1863
                  79******99.9******999......7*******************985 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2240 aa     Download sequence    
MSHSFPATPT DGSRGIGFTE SGRFGNQRCT AMVCTRMTPV HFSSSATSAG SSLFVPRRQL  60
RDASADGDDV HVSTAQPVSA IGCDKLGRSE IGMMVTAREE ETESYCSPSR PAECPAKMVA  120
ADGFRLPLIN NHADPRCARN TAHPHHECIQ AISPARYGHD EGALYLGKDG SHGLESRICQ  180
REWAVNKEVD IPLASSSMAM AAPATLPLAA SQMAPPPTPP AAMAEAVAMT GERMTVVVED  240
SAADVEAAAT ATLDRTPSRA PPPPPQFLFH LAPYGDPGDL TTGGGGGGGG GNDSCSVHCL  300
AVCGDPDGCL TTGGGGGDCA CHVPAGQRSS LSRAALADDS VGTAGLRLAG CRRLESHHHQ  360
HAHRANASSL PLSVPLLSQS LQDIKEGAMR QSLHANRKDE TSDGFPLSTA RDPCFNTYKS  420
NQETGGPNQE GEGLGSVAEP LLGSPYPLDY PLIDLRHTAC DQTQRLPPRV VIRMQACSAD  480
RVSSEHISSS TKRSNRRSSS STSSMGTNGR TKADDGSIAR DIMKREINSE GLVCFSTSSS  540
SSSSPSSTSS SFSLPFHASG LFNGGNESSL SLVLPASAGP TTSSAFPPSS SSSSYSLSQS  600
LHASLLSSNR PSASARPVVD LRWAGSAPGQ NGGPGAETRS PSLVSCPAGN RSAIASLLKR  660
EEDQYLVRQL DQVLNQLQQQ GRHLARTTPL SSGDLHVVKV KREEEEEERS EQGNGQEEQD  720
REEEEEEEAE QKRERRSGHL LAQDGSHSQA TVPVLGDLQP IMITDQSSNG QEKTEGPKHQ  780
QLLVHRQRGR HSQIAMSMPS SSHGLFQSFD NDNRNEMEVK KEKDPLLVYH HSHHHSHHHS  840
YCHSYGCGDP RAPTILRSDL LQALRAQCME QQKQKEKEKE PTRMQEEEDE RDMDKEEEDG  900
EEQKEKENPL RRVHNGQTIV PKTEALQLAV HTRGEEGKEG EKEAEDEKDK QEKEEGAERT  960
AVQTVHSAVH KRLCGVILPV GREMTESGPG AMNPRPLKQL PVLYEQRAFP KLEPELELEL  1020
ELEPRLISVN GQKEEKVVHK AGTGAGTGAG TGAGTRGRAG TATGAGARVM RAGTETGAGT  1080
RARIGPETGA GTGTGTGTGT GRVKTGTGTG TGAVTGAGTG AGTGAGTGAG TGAGTGRART  1140
GTGAGAGTGA GTGRARTGTG TGAGTGTGRV RTGPETRART GAGTGAGTGP EAGAGTRDGT  1200
ATAAGARTGT AIGAGTRAGI GTGSGAGPRR SARGTSSGTG AASEVGPGTE NGPGTGIGTG  1260
TQARTEPAVA TRLTRRVPST RLLRSNQQQQ QQPDETIKTG GGGPASEVPV VKPAGTCYGP  1320
RARSSDYKGV TFCKRKGRWE SYIWVDAKQV YLGGFTTADA AARVYDRAVL KFRGPTAKVN  1380
FTASDYDADM QKVAHLTKEE FVRSLRRQRR CFSCGRSQYR GVKLHKCSHG EARMRRVLIK  1440
KYIYGGLFED EEDAARAYDR ALVRSSGKAA MTNFDFTQYQ GDVDFHKAAA MAARDAGGKS  1500
SDNGARSGAG AASGSGKRGA LIHNNSPNKK IKSDTASKTE ASIPPPPPTR TPPLCHPPPP  1560
GHDRRLSGSV MVAAQDRDIS ISSSVGGEHC RTLTIERRKP TRSVWTKTET DRPMVDKIDS  1620
VRTTEEIGHG TSFATGDGTI GAVSAATEAG PGTDNGPGMS IGTGIVARTE PAVGTRVTRR  1680
VPSTRLPQSN QQQQPEETIK TEGGDPASEP AVSAVKSDGT RYGPKSRSSN YRGVAICRRT  1740
GGWTSCISDG GKQVYLGGFT TADAAARAYD RAVLKFRGPA AELNFTASDY DADLQKVAHL  1800
TKEEYIDSLR RQRRCFSRGR SRYRGVTLHT GGRWEARMRR KKLIYLGLFT EEEDAARAYD  1860
RAVVRSNGKD SVTNFDLSQY QEELDFHKAA TMAARVAGGN SLDNVARSGE GAASGRGKRQ  1920
ALIHTNSPNK KEKLDTTSKT EASIHPPPPP PPTPRTPSCN PTPRTPSCNP TPLSRPPLPP  1980
PPPGHDRTIA AQFRDRDISI SSSVEGEKAT RSVWMTRKTD YQMMDKMDLV RVSEENDQNV  2040
PQLDEALDSF APGGESGSDL LLASAAGEGR RRLPSPEKVP LERWEEIMRQ PLFGNRWITN  2100
SQGQVMCSSA SPWKAWVKRG VVRIADLWDT DRRDWKTAGQ LRSTLAYCHH AERGLEEIID  2160
ATPLSWYSVL KKGPAVNEGD WIKIRGEDNT IMEMSWYGRR SFPATEERAK GSTFDNAVGR  2220
LDLGDGMPPC PPRSVPRSAP
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G28550.36e-42AP2 family protein