PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bobra.0367s0009.2.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Elliptochloris clade; Botryococcus
Family AP2
Protein Properties Length: 2109aa    MW: 224905 Da    PI: 9.3929
Description AP2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bobra.0367s0009.2.pgenomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP234.26.1e-11226275155
                  AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                          s y+GV+w   +g+W+A+I+       +   +lg+f  + +Aa a+++a+++ +g
  Bobra.0367s0009.2.p 226 SIYRGVSWSTCTGKWRAQIWKG-----NDVSHLGYFEDEVKAAIAYDEAALANKG 275
                          57****************9994.....68889******99**********99876 PP

2AP235.32.9e-11560609155
                  AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55 
                          s++kGV+w+k +++W A+I+       + k   lg+f+  e+Aa+a++a ++k +g
  Bobra.0367s0009.2.p 560 SKFKGVSWHKHSQKWYAYIQA------SgKMRGLGYFDDQEDAARAYDAEARKVHG 609
                          79****************999......449999*******************9998 PP

3AP235.42.6e-11716767255
                  AP2   2 gykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                           ykGV+w+k r  W ++I++p     +++++ g+f+ + eAa+a+++  ++ +g
  Bobra.0367s0009.2.p 716 AYKGVSWHKHRRMWQVYIHVPQG--ASRSYHHGYFDDEIEAARAYDREVLRVRG 767
                          69*******9999*******932..4599999*****99*******98887776 PP

4AP222.43.1e-0711901235151
                  AP2    1 sgykGVrwdkkrgrWvAeIrd.psengkrkrfslgkfgtaeeAakaaiaark 51  
                           s ++GV+w k++++W+AeI         + +f   +    e+Aa++ ++a +
  Bobra.0367s0009.2.p 1190 SSFRGVTWSKQTHKWRAEIEIdG-----CVQFLGSSANE-EDAARMVDRALL 1235
                           679***************99944.....89988666666.******999875 PP

5AP224.56.7e-0813231364147
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaai 47  
                           s+y+GV+w++k ++W++ ++d        k+  lg+f  + +Aa+a +
  Bobra.0367s0009.2.p 1323 SKYRGVSWCEKGKKWRSLLWD------GaKQRFLGHFANEVDAARAFD 1364
                           89*******************......44**********88***9976 PP

6AP240.76.1e-1316801729155
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s++kGV+w  ++++W+A+++     g +k  ++g++ ++e+Aa+a++ a ++l+g
  Bobra.0367s0009.2.p 1680 SRFKGVSWSDSSNKWRAQCWN----G-SKVQYIGYYESEEDAARAYDTAILQLRG 1729
                           789***************666....4.7*************************98 PP

7AP236.71.1e-1117791828155
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s+ykGV+w +++++W+A++++      +k  +lg ++ + eAaka+++a  +l+g
  Bobra.0367s0009.2.p 1779 SKYKGVSWSERSHKWRAQLWHD-----NKVRHLGFYDDEVEAAKAYDRAVVELRG 1828
                           89*****************994.....5999*******99**********99998 PP

8AP248.42.4e-1518851934155
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s y+GVrw++++ +W+A+I d      +k++slg+++t+e Aaka++  +++l+g
  Bobra.0367s0009.2.p 1885 SPYRGVRWHERNTKWEARIFDG-----QKQISLGYYDTEEAAAKAYDVRALRLRG 1934
                           68****************9994.....6***********************9998 PP

9AP223.41.5e-0720252074155
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s y+GV++d  + +W A+   + + g     +lg f+++ eAa+a+++ + +++g
  Bobra.0367s0009.2.p 2025 SLYRGVSYDTGTCKWWAH---FAN-G-AHIRQLGAFDSEVEAAQAHDQEAIRMHG 2074
                           57****************...843.2.48889******99*************98 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2109 aa     Download sequence    
METLGADVGA SVKVPVPAPT TPVQPGALPD MGTQVLVSPG LPSTPLPGGP PLPKWPVALS  60
ETPETLKEPG MSKTLQLQYD SPSFVNLPGN APAPMSSTQQ QRGSCTPEVA PEGPQVSHSP  120
SSGMEHTESP GNSHRARPGI LWRESTRKWE AHGFANGRLE LLGCFDTEEA AQQYARHALG  180
YAGGGAMISP PKKPIPRLPG GPLLTVSSSG PRKASRKKGP HSHNSSIYRG VSWSTCTGKW  240
RAQIWKGNDV SHLGYFEDEV KAAIAYDEAA LANKGPGAIT NFCPSSYGYP KPDPTQSKRK  300
QKSGVKRRSP SSNPPFASES SPAPGAFIGG SSAGHPQKPS EDMELEILQG QSRGWGATKS  360
RSHSEETNPQ LAGRLPPTPL GDVTEVSPHV ARRKRTCSRK PSLAGLASHL VHEPAAEEAA  420
EALASMSSGD GETLSPRSSD PEGISGTAGI EEAPIPRLSR LISPFTKRLN SPSSCKESPA  480
GSSPKTGQEP DALLGGRRRS TRISDAHDAF AASLLTGLAC KGDAEPSSRA SPAISEGGGS  540
SRKRGSVGRG TASESAAKSS KFKGVSWHKH SQKWYAYIQA SGKMRGLGYF DDQEDAARAY  600
DAEARKVHGA KAVVNFDLEG NRILRECPSR GRSLPSSAVN SEDMASADES NPGDTPRSRH  660
RPTSLPWGPA PSGVPSRMPG SRPNSSGAAT SVGSEEDPSA WGTRRPPYKK PAAHSAYKGV  720
SWHKHRRMWQ VYIHVPQGAS RSYHHGYFDD EIEAARAYDR EVLRVRGPTT PTNFPLSDYI  780
NADGTFINPA HASSSGKGEA RSRPGSSGKA ASPPPPRGAK GASRRQHADS HFEPIEKPDA  840
LYMLMNAAMS LDAPARESDE ESWREEPPLK KRWRRSGTPR PPAPDQRGRV LGSSQDAGLI  900
HRSASPHSIG PCSATRSSTP PPSQQNRLGS SGQILWSPTG AAVTPLIAQA VDLFTSNSPK  960
PVPHTPATAV EIPPRRHPPI KPPPLAEDAH SKGSFPEFAA LMHAAKVFPL SVDERQPMEL  1020
DQQGSQGGDP MGADGTMAVQ QPRDNDAAAA LAPTPRNDGA EDPGSTEGPQ EKETEHMDVD  1080
PQVQNSTAGL TQSGRGHTDV PEIEVSLPGQ AKISSYYRSV GSPAWKWDTQ GPGRAAARTD  1140
AGSHPRSIRR GSPSWTVSRG LGAVSEASPL GTQSHPEGES CETGEGKRVS SFRGVTWSKQ  1200
THKWRAEIEI DGCVQFLGSS ANEEDAARMV DRALLQHPAY PGRLNHPPAE FQMPLWMKLG  1260
TGLSPDDTRV QGAGEALGGP PNVKVEPSSG LKDPMDINEG TPKASTPKSG KQPRSPHSPR  1320
GTSKYRGVSW CEKGKKWRSL LWDGAKQRFL GHFANEVDAA RAFDRNSIML KGKDAAKLNF  1380
PISDYNLEEL AAQFASAPQG PPEAFHPSAE ISQSQSKQTS ASNKQPPVEG PREPQDVATQ  1440
GTEQERTAQA PKAQAPVFRE QLMTPVVPPA SPANPFGEQL RREQMHGESP GVPGVGGTLL  1500
LANTLLSRLD NASRASPLIN PATCIPSPQE IETPVQRPRH FAAGGDGLEA QGSLTPLGDP  1560
NREPQLYCMT PEAQGPPDAG TWRPSRSCSE ASPHEEPGAL KEAPDAEPSQ ATLAQVMPSP  1620
LEAPQGLVHG SSAPHLGPSL GRQAPKQVGK RSRTPDPSST VHGPEGRDRV WWPGPGKRSS  1680
RFKGVSWSDS SNKWRAQCWN GSKVQYIGYY ESEEDAARAY DTAILQLRGP LMQTNFPATN  1740
YESRTNADGQ GEGEVENPQG HIGGTGSPGG RSGIQRGTSK YKGVSWSERS HKWRAQLWHD  1800
NKVRHLGFYD DEVEAAKAYD RAVVELRGTS AQTNFPIAGA GGARPLISHA TVTPSQSAGG  1860
TPVPTTEGPI SPPRVRGSPN AKGSSPYRGV RWHERNTKWE ARIFDGQKQI SLGYYDTEEA  1920
AAKAYDVRAL RLRGPNTQVN FPQSPVAEPA PPHMHGSDGP GDSARRRRSL SFTPDSPVHT  1980
PPGPGVLGPA DSTSPSGWGV RKPRSGVPGL PPRSPIDSSR APRVSLYRGV SYDTGTCKWW  2040
AHFANGAHIR QLGAFDSEVE AAQAHDQEAI RMHGSRAVTN FPHPEVQAIK QPLFGSPVPH  2100
SGAPPAGQ*
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G72570.14e-21AP2 family protein