PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Bobra.0367s0009.3.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Elliptochloris clade; Botryococcus
Family AP2
Protein Properties Length: 2108 aa    MW: 224776 Da    pI: 9.3929
Description AP2 family protein
Gene Model
Gene Model ID    Type    Source    Coding Sequence
Bobra.0367s0009.3.p    genome    Phytozome    View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    AP2    34.2    6.1e-11    226    275    1    55
                  AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                          s y+GV+w   +g+W+A+I+       +   +lg+f  + +Aa a+++a+++ +g
  Bobra.0367s0009.3.p 226 SIYRGVSWSTCTGKWRAQIWKG-----NDVSHLGYFEDEVKAAIAYDEAALANKG 275
                          57****************9994.....68889******99**********99876 PP

2    AP2    35.3    2.9e-11    560    609    1    55
                  AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55 
                          s++kGV+w+k +++W A+I+       + k   lg+f+  e+Aa+a++a ++k +g
  Bobra.0367s0009.3.p 560 SKFKGVSWHKHSQKWYAYIQA------SgKMRGLGYFDDQEDAARAYDAEARKVHG 609
                          79****************999......449999*******************9998 PP

3    AP2    35.5    2.6e-11    716    767    2    55
                  AP2   2 gykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                           ykGV+w+k r  W ++I++p     +++++ g+f+ + eAa+a+++  ++ +g
  Bobra.0367s0009.3.p 716 AYKGVSWHKHRRMWQVYIHVPQG--ASRSYHHGYFDDEIEAARAYDREVLRVRG 767
                          69*******9999*******932..4599999*****99*******98887776 PP

4    AP2    22.4    3.1e-07    1189    1234    1    51
                  AP2    1 sgykGVrwdkkrgrWvAeIrd.psengkrkrfslgkfgtaeeAakaaiaark 51  
                           s ++GV+w k++++W+AeI         + +f   +    e+Aa++ ++a +
  Bobra.0367s0009.3.p 1189 SSFRGVTWSKQTHKWRAEIEIdG-----CVQFLGSSANE-EDAARMVDRALL 1234
                           679***************99944.....89988666666.******999875 PP

5    AP2    24.5    6.7e-08    1322    1363    1    47
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaai 47  
                           s+y+GV+w++k ++W++ ++d        k+  lg+f  + +Aa+a +
  Bobra.0367s0009.3.p 1322 SKYRGVSWCEKGKKWRSLLWD------GaKQRFLGHFANEVDAARAFD 1363
                           89*******************......44**********88***9976 PP

6    AP2    40.7    6.1e-13    1679    1728    1    55
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s++kGV+w  ++++W+A+++     g +k  ++g++ ++e+Aa+a++ a ++l+g
  Bobra.0367s0009.3.p 1679 SRFKGVSWSDSSNKWRAQCWN----G-SKVQYIGYYESEEDAARAYDTAILQLRG 1728
                           789***************666....4.7*************************98 PP

7    AP2    36.7    1.1e-11    1778    1827    1    55
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s+ykGV+w +++++W+A++++      +k  +lg ++ + eAaka+++a  +l+g
  Bobra.0367s0009.3.p 1778 SKYKGVSWSERSHKWRAQLWHD-----NKVRHLGFYDDEVEAAKAYDRAVVELRG 1827
                           89*****************994.....5999*******99**********99998 PP

8    AP2    48.4    2.4e-15    1884    1933    1    55
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s y+GVrw++++ +W+A+I d      +k++slg+++t+e Aaka++  +++l+g
  Bobra.0367s0009.3.p 1884 SPYRGVRWHERNTKWEARIFDG-----QKQISLGYYDTEEAAAKAYDVRALRLRG 1933
                           68****************9994.....6***********************9998 PP

9    AP2    23.4    1.5e-07    2024    2073    1    55
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s y+GV++d  + +W A+   + + g     +lg f+++ eAa+a+++ + +++g
  Bobra.0367s0009.3.p 2024 SLYRGVSYDTGTCKWWAH---FAN-G-AHIRQLGAFDSEVEAAQAHDQEAIRMHG 2073
                           57****************...843.2.48889******99*************98 PP
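
The Start and End values listed above read as 1-based, inclusive protein coordinates (the first alignment spans residues 226 to 275 and shows 50 aligned residues). Under that assumption, the following minimal Python sketch computes the length of each AP2 hit, the linker length between consecutive hits, and the share of the 2108 aa protein covered by the hits. The names `DOMAINS`, `PROTEIN_LENGTH`, and `span_length` are illustrative only and are not part of PlantTFDB.

```python
# (start, end) pairs copied from the Start and End columns of the nine AP2 hits.
DOMAINS = [
    (226, 275), (560, 609), (716, 767), (1189, 1234), (1322, 1363),
    (1679, 1728), (1778, 1827), (1884, 1933), (2024, 2073),
]
PROTEIN_LENGTH = 2108  # from the "Protein Properties" line of this record


def span_length(start: int, end: int) -> int:
    """Length of a region given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


# Residues covered by each hit.
lengths = [span_length(s, e) for s, e in DOMAINS]

# Residues lying strictly between one hit and the next.
linkers = [
    nxt_start - prev_end - 1
    for (_, prev_end), (nxt_start, _) in zip(DOMAINS, DOMAINS[1:])
]

total = sum(lengths)
print("hit lengths:", lengths)
print("linker lengths:", linkers)
print(f"covered by AP2 hits: {total} aa ({total / PROTEIN_LENGTH:.1%})")
```

The hits do not overlap, so summing their lengths gives the covered residue count directly; overlapping hits would need to be merged first.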

Sequence
Protein Sequence    Length: 2108 aa
METLGADVGA SVKVPVPAPT TPVQPGALPD MGTQVLVSPG LPSTPLPGGP PLPKWPVALS  60
ETPETLKEPG MSKTLQLQYD SPSFVNLPGN APAPMSSTQQ QRGSCTPEVA PEGPQVSHSP  120
SSGMEHTESP GNSHRARPGI LWRESTRKWE AHGFANGRLE LLGCFDTEEA AQQYARHALG  180
YAGGGAMISP PKKPIPRLPG GPLLTVSSSG PRKASRKKGP HSHNSSIYRG VSWSTCTGKW  240
RAQIWKGNDV SHLGYFEDEV KAAIAYDEAA LANKGPGAIT NFCPSSYGYP KPDPTQSKRK  300
QKSGVKRRSP SSNPPFASES SPAPGAFIGG SSAGHPQKPS EDMELEILQG QSRGWGATKS  360
RSHSEETNPQ LAGRLPPTPL GDVTEVSPHV ARRKRTCSRK PSLAGLASHL VHEPAAEEAA  420
EALASMSSGD GETLSPRSSD PEGISGTAGI EEAPIPRLSR LISPFTKRLN SPSSCKESPA  480
GSSPKTGQEP DALLGGRRRS TRISDAHDAF AASLLTGLAC KGDAEPSSRA SPAISEGGGS  540
SRKRGSVGRG TASESAAKSS KFKGVSWHKH SQKWYAYIQA SGKMRGLGYF DDQEDAARAY  600
DAEARKVHGA KAVVNFDLEG NRILRECPSR GRSLPSSAVN SEDMASADES NPGDTPRSRH  660
RPTSLPWGPA PSGVPSRMPG SRPNSSGAAT SVGSEEDPSA WGTRRPPYKK PAAHSAYKGV  720
SWHKHRRMWQ VYIHVPQGAS RSYHHGYFDD EIEAARAYDR EVLRVRGPTT PTNFPLSDYI  780
NADGTFINPA HASSSGKGEA RSRPGSSGKA ASPPPPRGAK GASRRQHADS HFEPIEKPDA  840
LYMLMNAAMS LDAPARESDE ESWREEPPLK KRWRRSGTPR PPAPDQRGRV LGSSQDAGLI  900
HRSASPHSIG PCSATRSSTP PPSQNRLGSS GQILWSPTGA AVTPLIAQAV DLFTSNSPKP  960
VPHTPATAVE IPPRRHPPIK PPPLAEDAHS KGSFPEFAAL MHAAKVFPLS VDERQPMELD  1020
QQGSQGGDPM GADGTMAVQQ PRDNDAAAAL APTPRNDGAE DPGSTEGPQE KETEHMDVDP  1080
QVQNSTAGLT QSGRGHTDVP EIEVSLPGQA KISSYYRSVG SPAWKWDTQG PGRAAARTDA  1140
GSHPRSIRRG SPSWTVSRGL GAVSEASPLG TQSHPEGESC ETGEGKRVSS FRGVTWSKQT  1200
HKWRAEIEID GCVQFLGSSA NEEDAARMVD RALLQHPAYP GRLNHPPAEF QMPLWMKLGT  1260
GLSPDDTRVQ GAGEALGGPP NVKVEPSSGL KDPMDINEGT PKASTPKSGK QPRSPHSPRG  1320
TSKYRGVSWC EKGKKWRSLL WDGAKQRFLG HFANEVDAAR AFDRNSIMLK GKDAAKLNFP  1380
ISDYNLEELA AQFASAPQGP PEAFHPSAEI SQSQSKQTSA SNKQPPVEGP REPQDVATQG  1440
TEQERTAQAP KAQAPVFREQ LMTPVVPPAS PANPFGEQLR REQMHGESPG VPGVGGTLLL  1500
ANTLLSRLDN ASRASPLINP ATCIPSPQEI ETPVQRPRHF AAGGDGLEAQ GSLTPLGDPN  1560
REPQLYCMTP EAQGPPDAGT WRPSRSCSEA SPHEEPGALK EAPDAEPSQA TLAQVMPSPL  1620
EAPQGLVHGS SAPHLGPSLG RQAPKQVGKR SRTPDPSSTV HGPEGRDRVW WPGPGKRSSR  1680
FKGVSWSDSS NKWRAQCWNG SKVQYIGYYE SEEDAARAYD TAILQLRGPL MQTNFPATNY  1740
ESRTNADGQG EGEVENPQGH IGGTGSPGGR SGIQRGTSKY KGVSWSERSH KWRAQLWHDN  1800
KVRHLGFYDD EVEAAKAYDR AVVELRGTSA QTNFPIAGAG GARPLISHAT VTPSQSAGGT  1860
PVPTTEGPIS PPRVRGSPNA KGSSPYRGVR WHERNTKWEA RIFDGQKQIS LGYYDTEEAA  1920
AKAYDVRALR LRGPNTQVNF PQSPVAEPAP PHMHGSDGPG DSARRRRSLS FTPDSPVHTP  1980
PGPGVLGPAD STSPSGWGVR KPRSGVPGLP PRSPIDSSRA PRVSLYRGVS YDTGTCKWWA  2040
HFANGAHIRQ LGAFDSEVEA AQAHDQEAIR MHGSRAVTNF PHPEVQAIKQ PLFGSPVPHS  2100
GAPPAGQ*
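
The listing above is printed in blocks of 10 residues with a running position count at the end of each line and a trailing `*`. To pull a domain out by its coordinates, the listing first has to be joined into one contiguous string. The sketch below is one way to do that in Python; the helper names `parse_numbered_sequence` and `region` are hypothetical, and the coordinates are assumed to be 1-based and inclusive. Only the first two lines of the listing are used here to keep the example short.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Join a numbered, blocked sequence listing into one contiguous string.

    Keeping only letters drops the spaces between blocks, the running
    position counts at the line ends, and a trailing '*'.
    """
    return "".join(re.findall(r"[A-Za-z]+", text)).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"region {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


# First two lines of the protein sequence listing, copied verbatim.
listing = """
METLGADVGA SVKVPVPAPT TPVQPGALPD MGTQVLVSPG LPSTPLPGGP PLPKWPVALS  60
ETPETLKEPG MSKTLQLQYD SPSFVNLPGN APAPMSSTQQ QRGSCTPEVA PEGPQVSHSP  120
"""

seq = parse_numbered_sequence(listing)
print(len(seq))             # number of residues parsed from the excerpt
print(region(seq, 61, 70))  # first block of the second line
```

Passing the full listing to `parse_numbered_sequence` and then calling `region` with a Start and End pair from the Signature Domain table should return the aligned residues of that hit without the gap characters.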
Best hit in Arabidopsis thaliana
Hit ID    E-value    Description
AT1G72570.1    4e-21    AP2 family protein