PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bobra.0367s0009.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Elliptochloris clade; Botryococcus
Family AP2
Protein Properties Length: 2133aa    MW: 227446 Da    PI: 9.2116
Description AP2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bobra.0367s0009.1.pgenomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP234.26.2e-11250299155
                  AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                          s y+GV+w   +g+W+A+I+       +   +lg+f  + +Aa a+++a+++ +g
  Bobra.0367s0009.1.p 250 SIYRGVSWSTCTGKWRAQIWKG-----NDVSHLGYFEDEVKAAIAYDEAALANKG 299
                          57****************9994.....68889******99**********99876 PP

2AP235.32.9e-11584633155
                  AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55 
                          s++kGV+w+k +++W A+I+       + k   lg+f+  e+Aa+a++a ++k +g
  Bobra.0367s0009.1.p 584 SKFKGVSWHKHSQKWYAYIQA------SgKMRGLGYFDDQEDAARAYDAEARKVHG 633
                          79****************999......449999*******************9998 PP

3AP235.42.6e-11740791255
                  AP2   2 gykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                           ykGV+w+k r  W ++I++p     +++++ g+f+ + eAa+a+++  ++ +g
  Bobra.0367s0009.1.p 740 AYKGVSWHKHRRMWQVYIHVPQG--ASRSYHHGYFDDEIEAARAYDREVLRVRG 791
                          69*******9999*******932..4599999*****99*******98887776 PP

4AP222.43.1e-0712141259151
                  AP2    1 sgykGVrwdkkrgrWvAeIrd.psengkrkrfslgkfgtaeeAakaaiaark 51  
                           s ++GV+w k++++W+AeI         + +f   +    e+Aa++ ++a +
  Bobra.0367s0009.1.p 1214 SSFRGVTWSKQTHKWRAEIEIdG-----CVQFLGSSANE-EDAARMVDRALL 1259
                           679***************99944.....89988666666.******999875 PP

5AP224.56.8e-0813471388147
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaai 47  
                           s+y+GV+w++k ++W++ ++d        k+  lg+f  + +Aa+a +
  Bobra.0367s0009.1.p 1347 SKYRGVSWCEKGKKWRSLLWD------GaKQRFLGHFANEVDAARAFD 1388
                           89*******************......44**********88***9976 PP

6AP240.66.2e-1317041753155
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s++kGV+w  ++++W+A+++     g +k  ++g++ ++e+Aa+a++ a ++l+g
  Bobra.0367s0009.1.p 1704 SRFKGVSWSDSSNKWRAQCWN----G-SKVQYIGYYESEEDAARAYDTAILQLRG 1753
                           789***************666....4.7*************************98 PP

7AP236.61.1e-1118031852155
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s+ykGV+w +++++W+A++++      +k  +lg ++ + eAaka+++a  +l+g
  Bobra.0367s0009.1.p 1803 SKYKGVSWSERSHKWRAQLWHD-----NKVRHLGFYDDEVEAAKAYDRAVVELRG 1852
                           89*****************994.....5999*******99**********99998 PP

8AP248.32.4e-1519091958155
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s y+GVrw++++ +W+A+I d      +k++slg+++t+e Aaka++  +++l+g
  Bobra.0367s0009.1.p 1909 SPYRGVRWHERNTKWEARIFDG-----QKQISLGYYDTEEAAAKAYDVRALRLRG 1958
                           68****************9994.....6***********************9998 PP

9AP223.31.6e-0720492098155
                  AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                           s y+GV++d  + +W A+   + + g     +lg f+++ eAa+a+++ + +++g
  Bobra.0367s0009.1.p 2049 SLYRGVSYDTGTCKWWAH---FAN-G-AHIRQLGAFDSEVEAAQAHDQEAIRMHG 2098
                           57****************...843.2.48889******99*************98 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2133 aa     Download sequence    
MSGIESAPSE DGYQQFQSDA QVNAMETLGA DVGASVKVPV PAPTTPVQPG ALPDMGTQVL  60
VSPGLPSTPL PGGPPLPKWP VALSETPETL KEPGMSKTLQ LQYDSPSFVN LPGNAPAPMS  120
STQQQRGSCT PEVAPEGPQV SHSPSSGMEH TESPGNSHRA RPGILWREST RKWEAHGFAN  180
GRLELLGCFD TEEAAQQYAR HALGYAGGGA MISPPKKPIP RLPGGPLLTV SSSGPRKASR  240
KKGPHSHNSS IYRGVSWSTC TGKWRAQIWK GNDVSHLGYF EDEVKAAIAY DEAALANKGP  300
GAITNFCPSS YGYPKPDPTQ SKRKQKSGVK RRSPSSNPPF ASESSPAPGA FIGGSSAGHP  360
QKPSEDMELE ILQGQSRGWG ATKSRSHSEE TNPQLAGRLP PTPLGDVTEV SPHVARRKRT  420
CSRKPSLAGL ASHLVHEPAA EEAAEALASM SSGDGETLSP RSSDPEGISG TAGIEEAPIP  480
RLSRLISPFT KRLNSPSSCK ESPAGSSPKT GQEPDALLGG RRRSTRISDA HDAFAASLLT  540
GLACKGDAEP SSRASPAISE GGGSSRKRGS VGRGTASESA AKSSKFKGVS WHKHSQKWYA  600
YIQASGKMRG LGYFDDQEDA ARAYDAEARK VHGAKAVVNF DLEGNRILRE CPSRGRSLPS  660
SAVNSEDMAS ADESNPGDTP RSRHRPTSLP WGPAPSGVPS RMPGSRPNSS GAATSVGSEE  720
DPSAWGTRRP PYKKPAAHSA YKGVSWHKHR RMWQVYIHVP QGASRSYHHG YFDDEIEAAR  780
AYDREVLRVR GPTTPTNFPL SDYINADGTF INPAHASSSG KGEARSRPGS SGKAASPPPP  840
RGAKGASRRQ HADSHFEPIE KPDALYMLMN AAMSLDAPAR ESDEESWREE PPLKKRWRRS  900
GTPRPPAPDQ RGRVLGSSQD AGLIHRSASP HSIGPCSATR SSTPPPSQQN RLGSSGQILW  960
SPTGAAVTPL IAQAVDLFTS NSPKPVPHTP ATAVEIPPRR HPPIKPPPLA EDAHSKGSFP  1020
EFAALMHAAK VFPLSVDERQ PMELDQQGSQ GGDPMGADGT MAVQQPRDND AAAALAPTPR  1080
NDGAEDPGST EGPQEKETEH MDVDPQVQNS TAGLTQSGRG HTDVPEIEVS LPGQAKISSY  1140
YRSVGSPAWK WDTQGPGRAA ARTDAGSHPR SIRRGSPSWT VSRGLGAVSE ASPLGTQSHP  1200
EGESCETGEG KRVSSFRGVT WSKQTHKWRA EIEIDGCVQF LGSSANEEDA ARMVDRALLQ  1260
HPAYPGRLNH PPAEFQMPLW MKLGTGLSPD DTRVQGAGEA LGGPPNVKVE PSSGLKDPMD  1320
INEGTPKAST PKSGKQPRSP HSPRGTSKYR GVSWCEKGKK WRSLLWDGAK QRFLGHFANE  1380
VDAARAFDRN SIMLKGKDAA KLNFPISDYN LEELAAQFAS APQGPPEAFH PSAEISQSQS  1440
KQTSASNKQP PVEGPREPQD VATQGTEQER TAQAPKAQAP VFREQLMTPV VPPASPANPF  1500
GEQLRREQMH GESPGVPGVG GTLLLANTLL SRLDNASRAS PLINPATCIP SPQEIETPVQ  1560
RPRHFAAGGD GLEAQGSLTP LGDPNREPQL YCMTPEAQGP PDAGTWRPSR SCSEASPHEE  1620
PGALKEAPDA EPSQATLAQV MPSPLEAPQG LVHGSSAPHL GPSLGRQAPK QVGKRSRTPD  1680
PSSTVHGPEG RDRVWWPGPG KRSSRFKGVS WSDSSNKWRA QCWNGSKVQY IGYYESEEDA  1740
ARAYDTAILQ LRGPLMQTNF PATNYESRTN ADGQGEGEVE NPQGHIGGTG SPGGRSGIQR  1800
GTSKYKGVSW SERSHKWRAQ LWHDNKVRHL GFYDDEVEAA KAYDRAVVEL RGTSAQTNFP  1860
IAGAGGARPL ISHATVTPSQ SAGGTPVPTT EGPISPPRVR GSPNAKGSSP YRGVRWHERN  1920
TKWEARIFDG QKQISLGYYD TEEAAAKAYD VRALRLRGPN TQVNFPQSPV AEPAPPHMHG  1980
SDGPGDSARR RRSLSFTPDS PVHTPPGPGV LGPADSTSPS GWGVRKPRSG VPGLPPRSPI  2040
DSSRAPRVSL YRGVSYDTGT CKWWAHFANG AHIRQLGAFD SEVEAAQAHD QEAIRMHGSR  2100
AVTNFPHPEV QAIKQPLFGS PVPHSGAPPA GQ*
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G51190.14e-21AP2 family protein