PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID estExt_Genemark1.C_30153
Common Name COCSUDRAFT_64835
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Trebouxiophyceae incertae sedis; Coccomyxaceae; Coccomyxa; Coccomyxa subellipsoidea
Family AP2
Protein Properties Length: 2206 aa    MW: 233557 Da    pI: 9.5792
Description AP2 family protein
Gene Model
Gene Model ID             Type    Source  Coding Sequence
estExt_Genemark1.C_30153  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    AP2     42.5   1.7e-13  147    196   1          55
                       AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55 
                               s+y+GV w+k++ +W+A+I+d      + k+  lg+f ++eeAa+ +++a+ ++ g
  estExt_Genemark1.C_30153 147 SKYRGVIWHKSNSKWEARIYD------NgKQRFLGYFTSEEEAARVYDEAAMRIGG 196
                               89****************999......44**********************99866 PP

2    AP2     30     1.3e-09  329    377   2          55
                       AP2   2 gykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                                + GV+wd   g+W Ae++d      r +  lg+f+++e Aa+a+++a ++ + 
  estExt_Genemark1.C_30153 329 PFLGVSWDAAAGSWKAELWDG-----REYALLGHFDSEEAAARAYDRACLAQHR 377
                               678*****************4.....7********************9888775 PP

3    AP2     30.5   8.8e-10  561    609   2          55
                       AP2   2 gykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55 
                                ykGV+w+k +++W A+I+         k   lg+f+  e+Aa+a++a ++k +g
  estExt_Genemark1.C_30153 561 AYKGVSWHKHSQKWYAYIQA------AgKMRGLGYFDLQEDAARAYDAEARKVHG 609
                               69***************999......339999*******************9998 PP

4    AP2     29.8   1.4e-09  705    757   1          55
                       AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                               s+++GV+w+k r  W ++I++ s+   r +++ g+f  + +Aaka+++  +k +g
  estExt_Genemark1.C_30153 705 SKFRGVSWHKHRRMWQVYIHVQSQ--ARNSYHMGYFAEEIDAAKAYDREILKVRG 757
                               79********9999*******433..249*********99*******98887776 PP

5    AP2     24.9   5.2e-08  1067   1118  1          55
                       AP2    1 sgykGVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkleg 55  
                                s+y+GV+w++   +WvA  +d       +k   +g f+t+e+Aa a++   ++++g
  estExt_Genemark1.C_30153 1067 SQYRGVTWNSIISKWVAVAWD----RdAKKARAIGFFDTEEQAAHAYDVEILAYNG 1118
                                78****************999....22348889***************87777766 PP

6    AP2     38.7   2.4e-12  1316   1365  1          55
                       AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55  
                                s+++GV+ +k +g+++A+Ir+        k+++lg+f  +eeAa+a +aa+++++g
  estExt_Genemark1.C_30153 1316 SRFRGVSLNKASGKFEARIRE------AgKNHYLGSFSDEEEAARAFDAAALAMRG 1365
                                789******************......44*************************98 PP

7    AP2     43.7   6.7e-14  1538   1588  1          56
                       AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkklege 56  
                                s+ykGV+w + + +W+A+++d      +k  ++g+f+ +eeAa+a++ a+++l+g+
  estExt_Genemark1.C_30153 1538 SQYKGVSWSEASAKWRAQCWDG-----SKVKYIGYFDGEEEAARAYDTAMLALRGN 1588
                                78*****************994.....6*************************995 PP

8    AP2     32.3   2.5e-10  1670   1719  1          55
                       AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                                s+ykGV+w +++++W+A++++      +k  +lg +  +e+Aa+a++aa  +l+g
  estExt_Genemark1.C_30153 1670 SQYKGVSWSERSKKWRAQLWHE-----NKVNHLGFWELEEDAARAYDAAVSQLRG 1719
                                78*******************6.....477778877999***********99998 PP

9    AP2     48.9   1.6e-15  1769   1819  1          54
                       AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkle 54  
                                s+y+GVrw++++grW+A+I d s    + k++slg++  +eeAa+a++a   +++
  estExt_Genemark1.C_30153 1769 SKYRGVRWHERNGRWEARIFDNS----TgKQISLGYYEAEEEAARAYDAESIRIR 1819
                                89****************99932....25*******************9777766 PP

10   AP2     28.2   4.7e-09  1901   1951  1          55
                       AP2    1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55  
                                s+y+GV wd+ ++ W+++     ++g   r + g f+t+ eAa a++aa ++l+g
  estExt_Genemark1.C_30153 1901 SCYRGVVWDPDTQYWAVR---LATRG-GERRQFGMFDTEIEAAIAYDAAVLELFG 1951
                                79**************99...54544.4777889****99************987 PP
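The Start and End columns in the Signature Domain table read most naturally as 1-based, inclusive residue positions (domain 1 begins at S147 of the protein sequence). Assuming that convention, the following Python sketch derives the length of each AP2 domain and shows how a domain could be sliced out of a sequence string. The names `AP2_DOMAINS`, `domain_length`, and `slice_domain` are illustrative and are not part of PlantTFDB.

```python
# (No., Start, End) copied from the Signature Domain table.
# Assumption: coordinates are 1-based and inclusive.
AP2_DOMAINS = [
    (1, 147, 196), (2, 329, 377), (3, 561, 609), (4, 705, 757),
    (5, 1067, 1118), (6, 1316, 1365), (7, 1538, 1588),
    (8, 1670, 1719), (9, 1769, 1819), (10, 1901, 1951),
]


def domain_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1


def slice_domain(protein: str, start: int, end: int) -> str:
    """Residues covered by a 1-based, inclusive range of a protein string."""
    return protein[start - 1:end]


# Length of every listed domain, keyed by its row number.
lengths = {no: domain_length(start, end) for no, start, end in AP2_DOMAINS}

# True when each domain ends before the next one starts.
non_overlapping = all(
    prev[2] < nxt[1] for prev, nxt in zip(AP2_DOMAINS, AP2_DOMAINS[1:])
)

print(lengths[1])        # 50
print(non_overlapping)   # True
```

Passing the full protein sequence of this entry as a single string to `slice_domain(protein, 147, 196)` would return the residues aligned in domain 1, provided the coordinate assumption holds.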

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End   InterPro ID  Description
CDD              cd00018            9.00E-14  147    205   No hit       No description
Pfam             PF00847            2.0E-7    147    196   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           3.47E-15  147    205   IPR016177    DNA-binding domain
Gene3D           G3DSA:3.30.730.10  6.6E-14   148    204   IPR001471    AP2/ERF domain
SMART            SM00380            1.0E-17   148    210   IPR001471    AP2/ERF domain
PROSITE profile  PS51032            16.081    148    204   IPR001471    AP2/ERF domain
PROSITE profile  PS51032            9.361     239    305   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           8.5E-7    262    305   IPR016177    DNA-binding domain
SMART            SM00380            2.8E-4    262    311   IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10  4.8E-7    274    304   IPR001471    AP2/ERF domain
SMART            SM00380            6.5E-6    329    391   IPR001471    AP2/ERF domain
PROSITE profile  PS51032            12.339    329    385   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           8.5E-10   329    385   IPR016177    DNA-binding domain
Pfam             PF00847            6.2E-5    332    374   IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10  9.4E-9    338    385   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           4.51E-13  560    617   IPR016177    DNA-binding domain
Gene3D           G3DSA:3.30.730.10  1.3E-10   561    617   IPR001471    AP2/ERF domain
PROSITE profile  PS51032            13.037    561    617   IPR001471    AP2/ERF domain
SMART            SM00380            1.3E-6    561    623   IPR001471    AP2/ERF domain
Pfam             PF00847            1.3E-4    561    606   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           5.82E-11  705    767   IPR016177    DNA-binding domain
Gene3D           G3DSA:3.30.730.10  3.3E-10   706    767   IPR001471    AP2/ERF domain
PROSITE profile  PS51032            11.759    706    765   IPR001471    AP2/ERF domain
SMART            SM00380            8.4E-5    706    771   IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10  4.7E-7    931    982   IPR001471    AP2/ERF domain
SMART            SM00380            4.1E-5    931    988   IPR001471    AP2/ERF domain
PROSITE profile  PS51032            9.611     931    982   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           2.94E-7   932    984   IPR016177    DNA-binding domain
CDD              cd00018            5.36E-10  1067   1128  No hit       No description
SuperFamily      SSF54171           7.85E-13  1067   1128  IPR016177    DNA-binding domain
Gene3D           G3DSA:3.30.730.10  9.1E-13   1068   1128  IPR001471    AP2/ERF domain
SMART            SM00380            2.1E-6    1068   1132  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            13.828    1068   1126  IPR001471    AP2/ERF domain
Pfam             PF00847            1.5E-5    1316   1365  IPR001471    AP2/ERF domain
SuperFamily      SSF54171           1.5E-13   1316   1373  IPR016177    DNA-binding domain
Gene3D           G3DSA:3.30.730.10  1.3E-14   1317   1372  IPR001471    AP2/ERF domain
SMART            SM00380            1.7E-18   1317   1379  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            15.567    1317   1373  IPR001471    AP2/ERF domain
SuperFamily      SSF54171           8.5E-13   1538   1595  IPR016177    DNA-binding domain
CDD              cd00018            2.29E-11  1538   1597  No hit       No description
Pfam             PF00847            5.6E-8    1538   1587  IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10  3.5E-11   1539   1595  IPR001471    AP2/ERF domain
SMART            SM00380            7.6E-8    1539   1601  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            14.302    1539   1595  IPR001471    AP2/ERF domain
SuperFamily      SSF54171           9.15E-12  1670   1728  IPR016177    DNA-binding domain
Pfam             PF00847            1.7E-4    1670   1719  IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10  2.2E-10   1671   1727  IPR001471    AP2/ERF domain
SMART            SM00380            3.4E-4    1671   1733  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            13.63     1671   1727  IPR001471    AP2/ERF domain
SuperFamily      SSF54171           2.22E-15  1769   1827  IPR016177    DNA-binding domain
Pfam             PF00847            5.4E-10   1769   1813  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            15.396    1770   1828  IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10  1.0E-13   1770   1827  IPR001471    AP2/ERF domain
SMART            SM00380            2.4E-15   1770   1834  IPR001471    AP2/ERF domain
SuperFamily      SSF54171           9.81E-11  1901   1960  IPR016177    DNA-binding domain
Gene3D           G3DSA:3.30.730.10  4.4E-9    1902   1960  IPR001471    AP2/ERF domain
SMART            SM00380            2.2E-4    1902   1965  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            12.141    1902   1959  IPR001471    AP2/ERF domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
Sequence
Protein Sequence    Length: 2206 aa
MQAWEGGDRD AATKLISAQV PTEQTRITQK QQKGRQARPM SSPFAAAAQQ EYPQDGWGVR  60
WNEELKRWEA QAGPGAAAAL DEPPAEAEEE EADHLDRVGS GDAAGSACSD PANPRPSRRR  120
AVPSRRYSPS AAQQLQDETR GRPGGPSKYR GVIWHKSNSK WEARIYDNGK QRFLGYFTSE  180
EEAARVYDEA AMRIGGRGAR TNFPAGECLS RSSSAPAELL DMGGTSEGPT AAAPPAALPP  240
KGGRLRKKAS SSGTGGLKGS SKYRGVWKGN DVRHLGYFED EVAAARAYDR AVLEIRGAHA  300
PTNFGPEDYG VAVPGPAAAA TDTAEVDSPF LGVSWDAAAG SWKAELWDGR EYALLGHFDS  360
EEAAARAYDR ACLAQHREAA NTNYPPGDYE EEMAAAALIS AVQRMSDDEE EASDLEMSAL  420
EALASISNEA EVDCEGDDAA CTSGRGQGGL QRGDQYMEES PPPELRRRDS GPPFEREAPP  480
SARLRRAMSD PIERIGSLSR RRSARLSDAD AATAAAALAG LFTKPASSEE PAPVVSAAAT  540
NSRGARSVRS GGSADAPKSS AYKGVSWHKH SQKWYAYIQA AGKMRGLGYF DLQEDAARAY  600
DAEARKVHGK KAVVNFRMYP DDVVREPKNR GVSSGSADTS GPSLEALPSA SISIGEDKPS  660
ARPASGPRSR GGRSERLCGK RDRAGSPTSE EVSRGTPRVG GPRSSKFRGV SWHKHRRMWQ  720
VYIHVQSQAR NSYHMGYFAE EIDAAKAYDR EILKVRGKDA VTNFPDSEMS GDAELKSLEH  780
VAAAAGDGHM LGEDDQAGSP TSAQPLTITY NPASADQGAP EDGEASPTCS AFSLGSLPLR  840
KRSRKPKHVH STAETRSPSP PRHPKPPRHD AAEAKRRQGT PLAEEGGMQL RNGEAGRGRR  900
VGSPQKEPWA APSTSGGVNA VGEAGGDVRA SFRGVTRLER ERKWVARVWN GQKQLTLGRF  960
DTDAYDREML RMKGRAAVTN FPADMYGPLV QEVSRSAVLV VACILRATSN ILLQSDVPSP  1020
RRPVAKSSPA GSFALTTIRP ASAATVGNGD AAPGGSQMAL PGSKSTSQYR GVTWNSIISK  1080
WVAVAWDRDA KKARAIGFFD TEEQAAHAYD VEILAYNGPA ATLNFPQSKQ IAAMMNKAPD  1140
ARPTSAGSAV SSTDVVLDLL ASMTPQSTSQ TPGQPPVRQA AGMDQLASFL RSGAPVPPEF  1200
AQALQMMSRG QQPASQPSRS PLQAPSPTSS PSHATAGAPR TADSTRAESV PPGNGALPAN  1260
ADAEETPSPV PSPLQRHVAA NLAASVAASP RSGSAGDGAP SMERGQRVAA RGANTSRFRG  1320
VSLNKASGKF EARIREAGKN HYLGSFSDEE EAARAFDAAA LAMRGRNAVC NFLLDDGPGA  1380
AAQGASPRHT RQVTVRTSPT AAPAAAAFPG PPRSDGPAPN QAPQMRAAPQ GQGVREAELS  1440
RDDIVAGARE SARLHAPHGE EAIKSDQLGS LAEAAVAQER AAGASPCAAA AGDTWPGLAP  1500
PAGGRVVGST VAALRGRVQQ MGGADARAHW PGPGRRSSQY KGVSWSEASA KWRAQCWDGS  1560
KVKYIGYFDG EEEAARAYDT AMLALRGNSA QTNFAAAEYT GEAIAKAEDA VWGQRQHRAK  1620
SEEPTGVEGI KVELAARVRV PSRRVTSPTN AAAHSGRAAP PSFAYHQGTS QYKGVSWSER  1680
SKKWRAQLWH ENKVNHLGFW ELEEDAARAY DAAVSQLRGA GAAVNFPAPG TVRPLVSSRT  1740
ITTCPAGGPS TTVVVEAIPR INVNAKGSSK YRGVRWHERN GRWEARIFDN STGKQISLGY  1800
YEAEEEAARA YDAESIRIRG IHAHVNLRAP SAARPRRTRR RAASKAVSSE EDDEASWPVK  1860
RPRGFNPAIA RRDLQSMAAA AAAIASARPP EPGASKAPRT SCYRGVVWDP DTQYWAVRLA  1920
TRGGERRQFG MFDTEIEAAI AYDAAVLELF GSRTPTNFDS EYGPAGSPLV PVPKRPRTES  1980
SAVARANAAL FLLDTAVMSP SLMGPLAGNL QHQAQSAANL GAEIRRAVQA RLGLPEPAAD  2040
SALTRQLPGG FAPHPQGSSP DKQSFLRGAG AAAGSQRQDE NSGVDVPVRY GVLADAAQQR  2100
IAVGTPPKTG WQPPAGSTQL GWPQGGSDPL QQPPALFAGA GPTHRYIAGK GQDIARELQL  2160
QSGAKGPNAG GGAADLHLPT PLKLPLLPAA AARDRPDSAG IATNH*
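The sequence above is laid out in blocks of ten residues with a running position counter at the end of each line and a trailing `*`. A minimal Python sketch for turning that layout into a plain residue string follows. The function name `clean_sequence` is hypothetical, and treating `*` as a terminator to be dropped is an assumption about this page's formatting.

```python
import re


def clean_sequence(block: str) -> str:
    """Strip spaces, newlines, and position counters from a formatted block.

    Assumption: a trailing '*' marks the end of the protein and is not a residue.
    """
    # Keep only letters and '*'; digits and whitespace are layout, not sequence.
    residues = re.sub(r"[^A-Za-z*]", "", block)
    return residues.rstrip("*").upper()


# First two lines of the sequence block, copied verbatim from the record.
first_lines = """
MQAWEGGDRD AATKLISAQV PTEQTRITQK QQKGRQARPM SSPFAAAAQQ EYPQDGWGVR  60
WNEELKRWEA QAGPGAAAAL DEPPAEAEEE EADHLDRVGS GDAAGSACSD PANPRPSRRR  120
"""

seq = clean_sequence(first_lines)
print(len(seq))   # 120
print(seq[:10])   # MQAWEGGDRD
```

Running the same function over the whole block yields a string whose indices can be compared directly with the Start and End columns of the domain and feature tables.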
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1972   1978  PKRPRTE
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_005650330.1  0.0      hypothetical protein COCSUDRAFT_64835
TrEMBL  I0Z567          0.0      I0Z567_COCSC; Uncharacterized protein
STRING  XP_005650330.1  0.0      (Coccomyxa subellipsoidea)
Orthologous Group
Lineage       Orthologous Group ID  Taxa Number  Gene Number
Chlorophytae  OGCP574088
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G37750.1  2e-15    AP2 family protein
Publications
  1. Blanc G, et al.
    The genome of the polar eukaryotic microalga Coccomyxa subellipsoidea reveals traits of cold adaptation.
    Genome Biol., 2012. 13(5): p. R39
    [PMID:22630137]