PlantRegMap/PlantTFDB v5.0
Plant Transcription
Factor Database
|
Home TFext BLAST Prediction Download Help About Links PlantRegMap |
Transcription Factor Information
Basic Information? help Back to Top | |||||||||
---|---|---|---|---|---|---|---|---|---|
TF ID | estExt_Genemark1.C_30153 | ||||||||
Common Name | COCSUDRAFT_64835 | ||||||||
Organism | |||||||||
Taxonomic ID | |||||||||
Taxonomic Lineage |
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Trebouxiophyceae incertae sedis; Coccomyxaceae; Coccomyxa; Coccomyxa subellipsoidea
|
||||||||
Family | AP2 | ||||||||
Protein Properties | Length: 2206aa MW: 233557 Da PI: 9.5792 | ||||||||
Description | AP2 family protein | ||||||||
Gene Model |
|
Signature Domain? help Back to Top | |||||||
---|---|---|---|---|---|---|---|
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
1 | AP2 | 42.5 | 1.7e-13 | 147 | 196 | 1 | 55 |
AP2 1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55 s+y+GV w+k++ +W+A+I+d + k+ lg+f ++eeAa+ +++a+ ++ g estExt_Genemark1.C_30153 147 SKYRGVIWHKSNSKWEARIYD------NgKQRFLGYFTSEEEAARVYDEAAMRIGG 196 89****************999......44**********************99866 PP | |||||||
2 | AP2 | 30 | 1.3e-09 | 329 | 377 | 2 | 55 |
AP2 2 gykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 + GV+wd g+W Ae++d r + lg+f+++e Aa+a+++a ++ + estExt_Genemark1.C_30153 329 PFLGVSWDAAAGSWKAELWDG-----REYALLGHFDSEEAAARAYDRACLAQHR 377 678*****************4.....7********************9888775 PP | |||||||
3 | AP2 | 30.5 | 8.8e-10 | 561 | 609 | 2 | 55 |
AP2 2 gykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55 ykGV+w+k +++W A+I+ k lg+f+ e+Aa+a++a ++k +g estExt_Genemark1.C_30153 561 AYKGVSWHKHSQKWYAYIQA------AgKMRGLGYFDLQEDAARAYDAEARKVHG 609 69***************999......339999*******************9998 PP | |||||||
4 | AP2 | 29.8 | 1.4e-09 | 705 | 757 | 1 | 55 |
AP2 1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 s+++GV+w+k r W ++I++ s+ r +++ g+f + +Aaka+++ +k +g estExt_Genemark1.C_30153 705 SKFRGVSWHKHRRMWQVYIHVQSQ--ARNSYHMGYFAEEIDAAKAYDREILKVRG 757 79********9999*******433..249*********99*******98887776 PP | |||||||
5 | AP2 | 24.9 | 5.2e-08 | 1067 | 1118 | 1 | 55 |
AP2 1 sgykGVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkleg 55 s+y+GV+w++ +WvA +d +k +g f+t+e+Aa a++ ++++g estExt_Genemark1.C_30153 1067 SQYRGVTWNSIISKWVAVAWD----RdAKKARAIGFFDTEEQAAHAYDVEILAYNG 1118 78****************999....22348889***************87777766 PP | |||||||
6 | AP2 | 38.7 | 2.4e-12 | 1316 | 1365 | 1 | 55 |
AP2 1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkleg 55 s+++GV+ +k +g+++A+Ir+ k+++lg+f +eeAa+a +aa+++++g estExt_Genemark1.C_30153 1316 SRFRGVSLNKASGKFEARIRE------AgKNHYLGSFSDEEEAARAFDAAALAMRG 1365 789******************......44*************************98 PP | |||||||
7 | AP2 | 43.7 | 6.7e-14 | 1538 | 1588 | 1 | 56 |
AP2 1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkklege 56 s+ykGV+w + + +W+A+++d +k ++g+f+ +eeAa+a++ a+++l+g+ estExt_Genemark1.C_30153 1538 SQYKGVSWSEASAKWRAQCWDG-----SKVKYIGYFDGEEEAARAYDTAMLALRGN 1588 78*****************994.....6*************************995 PP | |||||||
8 | AP2 | 32.3 | 2.5e-10 | 1670 | 1719 | 1 | 55 |
AP2 1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 s+ykGV+w +++++W+A++++ +k +lg + +e+Aa+a++aa +l+g estExt_Genemark1.C_30153 1670 SQYKGVSWSERSKKWRAQLWHE-----NKVNHLGFWELEEDAARAYDAAVSQLRG 1719 78*******************6.....477778877999***********99998 PP | |||||||
9 | AP2 | 48.9 | 1.6e-15 | 1769 | 1819 | 1 | 54 |
AP2 1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkle 54 s+y+GVrw++++grW+A+I d s + k++slg++ +eeAa+a++a +++ estExt_Genemark1.C_30153 1769 SKYRGVRWHERNGRWEARIFDNS----TgKQISLGYYEAEEEAARAYDAESIRIR 1819 89****************99932....25*******************9777766 PP | |||||||
10 | AP2 | 28.2 | 4.7e-09 | 1901 | 1951 | 1 | 55 |
AP2 1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 s+y+GV wd+ ++ W+++ ++g r + g f+t+ eAa a++aa ++l+g estExt_Genemark1.C_30153 1901 SCYRGVVWDPDTQYWAVR---LATRG-GERRQFGMFDTEIEAAIAYDAAVLELFG 1951 79**************99...54544.4777889****99************987 PP |
Protein Features ? help Back to Top | ||||||
---|---|---|---|---|---|---|
Database | Entry ID | E-value | Start | End | InterPro ID | Description |
CDD | cd00018 | 9.00E-14 | 147 | 205 | No hit | No description |
Pfam | PF00847 | 2.0E-7 | 147 | 196 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 3.47E-15 | 147 | 205 | IPR016177 | DNA-binding domain |
Gene3D | G3DSA:3.30.730.10 | 6.6E-14 | 148 | 204 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 1.0E-17 | 148 | 210 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 16.081 | 148 | 204 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 9.361 | 239 | 305 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 8.5E-7 | 262 | 305 | IPR016177 | DNA-binding domain |
SMART | SM00380 | 2.8E-4 | 262 | 311 | IPR001471 | AP2/ERF domain |
Gene3D | G3DSA:3.30.730.10 | 4.8E-7 | 274 | 304 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 6.5E-6 | 329 | 391 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 12.339 | 329 | 385 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 8.5E-10 | 329 | 385 | IPR016177 | DNA-binding domain |
Pfam | PF00847 | 6.2E-5 | 332 | 374 | IPR001471 | AP2/ERF domain |
Gene3D | G3DSA:3.30.730.10 | 9.4E-9 | 338 | 385 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 4.51E-13 | 560 | 617 | IPR016177 | DNA-binding domain |
Gene3D | G3DSA:3.30.730.10 | 1.3E-10 | 561 | 617 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 13.037 | 561 | 617 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 1.3E-6 | 561 | 623 | IPR001471 | AP2/ERF domain |
Pfam | PF00847 | 1.3E-4 | 561 | 606 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 5.82E-11 | 705 | 767 | IPR016177 | DNA-binding domain |
Gene3D | G3DSA:3.30.730.10 | 3.3E-10 | 706 | 767 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 11.759 | 706 | 765 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 8.4E-5 | 706 | 771 | IPR001471 | AP2/ERF domain |
Gene3D | G3DSA:3.30.730.10 | 4.7E-7 | 931 | 982 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 4.1E-5 | 931 | 988 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 9.611 | 931 | 982 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 2.94E-7 | 932 | 984 | IPR016177 | DNA-binding domain |
CDD | cd00018 | 5.36E-10 | 1067 | 1128 | No hit | No description |
SuperFamily | SSF54171 | 7.85E-13 | 1067 | 1128 | IPR016177 | DNA-binding domain |
Gene3D | G3DSA:3.30.730.10 | 9.1E-13 | 1068 | 1128 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 2.1E-6 | 1068 | 1132 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 13.828 | 1068 | 1126 | IPR001471 | AP2/ERF domain |
Pfam | PF00847 | 1.5E-5 | 1316 | 1365 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 1.5E-13 | 1316 | 1373 | IPR016177 | DNA-binding domain |
Gene3D | G3DSA:3.30.730.10 | 1.3E-14 | 1317 | 1372 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 1.7E-18 | 1317 | 1379 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 15.567 | 1317 | 1373 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 8.5E-13 | 1538 | 1595 | IPR016177 | DNA-binding domain |
CDD | cd00018 | 2.29E-11 | 1538 | 1597 | No hit | No description |
Pfam | PF00847 | 5.6E-8 | 1538 | 1587 | IPR001471 | AP2/ERF domain |
Gene3D | G3DSA:3.30.730.10 | 3.5E-11 | 1539 | 1595 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 7.6E-8 | 1539 | 1601 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 14.302 | 1539 | 1595 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 9.15E-12 | 1670 | 1728 | IPR016177 | DNA-binding domain |
Pfam | PF00847 | 1.7E-4 | 1670 | 1719 | IPR001471 | AP2/ERF domain |
Gene3D | G3DSA:3.30.730.10 | 2.2E-10 | 1671 | 1727 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 3.4E-4 | 1671 | 1733 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 13.63 | 1671 | 1727 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 2.22E-15 | 1769 | 1827 | IPR016177 | DNA-binding domain |
Pfam | PF00847 | 5.4E-10 | 1769 | 1813 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 15.396 | 1770 | 1828 | IPR001471 | AP2/ERF domain |
Gene3D | G3DSA:3.30.730.10 | 1.0E-13 | 1770 | 1827 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 2.4E-15 | 1770 | 1834 | IPR001471 | AP2/ERF domain |
SuperFamily | SSF54171 | 9.81E-11 | 1901 | 1960 | IPR016177 | DNA-binding domain |
Gene3D | G3DSA:3.30.730.10 | 4.4E-9 | 1902 | 1960 | IPR001471 | AP2/ERF domain |
SMART | SM00380 | 2.2E-4 | 1902 | 1965 | IPR001471 | AP2/ERF domain |
PROSITE profile | PS51032 | 12.141 | 1902 | 1959 | IPR001471 | AP2/ERF domain |
Gene Ontology ? help Back to Top | ||||||
---|---|---|---|---|---|---|
GO Term | GO Category | GO Description | ||||
GO:0006355 | Biological Process | regulation of transcription, DNA-templated | ||||
GO:0005634 | Cellular Component | nucleus | ||||
GO:0003677 | Molecular Function | DNA binding | ||||
GO:0003700 | Molecular Function | transcription factor activity, sequence-specific DNA binding |
Sequence ? help Back to Top |
---|
Protein Sequence Length: 2206 aa Download sequence Send to blast |
MQAWEGGDRD AATKLISAQV PTEQTRITQK QQKGRQARPM SSPFAAAAQQ EYPQDGWGVR 60 WNEELKRWEA QAGPGAAAAL DEPPAEAEEE EADHLDRVGS GDAAGSACSD PANPRPSRRR 120 AVPSRRYSPS AAQQLQDETR GRPGGPSKYR GVIWHKSNSK WEARIYDNGK QRFLGYFTSE 180 EEAARVYDEA AMRIGGRGAR TNFPAGECLS RSSSAPAELL DMGGTSEGPT AAAPPAALPP 240 KGGRLRKKAS SSGTGGLKGS SKYRGVWKGN DVRHLGYFED EVAAARAYDR AVLEIRGAHA 300 PTNFGPEDYG VAVPGPAAAA TDTAEVDSPF LGVSWDAAAG SWKAELWDGR EYALLGHFDS 360 EEAAARAYDR ACLAQHREAA NTNYPPGDYE EEMAAAALIS AVQRMSDDEE EASDLEMSAL 420 EALASISNEA EVDCEGDDAA CTSGRGQGGL QRGDQYMEES PPPELRRRDS GPPFEREAPP 480 SARLRRAMSD PIERIGSLSR RRSARLSDAD AATAAAALAG LFTKPASSEE PAPVVSAAAT 540 NSRGARSVRS GGSADAPKSS AYKGVSWHKH SQKWYAYIQA AGKMRGLGYF DLQEDAARAY 600 DAEARKVHGK KAVVNFRMYP DDVVREPKNR GVSSGSADTS GPSLEALPSA SISIGEDKPS 660 ARPASGPRSR GGRSERLCGK RDRAGSPTSE EVSRGTPRVG GPRSSKFRGV SWHKHRRMWQ 720 VYIHVQSQAR NSYHMGYFAE EIDAAKAYDR EILKVRGKDA VTNFPDSEMS GDAELKSLEH 780 VAAAAGDGHM LGEDDQAGSP TSAQPLTITY NPASADQGAP EDGEASPTCS AFSLGSLPLR 840 KRSRKPKHVH STAETRSPSP PRHPKPPRHD AAEAKRRQGT PLAEEGGMQL RNGEAGRGRR 900 VGSPQKEPWA APSTSGGVNA VGEAGGDVRA SFRGVTRLER ERKWVARVWN GQKQLTLGRF 960 DTDAYDREML RMKGRAAVTN FPADMYGPLV QEVSRSAVLV VACILRATSN ILLQSDVPSP 1020 RRPVAKSSPA GSFALTTIRP ASAATVGNGD AAPGGSQMAL PGSKSTSQYR GVTWNSIISK 1080 WVAVAWDRDA KKARAIGFFD TEEQAAHAYD VEILAYNGPA ATLNFPQSKQ IAAMMNKAPD 1140 ARPTSAGSAV SSTDVVLDLL ASMTPQSTSQ TPGQPPVRQA AGMDQLASFL RSGAPVPPEF 1200 AQALQMMSRG QQPASQPSRS PLQAPSPTSS PSHATAGAPR TADSTRAESV PPGNGALPAN 1260 ADAEETPSPV PSPLQRHVAA NLAASVAASP RSGSAGDGAP SMERGQRVAA RGANTSRFRG 1320 VSLNKASGKF EARIREAGKN HYLGSFSDEE EAARAFDAAA LAMRGRNAVC NFLLDDGPGA 1380 AAQGASPRHT RQVTVRTSPT AAPAAAAFPG PPRSDGPAPN QAPQMRAAPQ GQGVREAELS 1440 RDDIVAGARE SARLHAPHGE EAIKSDQLGS LAEAAVAQER AAGASPCAAA AGDTWPGLAP 1500 PAGGRVVGST VAALRGRVQQ MGGADARAHW PGPGRRSSQY KGVSWSEASA KWRAQCWDGS 1560 KVKYIGYFDG EEEAARAYDT AMLALRGNSA QTNFAAAEYT GEAIAKAEDA VWGQRQHRAK 1620 SEEPTGVEGI KVELAARVRV PSRRVTSPTN AAAHSGRAAP PSFAYHQGTS QYKGVSWSER 1680 SKKWRAQLWH ENKVNHLGFW ELEEDAARAY DAAVSQLRGA GAAVNFPAPG TVRPLVSSRT 1740 ITTCPAGGPS TTVVVEAIPR INVNAKGSSK YRGVRWHERN GRWEARIFDN STGKQISLGY 1800 YEAEEEAARA YDAESIRIRG IHAHVNLRAP SAARPRRTRR RAASKAVSSE EDDEASWPVK 1860 RPRGFNPAIA RRDLQSMAAA AAAIASARPP EPGASKAPRT SCYRGVVWDP DTQYWAVRLA 1920 TRGGERRQFG MFDTEIEAAI AYDAAVLELF GSRTPTNFDS EYGPAGSPLV PVPKRPRTES 1980 SAVARANAAL FLLDTAVMSP SLMGPLAGNL QHQAQSAANL GAEIRRAVQA RLGLPEPAAD 2040 SALTRQLPGG FAPHPQGSSP DKQSFLRGAG AAAGSQRQDE NSGVDVPVRY GVLADAAQQR 2100 IAVGTPPKTG WQPPAGSTQL GWPQGGSDPL QQPPALFAGA GPTHRYIAGK GQDIARELQL 2160 QSGAKGPNAG GGAADLHLPT PLKLPLLPAA AARDRPDSAG IATNH* |
Nucleic Localization Signal ? help Back to Top | |||
---|---|---|---|
No. | Start | End | Sequence |
1 | 1972 | 1978 | PKRPRTE |
Annotation -- Protein ? help Back to Top | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-value | Description | ||||
Refseq | XP_005650330.1 | 0.0 | hypothetical protein COCSUDRAFT_64835 | ||||
TrEMBL | I0Z567 | 0.0 | I0Z567_COCSC; Uncharacterized protein | ||||
STRING | XP_005650330.1 | 0.0 | (Coccomyxa subellipsoidea) |
Orthologous Group ? help Back to Top | |||
---|---|---|---|
Lineage | Orthologous Group ID | Taxa Number | Gene Number |
Chlorophytae | OGCP5740 | 8 | 8 |
Best hit in Arabidopsis thaliana ? help Back to Top | ||||||
---|---|---|---|---|---|---|
Hit ID | E-value | Description | ||||
AT4G37750.1 | 2e-15 | AP2 family protein |
Link Out ? help Back to Top | |
---|---|
Entrez Gene | 17043790 |
Publications ? help Back to Top | |||
---|---|---|---|
|