PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cre02.g100100.t1.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family AP2
Protein Properties Length: 2619 aa    MW: 257188 Da    pI: 7.4624
Description AP2 family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Cre02.g100100.t1.1  genome  JGI     View CDS
Signature Domain
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    AP2     40.3   7.8e-13  878    927  1          55
                 AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                         ++y+GV++++++ rW+A+I++      r++++lg f  +++Aaka++ ++++ +g
  Cre02.g100100.t1.1 878 PRYRGVTRHRRTRRWEAHIWEE-----RRQVYLGGFEVEDQAAKAHDVMALRCRG 927
                         68*******************6.....6**********99***********9998 PP
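
The Start and End values of the AP2 hit are 1-based, inclusive positions in the protein. As a minimal sketch (the function name `slice_region` and the choice to work on a fragment rather than the full protein are assumptions made for illustration), the following checks that slicing residues 878 to 927 out of the sequence reproduces the query row of the alignment once its gap characters are removed. The fragment is residues 841 to 960, copied from the Protein Sequence section of this record.

```python
# Residues 841-960 of Cre02.g100100.t1.1, copied from the Protein Sequence
# section (two 60-residue lines, blocks of 10 separated by spaces).
FRAGMENT_START = 841
FRAGMENT = "".join(
    """
    TTPATPPGAT AAAAPATAAA APATAGVSAT AAASGVSPRY RGVTRHRRTR RWEAHIWEER
    RQVYLGGFEV EDQAAKAHDV MALRCRGPDT VLNFLPDTYI ELQPLLLPQQ GRRPLHRNEV
    """.split()
)

# Query row of the AP2 alignment, gap characters ("-") included.
ALIGNED_QUERY = "PRYRGVTRHRRTRRWEAHIWEE-----RRQVYLGGFEVEDQAAKAHDVMALRCRG"


def slice_region(fragment, fragment_start, start, end):
    """Return residues start..end (1-based, inclusive) from a fragment whose
    first residue sits at position fragment_start of the full protein."""
    lo = start - fragment_start
    hi = end - fragment_start + 1
    if lo < 0 or end < start or hi > len(fragment):
        raise ValueError("requested region lies outside the fragment")
    return fragment[lo:hi]


domain = slice_region(FRAGMENT, FRAGMENT_START, 878, 927)
ungapped_query = ALIGNED_QUERY.replace("-", "")

print(domain)
print(len(domain), domain == ungapped_query)
```

Passing a fragment together with its starting position keeps the example short; with the full 2619-residue sequence the same function would be called with `fragment_start=1`.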

Protein Features
Database         Entry ID            E-value   Start  End  InterPro ID  Description
PRINTS           PR01217             1.8E-9    403    415  No hit       No description
PRINTS           PR01217             1.8E-9    495    516  No hit       No description
PRINTS           PR01217             1.8E-9    538    555  No hit       No description
PRINTS           PR01217             1.8E-9    556    581  No hit       No description
CDD              cd00018             9.51E-15  878    934  No hit       No description
Pfam             PF00847             6.9E-9    878    927  IPR001471    AP2/ERF domain
SMART            SM00380             1.2E-21   879    941  IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10   1.0E-13   879    934  IPR001471    AP2/ERF domain
SuperFamily      SSF54171            3.53E-12  879    935  IPR016177    DNA-binding domain
PROSITE profile  PS51032             16.792    879    935  IPR001471    AP2/ERF domain
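
Several databases in the table above report hits over nearly the same residues. One way to see how many distinct regions they cover is to merge overlapping Start/End intervals. The sketch below is only an illustration: the helper `merge_overlapping` is a hypothetical name, the coordinates are copied from the table, and treating intervals as overlapping only when they share at least one residue is an assumption (directly adjacent hits such as 538-555 and 556-581 are kept separate).

```python
# (database, start, end) copied from the Protein Features table;
# coordinates are 1-based and inclusive.
HITS = [
    ("PRINTS", 403, 415),
    ("PRINTS", 495, 516),
    ("PRINTS", 538, 555),
    ("PRINTS", 556, 581),
    ("CDD", 878, 934),
    ("Pfam", 878, 927),
    ("SMART", 879, 941),
    ("Gene3D", 879, 934),
    ("SuperFamily", 879, 935),
    ("PROSITE profile", 879, 935),
]


def merge_overlapping(intervals):
    """Merge inclusive (start, end) intervals that share at least one residue."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


regions = merge_overlapping([(start, end) for _, start, end in HITS])
for start, end in regions:
    sources = sorted({db for db, s, e in HITS if s <= end and e >= start})
    print(f"{start}-{end}: {', '.join(sources)}")
```

Under these assumptions the six non-PRINTS hits collapse into a single region spanning the AP2 domain, while the four PRINTS hits remain separate regions.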
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003677  Molecular Function  DNA binding
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
Sequence
Protein Sequence    Length: 2619 aa
MDLDMHFDAM EVDTAVAAGG GGGEQEMSQA PAPAPPPPQQ YHQQQGQQPQ HLQQQVEHAQ  60
DMQPGPYSGV RLHPQPPQQL PAGPRQQQQQ LQQQHYHYHQ QQQQQQQQYP QQQQQPHLQQ  120
LVMEAVPLGA RRGGGRGSNS SGASASGSSW AVGCGTAGAA GAGAAGGGAG GGGAVPAADN  180
NNRGSAADMT TMLQLNRYGS TAATQLTSAG GGPGGGGPGG GANVGYGYGF DGAGAGGGGG  240
GWVEASSPPP SGQGGGAAGD NGRGAARPPG SSAFHHHAAA AAVAAATAAT AAAAATAVAA  300
APAAGTTAPG MAAGGGARFQ SYGYLPSQAQ AAPQPPAAAA GYPPAPPPPL HRHHSEGALL  360
APGPQPQPSG PSPAPHRYLQ PPPQHPHYPA PWEQQQPQQP HAAHGGYRPS PSPVPPSLPR  420
QHSFPRAPSA GQLPGGNPTN QHPQQQAQHR PHHHQQHQQQ HQQQHQQHQP AQQVQQPHQP  480
PYYAYSQPPP GYSQPPPQPP QPPQPPPQQP PQPPQQPYHS NPYYQHHRQQ HQQQSHTPPP  540
PQPPQSAYAP PLPQPPPQPP PQPPPQPPPQ PPPQPPQPPQ PPTSPFPHFM SPPPPVPAPL  600
PPLPTWSPAA SPPAFPRSGS GFAAGVGLDR TLSGAARGGG GGGGGRQLPP SPLAGMRAPP  660
VPPELQQEMA ESFLRALALI MARPATRAPT LGNVSSSLAI AIDVKSRRVA EPAAAAPAAA  720
AAGTTGSAAG DGGGGGGGAA VKAEGGGGSV DTVGFRTVTT ATTGAAGGGG GGGNIGSAAG  780
GGATAAGGSG NGSSGRLQQL VGDVPSPPAP SPSPGSKPPP AASTPAGGPA AAAAAAPATA  840
TTPATPPGAT AAAAPATAAA APATAGVSAT AAASGVSPRY RGVTRHRRTR RWEAHIWEER  900
RQVYLGGFEV EDQAAKAHDV MALRCRGPDT VLNFLPDTYI ELQPLLLPQQ GRRPLHRNEV  960
VRLLRSYGKE VTRLAHTAAR PPGAAAAGGG GGAPGASAAA AAARQGSADS GGGAGPVEPP  1020
VTIFGDDECD VYCFMPGTLK AQALAAAGSS PPGGGVGGIH RSASVGGVLG GGAAATGGGG  1080
WGAGLSPWSP KAAVVAAAAA AAATAAAGTP ASPSTLLSAG LLSPGGGARA FGGAGGGGGG  1140
DGGDGNGGGG GALPPLTPLS PDWAEALASL GVGGGATGLW GLPGGGGGGG SLNGLSGLLP  1200
LSDTLDAITS TAAAVEDPDM LDLQYRQTVR DMARLQQQQQ QQHVLQPEQQ QQQPHMLQPS  1260
GELQGQQQQQ QQQQQQQQQY GGHQGHQGQG HQGQAAPLQR HASGAAAPQP PPQPTTQPTT  1320
QPPPQPPPPA PRPPQPPQPP PLFSSFSQPQ SSLVGLLESP RHFLVDPYGT PWGGHSYSYS  1380
PATAAAGSGT HTNPQYSHPQ QQQQQLKAEA PQQQQQQQPY QQQHQQQPGP HMPSPYTPHA  1440
GSGRPQPQTH PFGIGNGTGS GSGASGRGSS PPRQPPEARR LPAPAATPPP PPAWAPPGAQ  1500
PPPQPPLQPF SSQPYTDPWP PYMPPQYGQR GSGGPAVAAA PAAVAASGGG GGTTLLGHHS  1560
APAAQLSLFA AVAATAAAAA TATAGAEQAP AASSAVRRAA ELLLDFGGDG GGADGGGADA  1620
GAGAAAAGAP AATGGGGGGC RVEFEPISWS AVDARRPGSD GGEQPAEQAP VVGGAAAGEG  1680
SGGGAASDAD ASAGATANGD GLDGSMEAFW EGLAAAAAAA IRRQSSRSDG GASGGGAGGA  1740
ASGNGNGNVG GGGGGGGGST GSGGASFWDD FFASQLLMTP GPTDGGRAPS LPPPAVAPAP  1800
AAAPAVAVGH SPEVGGGGEQ MAGGGDRAAA APGSPAAAPL PQAPAPVQLL PPPPQPWRSI  1860
SDVGQLTAAA GLHLQPYTCA AAATATATAA AAGPGGATPP PHDGGMAPAG EDAAAAGRER  1920
TAFSHLPPPP PQQQQPQEHP PQPPPWLHAP QPPAHYPQQH NHQQHPHQLQ PQGSASGALQ  1980
VDMEEAEDAG EWGHGPAGPA AVSLAPPAAS APAAAGGNDT AGAGWGGGGA ARWYPQPPPQ  2040
LQPQQPQQQL QPTPHPQLQP TPGGLLRGGT VWRFGGSTQQ PGTHTPSHSD GAPAASSHAA  2100
ATPLPPLPPL PPLPPRPASA AGGAATAATA APAAAAFILS PRAAQHFSLS LPSGDAFNSY  2160
TSAAAAAAAA TPPARSASPA AARSAGDAYT IAAATPTTAA AAVAGGAGTS APAASAVRRS  2220
RRNSSGGGGR NSAVSGGGSG GTGGAVDPGG TAGDEAASVG GGGGGGARGG SGGATRSRRS  2280
GSGNDSLAAA GSAGRMGAEA EVGTSDAGGA AGTSSGRDGG GAAGTRKRRA SFMTARSPPP  2340
SLPPPMTTAA AAAPAPPLAP HMPGSGPGPL APPVPPPATP LASAPPLPPP SHAFLSHYTP  2400
APPPTHLAAH QWPPPQPPPG PQQQPQQQQQ QPGAAAWGSE SKEARTPSDG SAHSTHITHT  2460
PPLPYPQQHP QQQPQLYPHL AHMPPQPPQP QQQQQQPQPQ PPQPLQPPQP PLSPADMYGR  2520
ALYGGSYGGG GGVYGGGGGT LMQPMPAAAG VEAAPTGGGS ALPRNGGGGG GRRTVTTPSP  2580
TTPGGGVGVS DSGVQQQTPP QQQQQQQQQQ QEHWFPQF*
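
The sequence above is laid out in blocks of 10 residues, 60 per line, with a running position counter at the end of each full line and a trailing `*` on the last line (presumably the translation stop). A minimal sketch for turning that layout back into one contiguous string follows; the function name `parse_blocked_sequence` is hypothetical, and the sample holds only the first and last lines of the record, so it is not a contiguous stretch of the protein.

```python
def parse_blocked_sequence(text):
    """Join blocked sequence lines into one string, dropping the position
    counters at line ends and any trailing '*' terminator."""
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # running position counter, not sequence
            residues.append(token.rstrip("*"))
    return "".join(residues)


# First and last lines of the Protein Sequence section (not contiguous).
SAMPLE = """\
MDLDMHFDAM EVDTAVAAGG GGGEQEMSQA PAPAPPPPQQ YHQQQGQQPQ HLQQQVEHAQ  60
TTPGGGVGVS DSGVQQQTPP QQQQQQQQQQ QEHWFPQF*
"""

sequence = parse_blocked_sequence(SAMPLE)
print(len(sequence))
print(sequence[:10], sequence[-8:])
```

Applied to all lines of the section, the same function would yield the full protein as a single string suitable for slicing by the coordinates used elsewhere in this record.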
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    636    645  RGGGGGGGGR
Cis-element
Source       Link
PlantRegMap  Cre02.g100100.t1.1
Annotation -- Protein
Source  Hit ID      E-value  Description
TrEMBL  A0A2K3E275  0.0      A0A2K3E275_CHLRE; Uncharacterized protein
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
ChlorophytaeOGCP561657
Representative plantOGRP4971784
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G39250.1  7e-09    AP2 family protein