PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cre08.g383000.t2.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family AP2
Protein Properties Length: 2189aa    MW: 211491 Da    PI: 8.2837
Description AP2 family protein
Gene Model
Gene Model ID         Type      Source    Coding Sequence
Cre08.g383000.t2.1    genome    JGI       View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      AP2       41.9     2.4e-13    887      936    1            55
                 AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                         s+ykGV+ ++++gr++A+I++       ++ ++g++gt  eAa a+++  ++l+g
  Cre08.g383000.t2.1 887 SRYKGVSLYRRTGRYEAHIWHE-----GRQLHIGTYGTDLEAALAYDRVSRYLRG 936
                         79*******************5.....4**********99*********999998 PP

Protein Features
3D Structure
Database           Entry ID             E-value     Start    End    InterPro ID    Description
Pfam               PF00847              1.9E-8      887      936    IPR001471      AP2/ERF domain
CDD                cd00018              2.23E-11    887      944    No hit         No description
SuperFamily        SSF54171             3.07E-12    887      945    IPR016177      DNA-binding domain
PROSITE profile    PS51032              14.302      888      944    IPR001471      AP2/ERF domain
SMART              SM00380              3.3E-13     888      950    IPR001471      AP2/ERF domain
Gene3D             G3DSA:3.30.730.10    2.6E-12     888      945    IPR001471      AP2/ERF domain
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0003677    Molecular Function    DNA binding
GO:0003700    Molecular Function    transcription factor activity, sequence-specific DNA binding
Sequence
Protein Sequence    Length: 2189 aa
MTTTPQQPQQ QPQQQQQQHR QRRHSAAAAE QPPTPPRATA ATSTCCPPPL AAEAPAPATA  60
VVAAASRAAA AAAAGATAAA ANAVAAATAS MAASAALERQ FKVAVPDGHG VAAPAAPAAP  120
QQPPTPAAAA NIAPSPRVVS SNAAKPAAAA PAAASAAGAS GGPAAAAPTA AAAAAAVAAA  180
AAAAAAAAVT AGSNGSSAPT AVASSPVPAA AAGGSSSSSR QQQPRQQRCE EPRRERPQAV  240
PPLARASAGG CAAAAAATTA AARSTPAVFT SPPAPLRLSP RGPPPPPPLP QLPLPSPPAQ  300
LPPMPSPTKP RQNGSTGSGS GSGGGRRRGG GSPAATAATT AAAAAAEAAA AALQVAATSP  360
AVRAAGVAHR RAQLLRTSCV IVGSGSSSGG GGGGGSQQQL LLLGRMSHSR ESSGGGAPPT  420
IAAATALPAA AAPAAAGATA AASSAVAHGG VVSSPGGGRE AAGLWDWAVA APSPQRQQQP  480
QPLPGPAAAA EVAVAGAGAG GRDSASVSTP STESDEGAAQ WRAGSGGGRG GSVGRRRPSE  540
QQRPSRDEQQ QQQQQQPPQQ QQQPRPEPEE EEKAAAVRPP QPAAAIPPTT SRREQQQQQQ  600
QQQQQQQPST PLPALPQQPQ LPWSAALPPH PCWMPSAAAA AAVPAGGSDG ECGGASGGSD  660
GAVAELLAGW HLEEATSTPS RPPRPLLTVL LPSSCGSPGD AAAAAAAAVA AASAGAADTK  720
ATKAATASAT APAAPAPPQP LPLPPSPPQL TPVRRRWSAP GGEQEGEAQP QPQRKAAGCA  780
GAGGGDGASA TASAATAPAA ASAVAPAVAP ATAAAGAPRR QLLQQCPARP AAAGPCPAAP  840
AAPRSDAAAA TAAATAAAAA TALPAATAAA AAATAGSLAR GPRNRSSRYK GVSLYRRTGR  900
YEAHIWHEGR QLHIGTYGTD LEAALAYDRV SRYLRGAAAV VNFPSEAAAA AAITAAAASE  960
SAAASRDTAA TAASPQAAVT VRRDSAAAHP PSPSPGHTAE GPAAADEAGG GGGSGGASAG  1020
GGGGRGAGGG KPWPATLLDS IPAVASSAAA AAAGGGMPAP AAAAAVDEDA LLAAPCGGGG  1080
GGGRCVRRRI SALEPRSTVS PGVLQPPPPP AVPAAAAAPV SRQHTAAAAT THMMQQQQTV  1140
AAAMVVAASA AAAAAVAGAA HQAAAAANGG AAAAAAARRR RDSGGGAADI VTDGAAATSS  1200
AHARAAAAAC GASAAARPGP QAVSGGGGDG DDGHNAAATA AAAATGQGVI SQVTGGSAPP  1260
LQPWRPAATE VLDCNGCDGG GGDGDVDSSS ARASCWVARG LPGIDVAWRH SASAGAGAGG  1320
DHDDCCAPTD AVMLPAAVSA APHAFLPSTP QQHPQHPQRP QHVAAAAAPL LPPLAAVAAA  1380
AAAPARSFLR HSCPPDHPAA AAAAAAPSHQ DIQQPHHHHH HQPLTRSGSP PPPDPSSGGN  1440
SRREVAAALA ALLAAGVPPD FLAQVISRRQ PQAAQHELQR AHTQQLAAVD CHHMHLLLQP  1500
CPSSSHPPPP PATAAAAATT TPSPTTTRPT AATGKDAAIA AILKSLMGPQ QRPTPPPPPP  1560
PPPPPHAMPY PPAPAVKGPP PPGPPPPSPA LPPLQSNSGW CPSSRNSSSH PAATATAMAS  1620
ATATAIATGA GSGSADGASL DRSAVSSAAA AAAAATATGG SVGTCGAGGG GRGGGEEEEG  1680
SGLRRHVMTW VGGQLPGLRG GLLGGALLAA AAPPPPPTPA VAAAPATAMA EEPAVASGVS  1740
PDSIRSLLLL QRRLQQQQQQ VAAPPAEQQQ AGRPPCAPAA AAAAAAAADA DADMVDAVSS  1800
PRVAEAAEAQ PAVAAAVVCR RRHTTTELSY TPAGGFAAGA AAAASGSSYA REPPRQRPRL  1860
TSHYEEEQPQ QGQQRQQGQE QQQQPPLLPL LLLPTRRHTA PVWAHGGGGH ENGGRENAGG  1920
VSGGGGGGRE FGFCAVDEAD EELLCGGGGG GGGDGGSPLA LHWRQLQAMQ LQAQQAQQAQ  1980
QAQVSWSLGP QRVLELARGG GGGGGGGGGL LMPLSPSHQQ AQAQEQAQGW LLGQLLQQQG  2040
QQPQEGTPCS PTAVGGAAAA AAGGKRPRED ACYRSSADGV EALGLRLGLG GVGLGAGYAG  2100
AAASGMDGME SGGWEDAGQQ APLSDDVDAA SGEAAAPPAS AAPAAPEAPE AAAADWAPAP  2160
AARGHPDRKT RLLYGHQHHQ HHHLRPQL*
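
The coordinates in the Signature Domain table can be checked against the block-formatted sequence above. The following is a minimal sketch, not part of the PlantTFDB record: the helper names `parse_blocks` and `extract` are hypothetical, and it assumes the Start/End columns are 1-based and inclusive, which agrees with the alignment shown for residues 887 to 936.

```python
import re


def parse_blocks(lines):
    """Join block-formatted sequence lines into one residue string.

    Keeps only uppercase letters, which drops the spaces between
    10-residue blocks, the trailing position counters, and a
    terminal '*' stop marker.
    """
    return "".join(re.sub(r"[^A-Z]", "", line) for line in lines)


def extract(seq, start, end, offset=1):
    """Return residues start..end (1-based, inclusive).

    `offset` is the sequence position of seq[0]; it lets the function
    work on a fragment instead of the full-length protein.
    """
    return seq[start - offset : end - offset + 1]


# Two lines copied from the record above; the first residue is position 841.
lines = [
    "AAPRSDAAAA TAAATAAAAA TALPAATAAA AAATAGSLAR GPRNRSSRYK GVSLYRRTGR  900",
    "YEAHIWHEGR QLHIGTYGTD LEAALAYDRV SRYLRGAAAV VNFPSEAAAA AAITAAAASE  960",
]

fragment = parse_blocks(lines)
domain = extract(fragment, 887, 936, offset=841)
print(domain)
```

With the gaps removed, the extracted string should equal the Cre08.g383000.t2.1 row of the AP2 alignment in the Signature Domain section.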
Nuclear Localization Signal (NLS)
No.    Start    End     Sequence
1      383      395     SGSSSGGGGGGGS
2      1022     1029    GGRGAGGG
Cis-element
Source         Link
PlantRegMap    Cre08.g383000.t2.1
Annotation -- Protein
Source    Hit ID        E-value    Description
TrEMBL    A0A2K3DI67    0.0        A0A2K3DI67_CHLRE; Uncharacterized protein
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G39250.1    1e-10      AP2 family protein