PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Dusal.0066s00042.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Dunaliellaceae; Dunaliella
Family AP2
Protein Properties Length: 1418aa    MW: 146504 Da    PI: 7.5781
Description AP2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Dusal.0066s00042.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP232.42.3e-10538601155
                   AP2   1 sgykGVrwdkkrgrWvAeIrdpse.ng......kr..krfslgkfgtaeeAakaaiaarkkleg 55 
                           s+ kGV++++ + rW+A+++d s  ++      ++  k+++lg f t+ +Aaka+++a+  ++g
  Dusal.0066s00042.1.p 538 SKRKGVTRHRHTLRWEAHLWDSSApRKvtgkggRTrgKQVYLGGFSTESDAAKAYDRAAIVYWG 601
                           688******************6667556666773357**********88**********98876 PP

2AP243.76.7e-14645696256
                   AP2   2 gykGVrwdkkrgrWvAeIrd.psengkrkrfslgkfgtaeeAakaaiaarkklege 56 
                            ++GV+++  +++W+A+I   p   g +k+ +lg+f+t  eAa a+++a++k +g+
  Dusal.0066s00042.1.p 645 TFRGVTRHNLQNKWEARIGRiP---G-SKYLYLGTFDTQLEAAVAYDHAALKHRGH 696
                           69****************9999...3.5************************9997 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
CDDcd000188.22E-13538611No hitNo description
SuperFamilySSF541712.68E-10538607IPR016177DNA-binding domain
PROSITE profilePS5103215.145539609IPR001471AP2/ERF domain
SMARTSM003801.5E-17539615IPR001471AP2/ERF domain
PfamPF008471.4E-6539601IPR001471AP2/ERF domain
Gene3DG3DSA:3.30.730.101.4E-11541607IPR001471AP2/ERF domain
SuperFamilySSF541713.79E-15644704IPR016177DNA-binding domain
CDDcd000184.05E-17644703No hitNo description
PROSITE profilePS5103215.83645703IPR001471AP2/ERF domain
SMARTSM003801.2E-19645709IPR001471AP2/ERF domain
Gene3DG3DSA:3.30.730.106.5E-16645704IPR001471AP2/ERF domain
PfamPF008473.9E-7646696IPR001471AP2/ERF domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003677Molecular FunctionDNA binding
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1418 aa     Download sequence    Send to blast
MYPSSGAAPL TTGSQGLAAP ILKACSTQNT PATPADAAPA GTGAHAGATA PASAGTHAAG  60
ASAPASVAPD GAPVNTGTGT PAAGASAPAG AVAPIGSGAP AGAAPADMGV PAAGATTPAG  120
SVAPAAPAVG TAPANPGTPA ATSAGIAPPA GATTPAGSGV HASATGPPAG PALQLPTGVP  180
RPRPKSRLGA GAPAEHAAVR DVSSAIPHLM QGHPFHHLLH PQQWQQPQLQ RQLQRHQHPL  240
LRQQPQLQRQ LQQQQQWQQL QLQQQLQQQQ QHPQQWMQGR VGTSLEDCWP EWFKAWIRAA  300
QNPSTAEENG GGVPEGANQG PHAPEQMSTR GGDGSRMESV RPQQPHHHHP QQQQQQQQQQ  360
QQYKVEATDT EGSLENLGQA QALQQQQQQQ QQISFPCFGP AADSSTAGTG ASPLLPACVF  420
NTGSGMGGFS ATGCGLGGSV GADGGAYPIM NISIGGGGAA ANGGDCDMQE QGEGDGSDEG  480
DSESGSGVRR GRKGKQTAKG ARGKKPVGCA AVKLRAGGGV QKHRSNQRIG PRPEGGTSKR  540
KGVTRHRHTL RWEAHLWDSS APRKVTGKGG RTRGKQVYLG GFSTESDAAK AYDRAAIVYW  600
GTTAQLNGNL SDYEDELDVL MSISREEVVQ MLRRNSSGFT RGVSTFRGVT RHNLQNKWEA  660
RIGRIPGSKY LYLGTFDTQL EAAVAYDHAA LKHRGHRAVT NFPRCNYFDP GNQLIDLADA  720
QVAAPHWPRP NLPANASRSS AQVDGEDGEE GCEEGSVPSS SHAPQQQQRQ QQMDLPSASP  780
RQRPPRPQRA QRRRAQPVPS DTTSSGGSTT EVEEEDEDES TVSEEWSGAE ECTQQRAKLR  840
VRKTGSSGGS DVQAGVVAAA AAAEAMDEGK LGGEKEREVS NGEGGGMDLE CGAEWWEREG  900
RGCVRRSMRG SAKAARVMEM GTAGAGASAR RNSQGQQHGV KEEEEGEDGG AVCRTRSLQQ  960
HQQQGQRRLA VPQVGVAVAE PGGASSGHGL PAVPPVVVQH TETAEQDDLV WALGKPHAQH  1020
QHLQQQQQQQ QQQQQQLRFG TRLASAAVGD SVGSCMPSFG GPLSAAAASI TVAAAAAAPC  1080
APLPVSGPPA AASVSVAAAA ADADAGAPCA PLRAAAGPPA GLAIRPIRTS EAQAKQPICA  1140
PICTSEARAQ QLCCPSSMQL CAPLSEQPTL PQQKGQPAGS AHTQEEEELL LMHQHHLLGL  1200
PGSNSTPGSS ASVLLPSFWE GLGSPKSANL LGSLLSPFSG GAAANALSPL HHFSPRLSPS  1260
LLLSPRFSPS LLFSPRHAYL TPPHSSSGAM GAPAAQAALP HFSVSSVPPA AAAQPTQAPS  1320
AAATVELRGN AGASKISAAK TPTTPPFQDS TLPTLPTLGQ LMDREVMELL HGLSPTYKPL  1380
PSPFSSTAQR GITEATANPL SAVDPCSSHD TISLYKR*
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP561657
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G72570.19e-54AP2 family protein