PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Dusal.0415s00011.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Dunaliellaceae; Dunaliella
Family GATA
Protein Properties Length: 5428aa    MW: 565373 Da    PI: 6.6892
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Dusal.0415s00011.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA23.86.4e-08729762134
                  GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkg 34 
                           C+nCgt +T+ WR + +     CnaCG y+r +g
  Dusal.0415s00011.1.p 729 CANCGTGSTSVWRTDRETGLVRCNACGQYWRTHG 762
                           ****************655559*********997 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF577165.7E-9723772No hitNo description
SMARTSM004010.0012723774IPR000679Zinc finger, GATA-type
PROSITE profilePS5011414.655723778IPR000679Zinc finger, GATA-type
Gene3DG3DSA:3.30.50.101.4E-11725771IPR013088Zinc finger, NHR/GATA-type
CDDcd002025.25E-10728778No hitNo description
PROSITE patternPS003440729754IPR000679Zinc finger, GATA-type
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009846Biological Processpollen germination
GO:0010208Biological Processpollen wall assembly
GO:0045893Biological Processpositive regulation of transcription, DNA-templated
GO:0048655Biological Processanther wall tapetum morphogenesis
GO:0055046Biological Processmicrogametogenesis
GO:0071367Biological Processcellular response to brassinosteroid stimulus
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0008270Molecular Functionzinc ion binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 5428 aa     Download sequence    Send to blast
MASRGRKKTG FPEELEGLSG PFHDNVKRFL LRNAVSVAVS GIEGCSAWVI RLVDLATAEE  60
HNRGGGASSA SPLSSRVLLH VYEERLVEGS NSVCDQCRNM GWQDHPVSNC KYHFIIPGKD  120
VLADPKQLPS LVEAVVTKRG ARNQPATVDL AHLLAKQRGD DTTFNAPTSV YDSPSHYLHG  180
VLHMNGYGHL LRINGKGGGS ERLSGKQIMN IWDGLCTMLC ARQVSVEDVS SKSGMLLRML  240
HAAAHELTWY GKWGYRFGRG AFNIHPDEYH EAVRDVHNAP LSALAHDFQD VDETIPRILG  300
RYASATSGLP ALEHKPHTEQ PQQQQLLPQE RSGHQQQQQQ QQQQQDMELD QGEQQQQQQQ  360
QQQLHLQLPQ QEHKAEQEQE LQVRSPAPDP QKQEQQQREQ EQQDREQQQQ QQQQLVPRPE  420
LHSDASLLSQ QQQQQQLVPH PEPPSDASLH LQQQQQQQPL PRAETLGEVL RRMLDEVSRA  480
GRPRPSIAPP SAPLLPSHIT PANAASAPLA AHSAPQTDAA PPSTTAAQNA RTASSVVPLR  540
SLHGWGILPP SVHHHGGPAT AGDVHMQQNR PPSSRAVRTA AHALTASPSH TARRTLSASA  600
EPGSDAGGLE GEEEVQRNRG AARRHSSRQP HQHQLAGSGD EAGMDGVDRS QMQATAAVAA  660
AGAAPRRVGT GGRLRRPSGS KEEEVGEEAA QQQQQQKQRQ GAGGMPWRKG STMSSRKAST  720
MPPGDQTTCA NCGTGSTSVW RTDRETGLVR CNACGQYWRT HGYERSMDLL KGAQLKSKGA  780
GHDARGPEQQ AQPQEQQHHQ PQPRGTVPSG GMQKAVRAGS HDAGAGGRVV RQSEGMPRRS  840
VEASGEAVAG EGVSVEGQNV ALPRHGCKRG RAEAEGGLSV APDNMAEAEE GAEVRAAEKD  900
GGGLPANEAP GRESAEGQGA DRLSSEQEGG VELGGEGPAS TPQPQPVPGT GGADHGHTEE  960
CAMEGEGEQE GSHAQEVRGQ EVSHAAANAG VALGEELVLG IKGEATRGQK RPLELAVGEA  1020
EVPEQGLTQE RQTRQRVQQQ LVQEQMCEQQ QQQQQQQVCE QQQQQPMQEQ VCEQHQHQHQ  1080
QQQEQQQQEQ QQQREMDQQE QVQLTKSPML GDDCQQQQQQ QQQQFEEQQM HMDVMDPHPH  1140
GPQHSHTQQQ PRLQQRYFPS ALHPEMETEQ HYSPLMPAWT HGTLHQQQQQ QQQQLQEQQH  1200
HHLQQLQQQE QHPVPHPPHP PAHSPSHKQT PEQQQQQQQQ QQHAQPSPPP PSSRISPSQA  1260
PPEQQLQHHH PSSPLAPPPH SSPSQASPEQ QLPHRHPSSP PPPPSRSSPS YAPFEQHLQL  1320
QHNHPPSCPS PFRGPPEQYQ QLQHHYLPSP PNPPSRSSPS HAPPEQHQQL EHHDHPPSPP  1380
PPPAGSPPLE EPPDEPPGDH DSYVPRPLLP SDGGSLAGSV AGSLGAAVEP PGYVPRPLLP  1440
SPPDEHASAS PPSAVHPPPE DQHMQPVDEL KQAAHPGSPH EQEHAAHPSS PRPSLPADES  1500
SGQHPSPTPP PVPPSPLQHQ PQPLMEQQQQ QQQQQLLSGA PQPDAAMKEE AGQQQQQQPP  1560
PQPTPSQREP QPPPAWDAPS LQHSDPYPSE ASMQQQASQP HIQLPPPPPQ PMPAQQQAPS  1620
QAQPQQHTSQ SDLAGEHAPS ATAPDAAMED TVLQEAPPLQ QQQQQQHHHH HHQPAHHIPS  1680
HTTPGLSPIN TSPAAAPMRA TECTAFCATA AAHACSSQDQ VPMETDPAPS DVAADAGHHR  1740
EQAAMQPGTL GGVPQDVTYD AGHPDLAPGN VAADAALHAE QAVTQQGTQG RIPMDATTLN  1800
AGHPDSAPGD VAADSGLGSE QAATQPGTSR CIPMDATPDA GSPEVHLNQG LPSSQAAGQV  1860
RKSGGLQRVK NQVGEEQHAA GLAGKQQLVD GQDGEQQRAQ GLQGKQQRVE DQEGEQQGFE  1920
RQEVEQQQAG GQEGEQQQRE GVQGKQEHVE GQESKQQQEV EQRQAECQEV EKQRAENQGG  1980
HTKHVGARGG EQQQQVEGQR GEQQQVENHG EEQHQQQVEG QGDQQQVEGQ GEEQQHGEGQ  2040
GEEQCHERGT TEPGQLQQSA TAAAEPQQPD AALGLARSVP ALEPQQHAGP TSAVAAAAEC  2100
QPPDAALELA SAAPATEPQL HAEAPSAAAA AAAAAAAECQ PPDAALELAS AAPATEPKLH  2160
AEALSAAAAA AAAAAAECQP PDAALDQAGP APALEPQQHA EPTSSVAAQA AAPPAAKSPS  2220
PEALAAPADP APAKDTLPPP EAAPASAAGA APPAAKSPPP DAPAAPADSA PTTEPHQPSE  2280
AAPASAAGAA PPAAKSPPPD VDPAQTGPAP APDPQQPSEA APASTTRSAP LAAEAHPYAG  2340
PEPAAGKVGA SSVPQQPPDA AAASAAAEAV PPTGFQQPPE AAAAGAKAPQ SPPPTEGLPS  2400
AAEPGAADSG VGEQGPPSTQ QPAAAVAAAA AKPQQVKAKA APKGKGRQQK GKGRKQPATE  2460
PPTQAPPLLQ QQQQQQQQQG PPQEDLHQHE SQQQLQQREP QSESLQQEPQ PEQQQQQQQQ  2520
QQEQAEAAVE EAATGQGPKT GDHLPLQPQS QQGQQQQQVA QALGRAGPDS AELQQEQQQQ  2580
QQQPLSGALQ PDAALKEEVE QQQQQQQQQQ QQGRGGRQTR KAAPKRSRPA PKGAATGSAA  2640
AAAAAAAAAG AAPESEQQHG TDERPPQVQV EAAEPHARGE QQQQQQQQQQ QEQPPQPKRG  2700
GRSHTATQGA PPEEMAAGTL AKQEVRPQET AGGTQGEGDA GPAPFRGAGA RVVAEEAELE  2760
AQAALPTQTQ EQQQQQLKQE GPSGARKKTG SKAQAKSGPL SAEVQRQLDE ELQQQQYPPP  2820
AQLPKFAYPN EMGKRPFTQD RQQKYTQTVH LVLKEIPKAR WIHRSALLAA LIKHGMADKL  2880
ISSLTMTRLC GAVLDDMAVY RCGKTGQIYY KVVQLDALGN PSSGNQSAAE ALAEAAPAAA  2940
VHTPTTASGA AAAGPSARAM SMRRGGGKKS AATEGPSSPA ALPDSCPLSS AAPGVAGGDQ  3000
QQQEQQQQPD TAAHKQSRSS AGGEQHGQQQ QPAAAAHKQS KSCTDRVARA KAEPRKGWRF  3060
SSRFGGEGGG VMDPGDSEAA AIAAVAEHEA KEAAVAAKAN AEAQSTAPQQ LPPRAEEPPG  3120
AEDAAEPTTG ASNGAHEAAP PVPSAPAAEV EVGPVGTAPA PAPAQAAPAT GPAAPTAEGE  3180
EPEAKAAAAT ATATAAAAPA PATAPVATVG GPTQEAGGSD AGGTVDAAGP AQAHQPAAVP  3240
TGPARKGGQR NRSRASAKTS RTSAATRSAG GTQGGAQGGV QGPAAEGGSQ GPATGEGVQG  3300
AATTQGAPAG VQAQEGPERG AAGAPAGLHK DPGAAATGTG ELDAPPAATP ADAAAARAEK  3360
DGIAADHERQ QPLAAAQHGD AAVTQPGASA MPVSDAQHGD AQHGDAAAAA QATAGARPLE  3420
TSAPAPEAGA PAPASDAGGA CAVTETHTAA AAATAHSNAA AAAAAAAAAA AAAVADPEDA  3480
AAQGTQSLTT QEGSAVLPGG EAGAAAPPPP VEAAPVPAAE AAVPPPAAPA AAAAALPPAA  3540
PAAAATPATG SGVAAKEGRS GGSKGGSAKK GGGSRVRRSG GGVRKRAGSS QRQRAPKLLL  3600
QQQQQQQQQQ VVQGTEQPPP QVPANEHEQQ QLQQQPQAVP LAEVVYHPKS RVSKRELQAL  3660
NAHQVRDASQ QYRKLAQRAT NKQQAKLAKQ GARCSSHVST PTHSTQHQPR APPSGQQRQG  3720
KQQQQGKQQL QQQGKQQLQQ QGEQQQQQQQ PKQQQRGNQL QHQQQLQQLQ QLGEQQQPEQ  3780
EHAEPVQQQP GGEDMAQGLK LKQNAEQTMQ QQQQLPPPPP PMVPIGAEAP AGEGAATAAG  3840
ASTDAGGTSM SAAGEAGTGA AGAGVADAGA AAAGTEGVGA AGAGAEGLGA AGAGAAGAGA  3900
AAAGEARAKG PQAEVTEGGP RAADAAGPDI AGAASGGAAG AAGAAGANVL SQPPSAPARP  3960
RQPRPRASGH KSRGRTTPKS RSGVRRVSAA AAPAPAAAAA TPQQVPIKHS VAPPPAVAAA  4020
QAPAAAATPQ QEPMLPVGPP LAVAPAPAEP EHSMQPPSAL HGPVDPKHSM PPPTALEPPP  4080
QPQHSMQPPE ASQSAPEPKR TTQPRFSMRF VPIPTGDIVP KPAVVVQGAP VAAAAAPAWP  4140
APAAAPAAAA TSIGRSTTRA AIRAAAPDAA PADDGAHIED GSMLLRQPQQ QQQQQQRQPQ  4200
QQRQRQPQQQ RKQRQQEPKQ QQQQQQQQSL MPPPPSPSHL VQGGSLPLKQ QQQRQSLMPP  4260
PPPPSRLHQQ APPLAGSHPL QPEHPAHDAA AGRPPPGAAP AHNVALATPP NEAAPPDEAA  4320
PPTQPPISQP PCAGTTAART PGVPLSAPDS THSAEASAPP TQSQHHQPHR HPQQQLEQQQ  4380
RRTERRAERR MQSPRSPHRE HRQERELQQE HRAEQSTQPP QVHQQQRQQQ QQQGQEQQQQ  4440
QGQEQQQQEQ HGVHVMQPSQ FLQQQQQQQQ QGQQEKHSMP PQLPPPPPPP PPPQQQQQQQ  4500
QQRGSQRSRR ATQRWYSEEE NAHAQLCAEK TPAHTPHAPQ GTANTSGAPL LGGSIGGSSK  4560
VPQPPGPPGP PQPPQTARIP SAPSRKIRIF APSARRSAQS SSKPGLPHTP SSTPHTPLGT  4620
ASATPQPHLP SPPAKPPTSE AGAGVGALGT GVSQPSSLEL LAKTSKPRQP KGLAYQPTHC  4680
PQCGTFVKQH RYLERHMKTQ NCLASAQARA QGLQGGALMG EGGGADATPA GPEAASQHAT  4740
HAREGGGGPT AAGPGALVAG PGVPYHPGGK SLAAKIREAA RRNVSKAAAA AASVAAPTAA  4800
APAAAAGESA QEGNKSAGAA PSAGDAGGFP PTAQGLGFVL PPGAVEGSRG VVTGAHGTVQ  4860
RERGSERPLK KRRTKSEASR LVAAAGKAIQ GGAPAGKVVQ GEAPAGTAYR GGVPAGKVVE  4920
GEAPAGTVFR GEAPADTVTQ EAAHEAAVHG DLEGMEGPPK KRRAKSEARA YLGEALAGRV  4980
RIGATEEGEQ GIGVPSKTAS RSGAGGERGL ASRVGIARSE RASARRSAGL AVRDSGRRRS  5040
ARREAAGGVG RRILLGLGSD DDDDGSPARS EGYERVYTHR RTPATAPPAH RGSAAVPSRA  5100
AADGSRTGGV ARVTKGLKGH AIRINDSATP PINPRRASNP RIPAVALAQP EEPSESEGDE  5160
EQTLGPEARR ARTLALLTER HAAGRPAPTA AWRVPPRSIG DPHPFLRSIP LMYLTQASGG  5220
WRRRDEEGKL QGHPPLESDK EGKLQGHPPL REYQPVAAAL AKAGPAAARS VATATAAVSA  5280
AQLSLPQLAS SLQIIKDTKY FVKDYGGPFR VPLPACMDAQ GVVPLRLQCQ VRLPQHMESH  5340
WPALPGIRGH MSRPRLLALT PPEPIIFPAT DVASKRTTLA DFCRWGAWFC MKAPWRAPNS  5400
PQNIVSPVTG IVSKKTVHAD FACAQWH*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
149594964KKRRAK