PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID 93058
Organism Ostreococcus lucimarinus
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; Mamiellales; Bathycoccaceae; Ostreococcus
Family GATA
Protein Properties Length: 1330aa    MW: 151248 Da    PI: 5.3232
Description GATA family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
93058          genome  JGI     View CDS
Signature Domain
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    32.1   1.6e-10  293    328  1          34
   GATA   1 CsnCgttk..TplWRrgpdgnktLCnaCGlyyrkkg 34 
            C++Cg +k  Tp +R gpd+ ++LC aCGl+y+ +g
  93058 293 CVQCGISKeeTPKMRLGPDKRRSLCTACGLFYACMG 328
            ********************************8776 PP
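
As a rough illustration of how the Start and End columns relate to the protein sequence, the sketch below slices the GATA hit out of residues 241 to 360 of the sequence listed in this record, assuming the coordinates are 1-based and inclusive. The helper name `slice_1based` and the regular expression for the C-X2-C-X17-20-C-X2-C zinc-finger spacing are illustrative assumptions, not part of PlantTFDB.

```python
import re

# Residues 241-360 of the protein sequence in this record
# (the two display rows ending at 300 and 360).
SEGMENT_START = 241
SEGMENT = (
    "IVKHAPIPVVKEDELDEEEKTRRAFRKQFKENKFQNWEVDKIDVLDGEIASHCVQCGISK"
    "EETPKMRLGPDKRRSLCTACGLFYACMGHTYRPMGFADNLNSTVEFKDLQINGYVEVKEE"
)


def slice_1based(start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from SEGMENT."""
    last = SEGMENT_START + len(SEGMENT) - 1
    if start < SEGMENT_START or end > last or start > end:
        raise ValueError("coordinates fall outside the stored segment")
    return SEGMENT[start - SEGMENT_START : end - SEGMENT_START + 1]


# Assumed spacing for a GATA-type zinc finger: C-X2-C-X17..20-C-X2-C.
GATA_FINGER = re.compile(r"C.{2}C.{17,20}C.{2}C")

domain = slice_1based(293, 328)
print(domain)
print(GATA_FINGER.search(domain) is not None)
```

The slice should reproduce the target row of the alignment above once the lowercase residues are upper-cased.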

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End   InterPro ID  Description
SuperFamily      SSF57716           3.42E-6   287    341   No hit       No description
SMART            SM00401            1.1E-4    287    341   IPR000679    Zinc finger, GATA-type
Gene3D           G3DSA:3.30.50.10   2.3E-7    292    328   IPR013088    Zinc finger, NHR/GATA-type
Pfam             PF00320            1.2E-8    293    328   IPR000679    Zinc finger, GATA-type
CDD              cd00202            2.04E-6   293    333   No hit       No description
SuperFamily      SSF63748           4.51E-5   762    805   No hit       No description
SuperFamily      SSF47370           3.27E-23  867    987   IPR001487    Bromodomain
Gene3D           G3DSA:1.20.920.10  2.2E-24   872    981   IPR001487    Bromodomain
SMART            SM00297            5.7E-17   873    984   IPR001487    Bromodomain
CDD              cd04369            1.54E-23  876    977   No hit       No description
PROSITE profile  PS50014            16.57     893    965   IPR001487    Bromodomain
Pfam             PF00439            1.5E-11   894    969   IPR001487    Bromodomain
PROSITE pattern  PS00633            0         898    957   IPR018359    Bromodomain, conserved site
PRINTS           PR00503            2.0E-6    910    926   IPR001487    Bromodomain
PRINTS           PR00503            2.0E-6    946    965   IPR001487    Bromodomain
SuperFamily      SSF47370           1.31E-23  1166   1291  IPR001487    Bromodomain
Gene3D           G3DSA:1.20.920.10  1.9E-25   1174   1283  IPR001487    Bromodomain
SMART            SM00297            6.8E-22   1176   1286  IPR001487    Bromodomain
CDD              cd04369            9.03E-22  1179   1281  No hit       No description
Pfam             PF00439            2.8E-13   1195   1271  IPR001487    Bromodomain
PROSITE profile  PS50014            16.183    1197   1267  IPR001487    Bromodomain
PROSITE pattern  PS00633            0         1202   1259  IPR018359    Bromodomain, conserved site
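
Because several member databases report overlapping hits for the same stretch of the protein, it can help to collapse them into non-overlapping regions. The sketch below does this with a plain interval merge over the Start and End values of the table above, as read from this record. The function name `merge_intervals` is hypothetical, and the merge rule (any overlap joins two hits) is an assumption rather than anything PlantTFDB or InterPro prescribes.

```python
from typing import List, Tuple

# (start, end) pairs copied from the Start/End columns of the table above.
HITS: List[Tuple[int, int]] = [
    (287, 341), (287, 341), (292, 328), (293, 328), (293, 333),
    (762, 805),
    (867, 987), (872, 981), (873, 984), (876, 977), (893, 965),
    (894, 969), (898, 957), (910, 926), (946, 965),
    (1166, 1291), (1174, 1283), (1176, 1286), (1179, 1281),
    (1195, 1271), (1197, 1267), (1202, 1259),
]


def merge_intervals(hits: List[Tuple[int, int]]) -> List[Tuple[int, int]]:
    """Merge overlapping inclusive intervals into sorted, disjoint regions."""
    merged: List[Tuple[int, int]] = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


regions = merge_intervals(HITS)
for start, end in regions:
    print(f"{start}-{end} ({end - start + 1} aa)")
```

With these inputs the hits collapse into four regions: one around the GATA zinc finger, one short SuperFamily-only hit, and two bromodomain regions.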
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0005515  Molecular Function  protein binding
GO:0008270  Molecular Function  zinc ion binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1330 aa
MAREEELTEY ERERQAKIER NKALMERLAL AKLASEAGIG GEGDGPGRGR GRGRGRGRGR  60
GRGPRVKREP ARQSSRVEKL RKESRLTSWR GTRIERPLRT HWAFEVDADV GSLGTATEED  120
RGLNGRLVPK NYFVEGYCGD EPYAAWTDDV GWHHLIWGND RVKENKRHVR TDEKGSATAS  180
GAAVIILQEC ARRRRRLYED FIVERAEKAL DLLMGSPTPS WYRPYYPTYI DERARALWAP  240
IVKHAPIPVV KEDELDEEEK TRRAFRKQFK ENKFQNWEVD KIDVLDGEIA SHCVQCGISK  300
EETPKMRLGP DKRRSLCTAC GLFYACMGHT YRPMGFADNL NSTVEFKDLQ INGYVEVKEE  360
KNVEVTDDAE ARCERVAVTL SVSDAGDLVL NVDVVEDDRM EVDVAADDGL SRYERTEKDW  420
ERHSEPLSTQ YPAVNRFLPG APVSTSTMLM RTTRGVVEDY PIDAVDWSSL PHFTKLVTLA  480
STGLITNRQV MADELPEERL YQRAVPYAAH SHLKRMVETG RYEKYQLNLT ESERYKPVEY  540
EWKPDQIDGD TLFGYSEEQV QDSLEAHMER KVEAAAMDDA DETKREMLAN IALEDLDAKR  600
TFEYQVLRSI ESFEKAAEKS ERDAERERER ERRRRDAEEE EVRREAEMEL HGGRRVVCMK  660
ESIGYSAEDL AAYEGEEDIP DPIAITCAGH VGMLQTMCKS RAERIHDVAS DKIVGGGEFE  720
RMSGCGSSKK WRNSCRVLNP DDTPGITIGA HLVERGDEKG DDVIGRRIAV WWATEQLFYL  780
GNVEAYNVAN GDHSIRYDDG QLEDVALCMQ RVKWLDSDVL VPNDEEEAED EKAPSRLHVR  840
PPGVAGSEEA TVRVPLGGFS VYTRLKPSTT TMKHSERRKC FEILQAVRKV EADGRSLSEP  900
FERLPSRFTL PDYYEIIKCP VDCAAIERML RKSMAGYPNV WFFLVAMELM FTNCQRFNDP  960
ASMLYRDAEV LRGVYLKAVQ ERFPGHPVPS KNIYDNVDEP AWDKPVDDGV IEDEDDPFPQ  1020
KYVQPELRQS NTQAAPQRKR RARYDSDSEE YVPKRKSQRS APRDRRVPKN PLDCAIYVLS  1080
RVHGNAMPLN SLYEIMVEKG VDCGLWGRRP LAALGALLRQ NPLVFRDARG GEVWELVNKV  1140
DVDTDEEIDY LADDGREEMS EDTAEASPPP EKIEMTKQET SACRSVLAAI RASKDKKGRK  1200
RADPFELLPT RKALPEYYRA ISAPIDLGSI QKCLNAGGYP STWMFCVALE LMLSNCQNFN  1260
ESSSTLYKDA EVLRGVIAKT IQSLYPGHPV PERDSPYDAA KCVEPRWRPP ADKGAPKPKL  1320
RFTMKTVSK*
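
The sequence above is displayed in blocks of ten residues with a running residue count at the end of each row and a terminal `*`. A minimal sketch for turning that display into one contiguous string is shown below; `parse_sequence_block` is a hypothetical helper name, and the approach simply assumes that every letter in the block is a residue.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a numbered, space-separated sequence display into one string."""
    # Keep letters only: this drops spaces, running counts, and the final '*'.
    return re.sub(r"[^A-Za-z]", "", text).upper()


# First two rows of the sequence display in this record.
example = """\
MAREEELTEY ERERQAKIER NKALMERLAL AKLASEAGIG GEGDGPGRGR GRGRGRGRGR  60
GRGPRVKREP ARQSSRVEKL RKESRLTSWR GTRIERPLRT HWAFEVDADV GSLGTATEED  120
"""

sequence = parse_sequence_block(example)
print(len(sequence))  # 120, matching the running count on the second row
```

Applied to the full block, the resulting length can be compared against the stated protein length as a quick consistency check.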
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    51     61   RGRGRGRGRGR
2    53     62   RGRGRGRGRG
3    625    634  RERERERRRR
4    630    635  ERRRRD
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  CP000587  0.0      CP000587.1 Ostreococcus lucimarinus CCE9901 chromosome 7, complete sequence.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_001418985.1  0.0      predicted protein
TrEMBL  A4S0J2          0.0      A4S0J2_OSTLU; Uncharacterized protein
STRING  ABO97278        0.0      (Ostreococcus 'lucimarinus')
Orthologous Group
Lineage       Orthologous Group ID  Taxa Number  Gene Number
Chlorophytae  OGCP636355
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G21175.2  3e-08    ZIM-like 1