PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID EuGene.0500010229
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; Mamiellales; Bathycoccaceae; Ostreococcus
Family GATA
Protein Properties Length: 225aa    MW: 24032.8 Da    PI: 8.363
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
EuGene.0500010229genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA49.27e-1693127135
               GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                        C+ C++ kTplWR+gp g ktLCnaCG++y+  ++
  EuGene.0500010229  93 CACCRAQKTPLWRNGPTGAKTLCNACGVRYKAGRV 127
                        999****************************8876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM004011.8E-987148IPR000679Zinc finger, GATA-type
PROSITE profilePS5011413.09987123IPR000679Zinc finger, GATA-type
SuperFamilySSF577161.24E-1188130No hitNo description
Gene3DG3DSA:3.30.50.101.5E-1389125IPR013088Zinc finger, NHR/GATA-type
CDDcd002024.19E-1192129No hitNo description
PfamPF003201.9E-1393127IPR000679Zinc finger, GATA-type
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0008270Molecular Functionzinc ion binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 225 aa     Download sequence    Send to blast
MASKPRAYAN EMFDAHAKAN AQFGSAWRQP ATTYAALAAA QTKASAERRR RDPSEESDVT  60
RQDGGGAATT PYARNVIVDD DDEGPPPEAG VTCACCRAQK TPLWRNGPTG AKTLCNACGV  120
RYKAGRVVCD ENGKVVTLAP QGRKRAASAT TGDAPYASLH KRVKTPQSSF GYAHLRDARL  180
IELERVDKTT VSEKSPLARV HSATILTDYD GAVLLMLLHD GDDD*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
14651ERRRRD
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_001417744.12e-80predicted protein
TrEMBLA4RXG35e-79A4RXG3_OSTLU; Uncharacterized protein
STRINGABO960379e-80(Ostreococcus 'lucimarinus')
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP633544
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G66320.22e-13GATA transcription factor 5