PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID EuGene.0200010371
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; Mamiellales; Bathycoccaceae; Ostreococcus
Family GATA
Protein Properties Length: 675aa    MW: 74255.6 Da    PI: 10.2618
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
EuGene.0200010371genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA51.41.5e-162250129
               GATA  1 CsnCgttkTplWRrgpdgnktLCnaCGly 29
                       C++C+t +TplWR+gpdg+ktLCnaCG++
  EuGene.0200010371 22 CAHCNTHTTPLWRNGPDGPKTLCNACGVR 50
                       ***************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM004013.8E-141668IPR000679Zinc finger, GATA-type
PROSITE profilePS5011413.2171649IPR000679Zinc finger, GATA-type
SuperFamilySSF577162.66E-121858No hitNo description
Gene3DG3DSA:3.30.50.101.1E-131855IPR013088Zinc finger, NHR/GATA-type
CDDcd002028.75E-152176No hitNo description
PROSITE patternPS0034402247IPR000679Zinc finger, GATA-type
PfamPF003203.5E-142250IPR000679Zinc finger, GATA-type
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0008270Molecular Functionzinc ion binding
GO:0043565Molecular Functionsequence-specific DNA binding
GO:0044212Molecular Functiontranscription regulatory region DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 675 aa     Download sequence    Send to blast
MTQAMDASTS GQLLPGVAGK RCAHCNTHTT PLWRNGPDGP KTLCNACGVR DNRRHAKANR  60
VQKPSAPKAP KASKSNGKGG DKRKRGDAAS PGRGGKKDAK KAKPARNYFA QKVDIHVPSF  120
HEVADYEMSH AGGFRQPNAY LRENVADHQL NRYGPGTAAP MYEATPADFE WLEHMNSEPL  180
EGKGTVCATT PNAQYMRPEH LEKLFDTFEE TSWASSAIPT QEQAAQVVLG SVFGAVNTPE  240
SKMEDVLHWA SNAQLGDGQL VNLKSEVEGW NPLDLHDENS NQSSSETAAP LTPSGSAEFL  300
LKRTSQSGSI PSENGDDDSS TQSADDLIER RLASNTDRQQ SENLKSSDLD TRAQVGKLRR  360
AGYFFSRRVA QVSRPRIGGK SELKNAIQTI NQINEKIFKI CKHAPSFEVI CKVYRYWLKK  420
RWKNGGKPLL KRFDPVPPLR LRERPEVATE SEQLAYLFCF NLEAVFRQQQ ERAVRVEQAE  480
AQKKRRRRPC FNPAATKRRR RVQAAIDDAV SKPLTISFEP LDKLFAHGWE VVEYQPKPVV  540
PVKEQKTPKK EPTVKKQSKP AADAARLPTR SPKSVKAEKL ESPAENAVFD DSVGRRSEAK  600
SVKSPMGAAA AKAGALVKNF VSSVGCAFGY GNNGKNNHEM VNGSSGDPNA PSTPTTKRRS  660
ARVKSSDWTV ARLP*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1481487QKKRRRR
2482487KKRRRR
3485500RRRPCFNPAATKRRRR
4496501KRRRRV
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00110ampDAPTransfer from AT2G18380Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_001416350.10.0predicted protein
TrEMBLA4RTA30.0A4RTA3_OSTLU; Uncharacterized protein
STRINGABO946430.0(Ostreococcus 'lucimarinus')
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP823833
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G18380.12e-12GATA transcription factor 20