PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID 14167
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; Mamiellales; Bathycoccaceae; Ostreococcus
Family GATA
Protein Properties Length: 741aa    MW: 81299.9 Da    PI: 10.3251
Description GATA family protein
Gene Model
Gene Model ID  Type    Source
14167          genome  JGI
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    51.7   1.2e-16  83     111  1          29
   GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGly 29 
            C++C+t +TplWR+gpdg+ktLCnaCG++
  14167  83 CAHCNTQTTPLWRNGPDGPKTLCNACGVR 111
            ***************************97 PP
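
The aligned segment (residues 83-111) can be checked for the four-cysteine spacing associated with GATA-type zinc fingers. The following is a minimal Python sketch, assuming the commonly cited C-X2-C-X17-20-C-X2-C spacing; the regular expression and the helper name `find_gata_finger` are illustrative and are not part of PlantTFDB.

```python
import re

# Domain segment copied from the alignment above (residues 83-111 of TF 14167).
DOMAIN = "CAHCNTQTTPLWRNGPDGPKTLCNACGVR"

# Assumed pattern: C-X2-C-X(17-20)-C-X2-C; the loop between the cysteine
# pairs is captured so its length can be reported.
GATA_FINGER = re.compile(r"C.{2}C(.{17,20})C.{2}C")


def find_gata_finger(segment):
    """Return (start, end, loop_length) of the first match, or None.

    start and end are 1-based, inclusive positions within `segment`.
    """
    match = GATA_FINGER.search(segment)
    if match is None:
        return None
    return match.start() + 1, match.end(), len(match.group(1))


print(find_gata_finger(DOMAIN))  # (1, 26, 18)
```

For this segment the match spans positions 1-26 of the domain with an 18-residue loop, that is, a C-X2-C-X18-C-X2-C arrangement.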

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50114           13.297    77     110  IPR000679    Zinc finger, GATA-type
SMART            SM00401           7.6E-13   77     129  IPR000679    Zinc finger, GATA-type
Gene3D           G3DSA:3.30.50.10  6.9E-14   79     116  IPR013088    Zinc finger, NHR/GATA-type
SuperFamily      SSF57716          3.42E-12  80     120  No hit       No description
CDD              cd00202           2.80E-14  82     137  No hit       No description
PROSITE pattern  PS00344           0         83     108  IPR000679    Zinc finger, GATA-type
Pfam             PF00320           3.2E-14   83     111  IPR000679    Zinc finger, GATA-type
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0008270  Molecular Function  zinc ion binding
GO:0043565  Molecular Function  sequence-specific DNA binding
GO:0044212  Molecular Function  transcription regulatory region DNA binding
Sequence
Protein Sequence    Length: 741 aa
MAKVRPRVVE TTRTRDGERL GFLEPQYARD ATTRARACFH ARNCRVIRRA CDETDFGELF  60
VRDKVMDAAT SAQGLPGVAG KRCAHCNTQT TPLWRNGPDG PKTLCNACGV RDNRRHAKAN  120
RVAKPSTPKA SKGGKSNGKG EGKKRGASSP SKGEKKDAKK QKPARDYFAP KANIHVPNFN  180
VVSDYEALHA GGFRQPAKYL RQQAPDHQLH RFGPGTAASM YEATAADFKW LEHMNADEPL  240
EGKGSEVCTT TPNANYMRPE HLEKLFDVFE ETSWAASAMP TQEQAAQTVL GNIFNSANTA  300
EGKLEDVLHW ASNAQLGDGQ LVNLKAEAEN WNPFDLNDER SNQSSSETAA APLTPSGSVQ  360
FLQKRASPSD GVPSENGDDN SSTQSADDLI ERQYATNTVR QPSENVKSSD LGTRAHVGMT  420
RRGGKSMFIA RGARVPRPPT GGKSELKNAV KIVNQINDKI FKINKKAPSL AVFCKIYRYW  480
LNMRWKNGGK PLLSRFDPVP PLRLRERPEL ATESEQLAYL FCFNLQAVSR QQKERALRAE  540
QAEAQRKRRR RPCFNPAATK RRRRVQAAMD DAVSLPLTIT FDPLDELLAH GWDVVQYDPS  600
TCVVPKKKKI ISESTKERAI KKEEPVKKSG ANRPAPDALA SPPRRSVEKT EFDVAQIVVP  660
NESVLPQSDP HLKSPMGAAA AKAGALVKNF VSSVGCALGY GGNGKKCLLV NGSVGGHLSA  720
PTTPTRRSTR VKTSDWNATF *
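
The sequence is printed in groups of ten residues, with a running position count at the end of each line and a trailing `*` marking the stop. The sketch below shows one way to turn such a block back into a plain string and to slice it with the 1-based, inclusive coordinates of the Signature Domain table. Only the first two lines of the block are included, and the helper names `clean_sequence` and `subsequence` are hypothetical.

```python
# First two lines of the sequence block above (residues 1-120).
BLOCK = """\
MAKVRPRVVE TTRTRDGERL GFLEPQYARD ATTRARACFH ARNCRVIRRA CDETDFGELF  60
VRDKVMDAAT SAQGLPGVAG KRCAHCNTQT TPLWRNGPDG PKTLCNACGV RDNRRHAKAN  120
"""


def clean_sequence(block):
    """Keep only letters, dropping spaces, position counters and '*'."""
    return "".join(ch for ch in block if ch.isalpha())


def subsequence(seq, start, end):
    """Slice with 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(BLOCK)
print(len(seq))                   # 120
print(subsequence(seq, 83, 111))  # CAHCNTQTTPLWRNGPDGPKTLCNACGVR
```

The slice for 83-111 reproduces the GATA domain segment shown in the alignment.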
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    544    550  QRKRRRR
2    545    549  RKRRR
3    548    563  RRRPCFNPAATKRRRR
4    559    564  KRRRRV
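
The four predicted signals overlap one another, so they can be collapsed into a single region when a non-redundant span is wanted. A minimal Python sketch of interval merging follows; the function name `merge_spans` is hypothetical, and treating directly adjacent spans as mergeable is an assumption of this example.

```python
# (start, end) pairs copied from the NLS table above; 1-based, inclusive.
NLS_SPANS = [(544, 550), (545, 549), (548, 563), (559, 564)]


def merge_spans(spans):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous span if this one overlaps or touches it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


print(merge_spans(NLS_SPANS))  # [(544, 564)]
```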
Binding Motif
Motif ID  Method  Source
MP00110   ampDAP  Transfer from AT2G18380
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  -                   Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  CP000582  0.0      CP000582.1 Ostreococcus lucimarinus CCE9901 chromosome 2, complete sequence.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_001416350.1  0.0      predicted protein
TrEMBL  A4RTA3          0.0      A4RTA3_OSTLU; Uncharacterized protein
STRING  ABO94643        0.0      (Ostreococcus 'lucimarinus')
Orthologous Group
Lineage       Orthologous Group ID  Taxa Number  Gene Number
Chlorophytae  OGCP823833
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G18380.1  5e-13    GATA transcription factor 20