PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID kfl00779_0040
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Klebsormidiophyceae; Klebsormidiales; Klebsormidiaceae; Klebsormidium
Family GATA
Protein Properties Length: 1212aa    MW: 123693 Da    PI: 10.169
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
kfl00779_0040genomeKFGPView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA41.91.4e-13278307130
           GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyy 30 
                    C+ Cgt +++ WRrgpdg+ktLCn+CGl +
  kfl00779_0040 278 CHYCGTRESTYWRRGPDGPKTLCNSCGLNW 307
                    99**************************99 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5011410.739272305IPR000679Zinc finger, GATA-type
SuperFamilySSF577162.95E-11272310No hitNo description
SMARTSM004014.2E-7272321IPR000679Zinc finger, GATA-type
Gene3DG3DSA:3.30.50.101.4E-11277312IPR013088Zinc finger, NHR/GATA-type
CDDcd002021.27E-11278312No hitNo description
PfamPF003201.7E-10278307IPR000679Zinc finger, GATA-type
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0008270Molecular Functionzinc ion binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1212 aa     Download sequence    Send to blast
MGAVKVWFLK WPHAAATVLI YSWLASGAAS RQHSVFDEDA IAIRVACLSR DAVIGGIVWV  60
QAGGQHPFNH RQSNNMDPSA MPLTAPSKAS SLPSPHSAED QNGNQSSGSA EPTPLPAITP  120
SPKPGHEAVR RANKRSLADT VKMRTRGALA EDTATAEASQ LEGVAKKRAK AAKVETGGGP  180
AASPRAPPPT HSQDAGEPNI LTGRGGQDET RFEDFGSSPG AALDGLGCGA QAEARQAVGD  240
AGATQAHAGQ AAGRKGGGKS ERKVAPSPAA KVGGDIHCHY CGTRESTYWR RGPDGPKTLC  300
NSCGLNWFRQ RPMQREQVEV GNLMAAAVAA AALKRPANSL KDGPEKRKRE EKKDGKQLLE  360
QLHGTKGAHA QQQPGAKRRK GDGGAGAGKP ERAGGQAVAS SLKGLQRTEK GSIRRKLGDA  420
AVKKAALIKR TPSQLSLVGL LESAAAAPPA PPELPPAAAV KPTGVLKGAR FSPLKQAQTR  480
LGAQKLLAVL NVIKQKKGLG RVTSTASLES VPEEDPLAAQ PDGIPGISGS GVPGGTEVLA  540
SSQDALRGGW EPGPTGASTR RPTKTGARSQ RRPNTALLQG ASFLPWKARP RGWESSSLTF  600
AAAESRLLKP QAAAPAAHGR SPLQTRAEAG PIGAGPEQEG VGGGNRGAAP DPQQRARLRE  660
GGSHEAAAGG AEAGEGGSDD DGFAGYGVLR KLLEEVDAPP TSPPAKGRPP LNKKGGAPLA  720
KKLQRSEQPL GSQGPSARAG LNSYESFEDL VMGTAGGMKR FESFEDLALR VGPVFARVES  780
FEELALRAGP AMRRFESFED LAMRSQPVVE PPPHPAPSAL PAAHVAGRGE GESSAEAGSE  840
GEAGSRDGAG RRSGREKKQK TWEWAVVDGP KRRRKDARDD KGLPREGPDG PAPWRSGTAQ  900
GQRKGEPGQA LQPPRGSQGR GRGGAQQFPG MLGPGESPTA GSLESAQRGE GTEEPVLPND  960
ASGSAEGRLL AAPGQAGGLP QHSAGAAAAG QPGAALLPNG LRYALRVPAA AQPSGSAGPT  1020
ASLARNVLGL GPRGVSARAL HLLVDCAVEP EPAADVATLM ADLQTLAIGT ADKRGGSDGG  1080
QEQGRAADGA AGGMKAGFAA SFTLPNGRAV KQVAPVAHAV RLPAGAEVAS DGAAEGRGGG  1140
VIVELGAPEA GPPAAEQEDG AVGPSKATSV TPRGAPAAVD KDARAAQLLQ VPAGEPELHC  1200
AEVLQSPEDL PC
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1164170AKKRAKA
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
TrEMBLA0A1Y1IMQ30.0A0A1Y1IMQ3_KLENI; Uncharacterized protein
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G21175.17e-08ZIM-like 1