PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID kfl00334_0040
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Klebsormidiophyceae; Klebsormidiales; Klebsormidiaceae; Klebsormidium
Family GATA
Protein Properties Length: 3984 aa    MW: 423983 Da    pI: 8.8153
Description GATA family protein
Gene Model
Gene Model ID    Type      Source
kfl00334_0040    genome    KFGP
Signature Domain
No.   Domain   Score   E-value   Start   End   HMM Start   HMM End
1     GATA     37.7    2.9e-12   335     369   1           35
           GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                    Cs C+  k+p WR gp+g+ktLCn CG+y+  +++
  kfl00334_0040 335 CSVCHLNKSPRWRTGPEGPKTLCNRCGVYWSQNRH 369
                    9*****************************99987 PP

Protein Features
Database          Entry ID             E-value    Start   End    InterPro ID   Description
SuperFamily       SSF57716             5.7E-10    329     371    No hit        No description
PROSITE profile   PS50114              10.462     329     363    IPR000679     Zinc finger, GATA-type
SMART             SM00401              2.2E-9     329     382    IPR000679     Zinc finger, GATA-type
Gene3D            G3DSA:3.30.50.10     7.7E-11    330     369    IPR013088     Zinc finger, NHR/GATA-type
CDD               cd00202              3.46E-8    334     370    No hit        No description
Pfam              PF00320              9.0E-10    335     369    IPR000679     Zinc finger, GATA-type
SMART             SM00384              33         726     738    IPR017956     AT hook, DNA-binding motif
SMART             SM00384              27         812     824    IPR017956     AT hook, DNA-binding motif
SMART             SM00384              2.6        861     873    IPR017956     AT hook, DNA-binding motif
SMART             SM00384              370        1433    1445   IPR017956     AT hook, DNA-binding motif
Gene3D            G3DSA:1.20.920.10    1.4E-11    2104    2138   IPR001487     Bromodomain
Gene3D            G3DSA:1.20.920.10    1.4E-11    2181    2245   IPR001487     Bromodomain
Pfam              PF00439              6.9E-5     2186    2233   IPR001487     Bromodomain
SuperFamily       SSF47370             9.55E-8    2186    2246   IPR001487     Bromodomain
SMART             SM00384              360        2258    2270   IPR017956     AT hook, DNA-binding motif
SuperFamily       SSF57903             6.58E-14   2284    2337   IPR011011     Zinc finger, FYVE/PHD-type
PROSITE profile   PS50016              9.641      2287    2337   IPR019787     Zinc finger, PHD-finger
Gene3D            G3DSA:3.30.40.10     5.6E-16    2288    2335   IPR013083     Zinc finger, RING/FYVE/PHD-type
CDD               cd15519              3.60E-21   2289    2334   No hit        No description
SMART             SM00249              1.3E-10    2289    2335   IPR001965     Zinc finger, PHD-type
Pfam              PF00628              1.3E-10    2290    2336   IPR019787     Zinc finger, PHD-finger
PROSITE pattern   PS01359              0          2290    2334   IPR019786     Zinc finger, PHD-type, conserved site
SMART             SM00384              23         2385    2397   IPR017956     AT hook, DNA-binding motif
Gene Ontology
GO Term       GO Category          GO Description
GO:0006355    Biological Process   regulation of transcription, DNA-templated
GO:0010223    Biological Process   secondary shoot formation
GO:0043966    Biological Process   histone H3 acetylation
GO:0043967    Biological Process   histone H4 acetylation
GO:0048573    Biological Process   photoperiodism, flowering
GO:0005634    Cellular Component   nucleus
GO:0009506    Cellular Component   plasmodesma
GO:0003700    Molecular Function   transcription factor activity, sequence-specific DNA binding
GO:0004402    Molecular Function   histone acetyltransferase activity
GO:0008270    Molecular Function   zinc ion binding
GO:0042393    Molecular Function   histone binding
GO:0043565    Molecular Function   sequence-specific DNA binding
Sequence
Protein Sequence    Length: 3984 aa
MTVAASDGGP SDQTGSTTMQ SAENELWYSI QQTAEALPGG QISGGAAGDS VAGFLEQTAH  60
ASVVGELENA KRGSQEGAVV GPGPGKAAGA FLPEASQLEA PKQAPPADVS SPGVRQSDRL  120
QARPGSHVRT GSSAATSPRA RNSFTHARNP LGLDAREAAV AAPKLNGLAA GHARPEVALH  180
LRNLSVTDEV QEEPLPSAGA ISPAALPDAG AVFDRDPSVR PTTRTGGSYK TPESPVGLVS  240
PRAGVTSTAV LALPAPDRPV VTAVKRRAKA PPGVRAVGRG GPSQNVAVGL SQQEANRQSD  300
IEEEQSGSGL LEGGLSAGVS GRSGTTSGGK VPRECSVCHL NKSPRWRTGP EGPKTLCNRC  360
GVYWSQNRHK WDELVDRFKA SSDGEVNQLQ AEFKAVRHAK SADLEQKRAK VVRPPPRAPK  420
ALPQGTGTAR EGHRPRKASN LGRSEGHSRS DTETILRKLQ ELPGHFAKSL RKELLGFGLV  480
RSASPKPDAE GGQGGVPQQQ AEAVEEEVVI SSGSDDEGAH EQVRGSDVAG PSGRASDSLM  540
QGNGSSNNLG SAALAAGGSG SLPATTGSSF PSEGHPGENK AKRKRSETKI VRDSLTQTGG  600
KKARLMTGEA AMDDGAELRV GRKEEDVIKE AEELLRMGGL GGGAPRGTVA GHFVGKKSGG  660
QRTVKVTERK ALLHQARAEG PPPSSVGIVI NSSGLAKIRI GGVTRIVPKG EPVSGLDAYG  720
IPIPKRKGPG RPPSAAKLAE RLAAAGGSVV DLSGKKHLKV DLPGGGSAQQ AEGGLPEGMT  780
PTIRSARIKA RGPAPDFTDD FDDEGELPLV KKRGRGRPPS GKMSSLGQLP DVREGGRRYK  840
GAKRGRKPKN WTGEPGEKPL PRKRGRPANS QLEFSSKLDE FQKVFKPSVR RTKRPRHGLK  900
EPRNSVDFGS PSGEFPLVPD SLLMPPFPPV PELMLPSRLQ KARSHSPALG ASLGSLKPRL  960
PLQVDDIIVE SFGSVDPQHI TVETGRICAI GYRSTWKDPT LGLVHVSEVI AGEDGKPVYT  1020
VTCRLKERTD EEERSIGGGP RSPFGRSQSI QVQRSPRSPI QRSFSLQDGR FRLSAPSPLG  1080
QNQQSAFAKT GGLEQRAPGL AAEVKEAAVA GGEESAAIVE EKKALLPAAQ DWNAEPVPVL  1140
STVPEQAPIP EETEMAITPK EKDTTPLLEG AKPLVKEPTI SPLLVGKPGP PIEGLKGFDG  1200
GAADLHDLDT AMIWDEDFHV AFGDTERPAI GGFGPSLGDG RKSEGQELGW GFGLGSEQDK  1260
AQGSLWGGNG HVSEQEAEEG MLAVFRGTGE GPRLDFPPGP GEGARLDSPP DEAPKPEPAV  1320
PLEPSPVQSP EAADGPTARM EPDLLSGLPA TKKESPPKPA FAPAVLPQYM QGVTIRPIAS  1380
SGQPSNLLRI NTELPPLLAP LHINTQIPLD HPESPKPAGL APLRTAAEAL ANPRRPGRPK  1440
RLDTKPLPAV SIPDRQLRSE KQSQGVETPR TAHRRLTGSD LLVKRSSPEE AWRAYAAAWR  1500
ETFEARAPQH LEEIDWDEFE RDHFGLEKPE VVQWIADLFQ ASPRLTQARR SFQNLQLMQH  1560
GGPGPSLLQP ASKPDLDRPA TRAEEKEAND EAVRAKRQVA EGMRRQLVLF SDSGHAQREE  1620
EDGQKLVLEG VSIAEVQEAL QVYEQLSSVP SIHLPPLDVF LSGLIHPSVS AKSLINLEGI  1680
TRGKPKSKVP TASPLTAGAA QATAGKEGST AGKDGMPAVK ESRKPDAAEA VAALHLQLVR  1740
HAVAKLGSEA QGIPKLFSPR QETSPGPVST PPGDSFPAPE GEASEAAGLA GPPPQTPRPA  1800
EEASPGGGLS DSGEKVPARV STRAPKWKAG LTNGEAPWAL LMSRNMAKLN GMDKGAEAHH  1860
EGSPFRHGLE LLDGLTWPAF ARRVLEEEVK IPEPTVEEEA PPTAEPAPVP MVGSTEATMD  1920
PETGPLAEEE EPVPLPIILT PVEIVRRAEK EAEVLAEAER QLARAVAMLK GHEGPVTETG  1980
GPSLGTDRFG SVPPESAEPR PVSAHEALED EEPMEPLEKF EFWWSPLEGL GPMNSNQGSK  2040
IRGRIQTALA MGPPEWARAD LEKVISRDVY RSSAGGPMKR MALTVLERAH QYHVDSEKPS  2100
LAQFHTTRMC REVVRLVAEQ DDVKIFSNLP DEVENRFSLE GFRIKSEPYR VRTKGFRVSA  2160
GEKRGLKRTH EETASEEVPC GYIRTVDLRT IDMRLTARAY GSSLELFLAD VRMVWENARQ  2220
QYPPESKEVR TAEELSELFD RLYQEKVLNL GPDGLPLPKP SSRPPKDLTI RISDPVWGRL  2280
PKAPWEWEGC KVCGIDENDA QVLLCDQCSA EYHTYCLDPP LARIPDGDWL CHNCRELGPI  2340
PLASPTAGRP GRKKGSGRNR ADGRRVKVRI GGIPRRRSPG EGGGPKKRGR PLSAKGAERR  2400
AAEERTAERK AAARFARETV PGAAFRMERF SGVLALEAPE SPAIREARLA EEARAEAEAL  2460
AEAGVGPGAK PSVKVQAVQV GNPSEDVKPS VKPLESSNAS EVAVVAVPGG WESAPGTAPL  2520
EGTKLVGDVE DVKPTGAGST AEAAPGGEAR ALRSQGVVTP EKKQEKKKRK GQNQWTKRKE  2580
LAELAAKLEA EGKPVPQLPP RKHQNQYTKR KELALAAAKE GDKSPSKVPS SPERKPKAGV  2640
SVASRLKIAR GVTSAARERG SPGGKEVKVL RRVKGGGSSP DGEKSPKKKR AGEELKKGPQ  2700
NQHTKRKEGA EPGAEVKRKG ENQYTKAKRL AEEAQKEKER EKRKGENQYT KAKRLAEEAE  2760
TKKVAEAAEP KRKGENQYTK AKRLAEEAAR AKEERRAAKE RRRAEKEKRL AEEEKEGKKR  2820
GENQYTKAKK LADWGAKKPG PKKLAEGGAK KLVVAGGEVK RRGENQYTKA KRLAEEAAKE  2880
ARREKRREKR KGENQYTKAK KLALLALEGT QSDAPKTEPP AGTSAKQPVA DKEADVKPQV  2940
VEVGGVAPAV EAAKPSGSEA RLEAKPEKPP RKRENQYTKR KRLAEQGIEA KPVRAEPRSG  3000
LRSKDGEMPT LRSGATPKQE KGPNGEPVVY TRRKSARGEA KAEKEGGGKE GESSLQLVKA  3060
EPVDGEPSVP VQLSGDAQGE PSKGESEGVG VEVAGVSSVT AQAEIVSEPP AAKQTAEGGA  3120
PEAAGKPPAD KENGALRPMD VDLGASTAGH VGAADITAPP IGTPADVTAA APESVLTRAR  3180
AEAGPADVSR GVPVLEGRPA KRQRLAAAEG PSRPGGPILR ASQGGERRER RPSQSRGLTV  3240
RVTEPPKKKE RTPEQQLADR MGECAYWDLT LSERLRLFKA LAKRAADVTL SDAGQKPAPL  3300
GSDADGAHYW LLTCPSGPLL LTGKPAPPPP PTPKPASNPS APATSGESKD ETALLLPDLN  3360
AQPSPPAVEE HLKNTAEKGP IEGQPVLAQG LETGPKAEGL TPAAPSAGAA AAEEVAVSME  3420
EDGEAANAER PVPMDTDVGV KGEESGTALE TGLEEPQEEK KDGSAAGGKE EELEKVVTAD  3480
GAGVVDAVKA AEKDGNAEKV EEEEMLRKGT WSVVTSLEEL NDVRSRVTSA DGVTSAEANL  3540
REKFAACEGS LRQGLERVFG PEGKRPREEK RPREEDEGPF TSGKKPRLAE EEHAPGQTLR  3600
ATDLLLERER TAGETRAVPR CVCGEPLVVP RVHCPFCHCS AEKGTRAFSH WCDRPHGCLL  3660
LEELRRVEAS PATGPFGLPA GADPLVESQA DLEVLEGLEI DLQAVFAEGL DYPGLGGVAS  3720
APFTSLARSP PKRVALNRLH RSPNPAPSEL GPPAVPFGDA FFKPPQTSPK PEAPPLPHAQ  3780
SFPQFVPGGS AAKPPNPTAP LETSASLPAD TNALVPYNPP PAAADVSGGA PRDPSPETPR  3840
KVEDLPAETL ISKPPGPGKW LVEGQQPALA RLKMDLLDME AALAPQCLGG VRATPERCNA  3900
WRVMVKRAQS AKELANAAIL LELMIRPALI SKEWAVMGPL SRAVADGREG TVAQVALMVH  3960
SLDGVIDYEG LQSAKESSKR RRVR
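
The sequence block above groups residues in tens and ends each full line with a running position count, so it needs light cleaning before domain coordinates can be applied to it. The following is a minimal sketch of that cleaning step, not part of the database record: the helper names `parse_sequence_block` and `slice_region` are hypothetical, and it assumes that domain coordinates are 1-based and inclusive, which is consistent with the GATA domain alignment (335 to 369) shown in the Signature Domain section. The excerpt covers residues 301 to 420 only.

```python
def parse_sequence_block(block: str) -> str:
    """Join a numbered, 10-residue-grouped sequence block into one string."""
    residues = []
    for line in block.splitlines():
        for token in line.split():
            # Skip the running position count at the end of a line (e.g. "360").
            if token.isdigit():
                continue
            residues.append(token)
    return "".join(residues)


def slice_region(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the sequence position of seq[0]; use 1 for a full-length
    sequence, or the first residue number when working with an excerpt.
    """
    return seq[start - offset : end - offset + 1]


# Two lines copied from the record above (residues 301-420).
excerpt = """\
IEEEQSGSGL LEGGLSAGVS GRSGTTSGGK VPRECSVCHL NKSPRWRTGP EGPKTLCNRC  360
GVYWSQNRHK WDELVDRFKA SSDGEVNQLQ AEFKAVRHAK SADLEQKRAK VVRPPPRAPK  420
"""

seq = parse_sequence_block(excerpt)
gata = slice_region(seq, 335, 369, offset=301)
print(gata)  # CSVCHLNKSPRWRTGPEGPKTLCNRCGVYWSQNRH
```

The printed region matches the query line of the GATA alignment, which is a quick way to confirm that a coordinate convention has been read correctly before applying it to the other feature tables.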
3D Structure
Structure
PDB ID    E-value   Query Start   Query End   Hit Start   Hit End   Description
2kwj_A    3e-14     2286          2337        57          108       Zinc finger protein DPF3
2kwk_A    3e-14     2286          2337        57          108       Zinc finger protein DPF3
2kwn_A    3e-14     2286          2337        57          108       Zinc finger protein DPF3
2kwo_A    3e-14     2286          2337        57          108       Zinc finger protein DPF3
5i3l_A    3e-14     2286          2337        62          113       Zinc finger protein DPF3
5i3l_B    3e-14     2286          2337        62          113       Zinc finger protein DPF3
5szb_A    3e-14     2286          2337        62          113       Zinc finger protein DPF3
5szc_A    3e-14     2286          2337        62          113       Zinc finger protein DPF3
Nuclear Localization Signal
NLS
No.   Start   End    Sequence
1     2881    2889   RREKRREKR
2     3978    3982   KRRRV
Annotation -- Protein
Source   Hit ID       E-value   Description
TrEMBL   A0A1Y1I874   0.0       A0A1Y1I874_KLENI; Uncharacterized protein
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT5G25830.1   4e-08     GATA transcription factor 12