PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID IGS.gm_5_00384
Common Name CHLNCDRAFT_143055
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Chlorellales; Chlorellaceae; Chlorella
Family GATA
Protein Properties Length: 2459 aa    MW: 258021 Da    PI: 5.0252
Description GATA family protein
Gene Model
Gene Model ID    Type     Source   Coding Sequence
IGS.gm_5_00384   genome   JGI      View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    GATA    22.1   2.1e-07  1124   1153  1          29
            GATA    1 CsnCgttkTplWRrgp..dgnktLCnaCGly 29  
                      C  Cgtt++++WR+ p   g++ LC  CG +
  IGS.gm_5_00384 1124 CPVCGTTSSTQWRKPPgeAGHI-LCDDCGKR 1153
                      888**********99999****.******65 PP
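The coordinates in the hit table can be cross-checked against the alignment with a few lines of Python. This is only a minimal sketch and not part of PlantTFDB: the helper name `ungapped` and the variable names are hypothetical, and the only data used are the query line of the alignment and the Start/End values listed for the GATA hit.

```python
import re

# Query line of the alignment above; '-' marks a gap in the query.
aligned_query = "CPVCGTTSSTQWRKPPgeAGHI-LCDDCGKR"
start, end = 1124, 1153  # 1-based, inclusive coordinates from the hit table


def ungapped(aligned: str) -> str:
    """Drop alignment gap characters and normalise to upper case."""
    return aligned.replace("-", "").replace(".", "").upper()


segment = ungapped(aligned_query)

# The ungapped query must span exactly End - Start + 1 residues.
assert len(segment) == end - start + 1

# Cysteine positions within the segment, converted to protein coordinates.
cys_positions = [start + m.start() for m in re.finditer("C", segment)]
print(segment)
print(cys_positions)  # [1124, 1127, 1147, 1150]
```

The four cysteines fall into two C-x2-C pairs separated by 19 residues, which is the spacing visible in the query line of the alignment.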

Protein Features
3D Structure
Database         Entry ID          E-value  Start  End   InterPro ID  Description
Gene3D           G3DSA:3.30.50.10  1.8E-4   773    823   IPR013088    Zinc finger, NHR/GATA-type
PROSITE profile  PS50114           9.684    1118   1151  IPR000679    Zinc finger, GATA-type
SMART            SM00401           0.0029   1118   1167  IPR000679    Zinc finger, GATA-type
SuperFamily      SSF57716          3.52E-6  1120   1155  No hit       No description
Gene3D           G3DSA:3.30.50.10  1.2E-6   1121   1156  IPR013088    Zinc finger, NHR/GATA-type
SMART            SM00401           0.069    1189   1246  IPR000679    Zinc finger, GATA-type
Gene3D           G3DSA:3.30.50.10  6.6E-4   1193   1246  IPR013088    Zinc finger, NHR/GATA-type
SMART            SM00401           0.24     1781   1838  IPR000679    Zinc finger, GATA-type
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0008270  Molecular Function  zinc ion binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 2459 aa
MSAEVAHQGG FEADATAGGA APAPTSSGEA AQRLQQQAGA WVGGFLEDVQ ELGEGMAPLP  60
PELPHEAGSG GQDAGVLARP LRGKAQASQH DAESYAEAAV AAARRRTGAA DVASAAGAAD  120
GGQSGAPPSG QHAAVAEAEQ EQEQPAACTA AVLLPTPDMP LAEALAPHSA AAADQPLLEA  180
GRAASEQEAA AVEAVPAQPA VPGAEHEVED VTAAAQQQQY DTLLGMSGAA QQQLVPGVYD  240
PAVAQQQAFD ALVIGPLRTA QAAGGTGGAV APGATGAAGT TYTFEASGSG PAGRPPSQQA  300
AASGGRGRPG RPASQLAGAS NDAVASGMAS ALEQAQQEVE DQHPEQGVTE GHALPVVGVP  360
ATAAGEPVEQ SVQAPKHGGT KFRVRVPSNA RLGAAAAPPA APHGSRGVSE APPGSGKYKV  420
QARDVRIAGY QSEALAAAAH DHARIWWALR QQQPEGPGTA AQAASQDQVQ AAVLGLQGEL  480
NFPPVKFSSD AALLALLSGA TRQQLEEQLQ PFRLQQWEAA ARSAGQQQLQ AGVGSAAALN  540
QDNTTRFVGV LRITGSVKLQ AAITVDGTRE HLGAFDTPVE AAVVRDIGII WKQLHGLAGD  600
AMLNLPHSNL DKDAELVQEL RDAANPNALR AVLVRWVPKR LQDVVASLPG AEAGSQRAAG  660
TTLARVGSGA VHGSELPASP PPSAAGAAQG PPAAEDVGQP GAAHGAGALA CNGTSAGGGS  720
SGNSQQTSQQ QAGDGPEGGG GSSAGEGSED RNEEGDGDEE EQPGDSSDVS GSQLCSHCGD  780
DQCTGYQRHP SSGKLLCANC FSYLQRTGLD RPADVIQAYF ANRAAGMHAR TGRGNGHGSG  840
RGSGRDNSSS RRAAAGLDVV RGPGGSNRQA AAGRARQAAA RHPEDTVLSS DEGAAAADDE  900
SSEEESDLKE RRLRRQAQRH LDVQLQLRRQ PIAASAQGRV LPRRRNAAAG VAAAVAAAGA  960
SSSEEDEGRP ATSSDYEQQE ASESEEEFAE SEAEPVQQQQ HAAASAGGDG ASAFEWEDAG  1020
HSRMLKTINV AEAEARAFIF PESTWDDLAG GSNREVLVRH AYSGEEWRGY LSGTTLRLPG  1080
LFGKLELSAG SRLQLRVAAP KRELPLVECT LLMPEAVAAI GRKCPVCGTT SSTQWRKPPG  1140
EAGHILCDDC GKRVWQQRAE SLQQQQQQPL QPQQQPLHEQ PRDQPAVGTA AVRQCTHCGF  1200
LQPGPAGRQY YWRRHPTTRE QLCAPCGRYA DKHGGELPEL LEEEEQEDSE EQQDEDSEED  1260
EEEHPQRQPP QRQCLQCGST SKGSAKWAGW HRHPASGEEW LCHPCYRKVH NAIKRQQRQA  1320
EKQAQQEVES EVEPDEEEQS EEEEEEAAIA AAVKQCTHCG SLRPGPPGTK YSWRRHPTSK  1380
EQLCGRCWGH AHRHGGELPA LSDAEGEQEQ SEEEEEEQPP QQPQERQCLQ CASPTKGSGR  1440
WATWHRHPAT GEEWLCQPCF NKARAAIKRQ QRQPEKHPQE VESEEEEAQS EEEEEEEQLQ  1500
QQPPQRQCLQ CGSTSKGSSK SAGWHRHPAT GEEWLCQPCF NKARTAAMRQ QRQPEKHPQE  1560
VESEEEEAQS EEEEEEEQLQ QQPQQRQCLQ CGSTSKGSTK SAGWHRHPAT GEEWLCHPCY  1620
RKVHNAIKRQ QRQAEKQAQQ EEKSEVEPEG EEQSEEEEAA VAAAVKQCTH CGSLRPGPPG  1680
TSYSWRWHPT SKEQLCGPCC GYSDRHGGEL PVVAWKEAAA AATNQPKKKQ QGKRKQHGRQ  1740
PPAAPAVDEE PPVSPRQHKQ QRVADAATAV GAEGQSDAGA AGEQRLCSHC GADHAGPTPT  1800
YPWRKHATTG ARLCKDCWDF SRKHYGFLPE LPQPAEPPQP SQPEGQEEQP AAGKGRQRSG  1860
TRWASGQEQG DAAAGDAEAL QAPVLLSLPG RRVTRHQQHQ QEEQQHGAVE VEAQAATAAG  1920
QASRRRRRLS EVEGEAAAEE QQQPPSASEH KRGRRQLVQA EQEQTALLGS MHKRGRRQQE  1980
EQAEDTAEAE AGEKQKEQAA EPAAKRRRGK QQEVPEPPAA AVAAAPAEPV GASQASDSAG  2040
ATGRGGGECR WVNADSTALE VEVNGTCILS ETLVLPAAMA AKLTGLQAGG HLRLRDARTA  2100
NNRTAWNASL SDGRIEGKKL FSKLDVQEGD RLLLHRQAPA AAARAPGGRP PLLVAAKVAR  2160
EGPGEQALQQ ALAMAAAAAG VSAPAAVPGS GGEGAEPADG AGAGAPAAQQ EQPPPQDVAP  2220
AAAAAEAAMD ADEARQEQEP GAAAAEAQQQ AGEAEPAAAA VAAVQPAPAE GAGPRPELEG  2280
SMAGSGGGQQ EQAQHPGVAL RMFKQAREEP ARSSAPRGAG GNAAAGQRGP AAGGPPPAEP  2340
AGEPPAAAAA PAEPPPAAAG APPAVGVAAP GGSPASGASK EANDVLERLI LVLGSDAAAR  2400
QAGLTIDLIT DHQAQFERMD LKQQHDRAKP LTMLLEMRMF AEAVRFMEKL VANNNLSA*
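
To use the listing above in other tools, the running position counters, the spaces between 10-residue blocks, and the trailing `*` have to be stripped. The following is a minimal sketch under that assumption. The function name `clean_sequence` is hypothetical, and only the first and last lines of the listing are included as sample input.

```python
import re

# First and last lines of the sequence listing, copied verbatim.
raw_lines = [
    "MSAEVAHQGG FEADATAGGA APAPTSSGEA AQRLQQQAGA WVGGFLEDVQ ELGEGMAPLP  60",
    "QAGLTIDLIT DHQAQFERMD LKQQHDRAKP LTMLLEMRMF AEAVRFMEKL VANNNLSA*",
]


def clean_sequence(lines):
    """Join listing lines into one residue string.

    Keeping letters only drops the position counters, the spaces
    between blocks, and the trailing '*' character.
    """
    joined = "".join(lines)
    return re.sub(r"[^A-Za-z]", "", joined).upper()


seq = clean_sequence(raw_lines)
print(len(seq))  # 118: 60 residues from the first line, 58 from the last
```

Passing all lines of the listing instead of the two sample lines yields the full protein string in the same way.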
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_005849601.1  0.0      hypothetical protein CHLNCDRAFT_143055
TrEMBL  E1Z9D6          0.0      E1Z9D6_CHLVA; Uncharacterized protein
STRING  XP_005849601.1  0.0      (Chlorella variabilis)
Publications
  1. Blanc G, et al.
    The Chlorella variabilis NC64A genome reveals adaptation to photosymbiosis, coevolution with viruses, and cryptic sex.
    Plant Cell, 2010. 22(9): p. 2943-55
    [PMID:20852019]