PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cz02g09070.t1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Sphaeropleales; Chromochloridaceae; Chromochloris
Family GATA
Protein Properties Length: 1187aa    MW: 128441 Da    PI: 6.5984
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cz02g09070.t1genomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA47.82e-15344376133
           GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkk 33 
                    C +Cgt  +++WR+gp  ++ LCnaCGlyyrk 
  Cz02g09070.t1 344 CMHCGTLLSSQWRSGPHNKPVLCNACGLYYRKL 376
                    99**************88888**********96 PP

2GATA37.43.4e-12394428135
           GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                    C +Cgt  +p+WR gp  ++ LC  CG ++rk+++
  Cz02g09070.t1 394 CGHCGTEASPQWRAGPPEHPVLCDMCGRHFRKTKT 428
                    9*******************************976 PP

3GATA49.65.5e-16444478135
           GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                    C++Cg+  +p+WR gp  ++ LCnaCGl+yrk+++
  Cz02g09070.t1 444 CHHCGSQFSPQWRGGPADKPVLCNACGLHYRKTRQ 478
                    ****************77777***********997 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1187 aa     Download sequence    
MLGLQAAGSA QQPEPNPGQA VRLITTQDGV CVTIHQNGSS PASAQSADKL ELAVPSSGLA  60
HCQHNHLGQQ HEQQHQHQQH QQLSILASVA TAVPQAGPDV QGSSESDGTL HREQTPPSDI  120
SDPPPPLSQP LVQACQQQLA AMPPETTAAQ EATGAIGPTA SMDAYGEQGH HMDQLHLDRQ  180
QSTNTSTQQL LPLQQPPHQQ AMLADSATHQ QQPHPAQEQR GDEQREANGH QQHSQQQHLA  240
GGIAQQAPSN GFNNHQQYQE EQQQQQQQQQ FQEQQQQQQH QQCYPNEAQQ LVTQQDSAMR  300
PHAEHHNADG PQQGLPGTSY PDAQAAPTSG DEAGPSGRRM RGPCMHCGTL LSSQWRSGPH  360
NKPVLCNACG LYYRKLQSLP NNTCQVADRL QGPCGHCGTE ASPQWRAGPP EHPVLCDMCG  420
RHFRKTKTLP KRKRRSLTQH FQGCHHCGSQ FSPQWRGGPA DKPVLCNACG LHYRKTRQLP  480
DRPAHVMQTL LDLSRAAEQQ QAMEDNRGAG EAGLVQHVEG GTRQDGSRQD SGESSSCTHT  540
GNTGPSKSVT AGAQGPTHGY PVASALPQQH GSTIAGQNAM QVCTSDAACN MHLVPSDGAD  600
ANGTIQHGGV TASATAQGVD TQMPQAVQPW QAATAGHGSV FSGGQHYYQQ MQQSQQLVAS  660
TDYQMQQQQY AQQYGYYQQY HPMQQHQQQA VVAQPYMPQR PAQQSPAAQQ QQSQSQQPAA  720
PDYDHLHMQG IHHAQTGHYA WQHYYQLQQA AAANAAAAGR VGGPTAQCMP GQQMGWPVVA  780
PGVSMPAMVP GMDAQYQQYQ QQYMQWQQQY AAHQQHMYAQ QQPQHHIHQA PAISAAPTQL  840
DSAGNLDQQA AGQSTSNHQQ AAADYDAQGT VQGLPVHEHQ HPHYQQYMQQ QMHQQMHQQN  900
ECQYSQQPSS THQTLQAADT PSQQQQPQAG NQQVCTDVYS GDLMLEHHHG ALHQQLQHPQ  960
ATTQHADLQP MSPHMSPRAV NAQLYHSLAQ PQPQPQSQQQ QLQQQLQQQQ QQPQQHAVST  1020
GVPLSQQPDA TMAVSGTPLA PFNPNSSATH HPVSVHDGHQ TATAHQTQPT PVAPANGSED  1080
GVPSGATGEQ LGIPVGTPGM MNPMLMPPPM ATFPGSWPSC AVPPVGSHPL AAAMMGQYPA  1140
YGMCMPQPLV IVLCKDNQQQ GASAEGSVVK DVTDRGQEGN GNEARQ*
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G51080.11e-10GATA family protein