PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cz11g08060.t1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Sphaeropleales; Chromochloridaceae; Chromochloris
Family GATA
Protein Properties Length: 1993aa    MW: 208845 Da    PI: 4.8475
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cz11g08060.t1genomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA35.31.6e-1113821416135
           GATA    1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35  
                     C +Cgt  +p+WR+g   ++ LCnaCG++ r++++
  Cz11g08060.t1 1382 CCHCGTNYSPQWRKGTPEKPILCNACGIRLRRHKR 1416
                     99*************955555***********975 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1993 aa     Download sequence    
MSEYDTDIWA ELLSEPLCPD LTDDDGLAGS VPRAGEPCAA QRHTKSPSIM QAKAASAATS  60
KQAPIHHAGA EQASIGAAKP MERAGADSGG MDEAWQPSDV PCLPQQHSSL NAAPPKQQVS  120
AVPQPACFVA SEAEVVATPV VPLSYTDPLS SADQLEQTFS PMSTAQPVVT AHRDNISSTI  180
DSAHDVVARV QAQAQSIRQH PTVILTSQST APERAHGVAP LLPQADVMTG SGLLAEPTCS  240
AALPGGIQAV DRLHMPASAA QGQTPTTDLD VQPSTVLSAA ALPSNPPVAP AEGQKAASGL  300
AAGKPVAKDM QVDPDMSFLP SNIPYQPQSL LTASSGMVLV MPQTVMSVCQ GVAPSQLQGQ  360
AVASTNGATQ LMLTQLPLML PLHSAMTGVQ SPSAVAAGSL AVQLPLQVHQ YANEYVAQLA  420
TLLSMVNGSY AKLLDSKHVP SVAGVQQTAV LTGGATVSHS SKQQLDDAHM PPVSKAMHAE  480
GARSLAADGK APSTLAFDAN AARAIQLLVS LQGATEPAIA ANASCAGDAT ECADLGLQLS  540
PNSDSLDCNH HTAEAVANNV GSSSRLPTDT AANSAAAGLC ASSKESEQAA LGKSVKQPKW  600
MGSACTTADA LGRSRRGRSI KRPSWMLDHI EVRSAGKKVC KVGAAAIKAV AGSQQQPMSE  660
QQADTIATVN QPQSPAVAHG PDEHEGALAG DVDMTEQQQD EQQHQQRKEG HSAVAFQTDT  720
PANQLAPGKQ GCTDAGQLQP VVDSAEQQQQ AALPQSMNQT CDSMSGGCGR PAVDDSLQEV  780
IAVVQDSNRT SAGVDAAASP ALHDSGEATV STAAIDAAAD MKCAYCIIQH VSEAELADCP  840
AALAASQPGD TKEQQQGCLM PAAAKAPQEI QQQDVAAEAV HVTYSHVLDM GTGSMSLEEV  900
TNVHMNLNRK KLPFETKMCV VAADAHEPGA QNVMQTEVAG VTTMVAGDLL GGYAVAAARD  960
GAAASAQGTV PVRAEDEAGM CVDVAAFLHP DGPESGEHAI KADDATCRGQ NDIMCYEKAD  1020
VAAGSDTDMA AVSDADADVY KEVKATESNA HGLLWEDNDH GLSGFWGSDT KQDLSSLWIK  1080
DSPHALEPLW VPEAALGLAG TQDDICLTQQ HDVGQQQGFW GERCDTELGN EGQRVKLEEE  1140
QQEQHLGFPA CTSTQPAIET CEDPLQQTDE EEEPSCMQHD ADQQAQFADQ GWRGTWQQTS  1200
EGNVHPAAFQ GLNHHPPMSC QHDEGPGDML VDSQGGLKHE FADQQIPVHY AAAVEHYRSD  1260
DSGPDDDMPA QQLHVMGRQQ RPMSSHHVEG LGFASSFSEE LEQLEDEAQE CHRACGYDRA  1320
CQPVYGRMGR LGHQSSFAGR HARGKHHRPD EYKPHRGEPK RDAKAAGPKR HSNGKAELAG  1380
PCCHCGTNYS PQWRKGTPEK PILCNACGIR LRRHKRLEPC NPTGGEGPGC VADVHGIQGA  1440
GSSGSASCQQ GLNGKWQLRS HGSRSTQHSK QQAVESLQRT RPLSLRLQRR AQEGKSVSPA  1500
KLELDDDLDE PASPRRKYHP ATASVKLQQC RPTPYLSIPI PRNAAKRMQV PSPGSRLPPQ  1560
GFRERHSSPR QASPAAEQVV IWTASGLPPS GNQLGLRGRH ASLSPMCRQM PKSTRAGLAV  1620
NKAEERDTAM LQHPARNNGQ QAMLLTSINA VDIHGSDMNA LTAVPATACH VSAGCVASIE  1680
NQPQDQISNS QLLYNDPDSP FSVYGNDMEP VQQSMDEESS GQWSSGMTPS CLVARPYSVS  1740
SDNLQDSADT HVAWNADALY CEDMGRDGLM CGSKPAALTS MSSGALTGQL SVEISPTAEQ  1800
LVGAFPCDQS MIAAAACRPG GVASRLYTAG SDADVGFNSD LAWDQPEASA QGLAAADGQL  1860
PASGLHALTE LDNQAHHGLP EFGHNAMVLG GAGYGRSDMF IVVGGGNGGV SSKAGDMVEG  1920
LGGCELGTCQ DAAVAGSPES AECESMQVYG IDNGSGDLVN GITSQAFNQC GQCSKHVPAG  1980
YMHPLCDACR WV*
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G17570.18e-10GATA family protein