PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GAY48144.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family GATA
Protein Properties Length: 654aa    MW: 70540.5 Da    PI: 5.6418
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GAY48144.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA49.84.7e-16317352134
        GATA   1 CsnCgttk..TplWRrgpdgnktLCnaCGlyyrkkg 34 
                 C++Cg ++  Tp++Rrgp+g+++LCnaCGl+++ kg
  GAY48144.1 317 CTHCGISSksTPMMRRGPSGPRSLCNACGLFWANKG 352
                 *****99999***********************998 PP

2GATA48.61.1e-15506542135
        GATA   1 CsnCgttk..TplWRrgpdgnktLCnaCGlyyrkkgl 35 
                 C++Cg+++  Tp +Rrgp g++tLCnaCGl+++ kg+
  GAY48144.1 506 CQHCGVSEnnTPAMRRGPAGPRTLCNACGLMWANKGT 542
                 *******99*************************997 PP

Sequence ? help Back to Top
Protein Sequence    Length: 654 aa     Download sequence    
MPTQTGPANL QTSTADTSLP SLRQQPRPAA AVLTALPTLC EPPSISSSHL PPPCLQPSTT  60
IIATVSIVRH LVSGRPLRDG NSPSQIHSHE GFTQNLQKVF PFGNASKSPS FGPISPMYGQ  120
SQSMNISSQM SGGGAAADED DVSVAADDHH LSYDPHSALE NGIVVVEDVA HDSGYATGGN  180
ELSNSSQLTL SFRGQVYVFD SVTPDKVQAV LLLLGGCELS SSPQGMEVIP HSQRGIADYP  240
AKCTQPQRAA SLDRFRQKRK ERCFDKKVRY SVRQEVALRM QRNKGQFTSA KKCEGGALGW  300
SNAQDPGQDD SPSETSCTHC GISSKSTPMM RRGPSGPRSL CNACGLFWAN KGALRDLGKK  360
MEDQPLTPAE QGEGEVNDSD CGTAAHTDNE LVQAVLLLLG GRDIPTGVPT IEVPYDQSNR  420
GVVDTPKRSN LSRRIASLVR FREKRKERCF DKKIRYSVRK EVAQRMHRKN GQFASLKESS  480
GASPWDSSQD GIQDGTPRPE TVVRRCQHCG VSENNTPAMR RGPAGPRTLC NACGLMWANK  540
GTLRDLSKGG RSLSMDQLEP ETPMDVKPSI MEGEFSGNQD ELGTPEDPAK AVNQGSDNPS  600
IDPDEEDMHG AAEDLTNSLP MGLVHSSADD DEQEPLVELA NPSDTDIDIP SNFD
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1424429DTPKRS
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G24470.25e-70GATA family protein