PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GAY48146.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family GATA
Protein Properties Length: 634 aa    MW: 68398.1 Da    pI: 5.6337
Description GATA family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
GAY48146.1     genome  NCBI    View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    49.8   4.5e-16  317    352  1          34
        GATA   1 CsnCgttk..TplWRrgpdgnktLCnaCGlyyrkkg 34 
                 C++Cg ++  Tp++Rrgp+g+++LCnaCGl+++ kg
  GAY48146.1 317 CTHCGISSksTPMMRRGPSGPRSLCNACGLFWANKG 352
                 *****99999***********************998 PP

2    GATA    47.9   1.8e-15  506    542  1          35
        GATA   1 CsnCgttk..TplWRrgpdgnktLCnaCGlyyrkkgl 35 
                 C++Cg+++  Tp +Rrgp g++tLCnaCGl+++ k++
  GAY48146.1 506 CQHCGVSEnnTPAMRRGPAGPRTLCNACGLMWANKET 542
                 *******99************************9986 PP
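
The two alignments above can be cross-checked against the Start/End columns. The following is a minimal sketch, assuming HMMER-style conventions in which lowercase residues in the query line are insertions relative to the model (shown as `.` in the model line). The helper name `ungapped_query` is hypothetical and not part of PlantTFDB.

```python
def ungapped_query(aligned: str) -> str:
    """Return the plain domain sequence from an aligned query line.

    Keeps only letters (dropping any '-' or '.' gap characters) and
    upper-cases insert-state residues, which are shown in lowercase.
    """
    return "".join(ch.upper() for ch in aligned if ch.isalpha())


# (start, end, aligned query line) copied from the record above
hits = [
    (317, 352, "CTHCGISSksTPMMRRGPSGPRSLCNACGLFWANKG"),
    (506, 542, "CQHCGVSEnnTPAMRRGPAGPRTLCNACGLMWANKET"),
]

for start, end, aligned in hits:
    domain = ungapped_query(aligned)
    # The ungapped length should equal the span given by Start and End.
    assert len(domain) == end - start + 1
    print(start, end, domain)
```

The length check passes for both hits (36 and 37 residues), which agrees with the coordinates listed in the table.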

Sequence
Protein Sequence    Length: 634 aa
MPTQTGPANL QTSTADTSLP SLRQQPRPAA AVLTALPTLC EPPSISSSHL PPPCLQPSTT  60
IIATVSIVRH LVSGRPLRDG NSPSQIHSHE GFTQNLQKVF PFGNASKSPS FGPISPMYGQ  120
SQSMNISSQM SGGGAAADED DVSVAADDHH LSYDPHSALE NGIVVVEDVA HDSGYATGGN  180
ELSNSSQLTL SFRGQVYVFD SVTPDKVQAV LLLLGGCELS SSPQGMEVIP HSQRGIADYP  240
AKCTQPQRAA SLDRFRQKRK ERCFDKKVRY SVRQEVALRM QRNKGQFTSA KKCEGGALGW  300
SNAQDPGQDD SPSETSCTHC GISSKSTPMM RRGPSGPRSL CNACGLFWAN KGALRDLGKK  360
MEDQPLTPAE QGEGEVNDSD CGTAAHTDNE LVQAVLLLLG GRDIPTGVPT IEVPYDQSNR  420
GVVDTPKRSN LSRRIASLVR FREKRKERCF DKKIRYSVRK EVAQRMHRKN GQFASLKESS  480
GASPWDSSQD GIQDGTPRPE TVVRRCQHCG VSENNTPAMR RGPAGPRTLC NACGLMWANK  540
ETPMDVKPSI MEGEFSGNQD ELGTPEDPAK AVNQGSDNPS IDPDEEDMHG AAEDLTNSLP  600
MGLVHSSADD DEQEPLVELA NPSDTDIDIP SNFD
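
The sequence block above is grouped in tens with a running position counter at the end of each full row. The following is a rough sketch of how it could be turned into one contiguous string and sliced with the 1-based, inclusive coordinates used elsewhere in this record. The names `clean_sequence` and `region` are hypothetical helpers, not part of any PlantTFDB tooling.

```python
RAW = """
MPTQTGPANL QTSTADTSLP SLRQQPRPAA AVLTALPTLC EPPSISSSHL PPPCLQPSTT  60
IIATVSIVRH LVSGRPLRDG NSPSQIHSHE GFTQNLQKVF PFGNASKSPS FGPISPMYGQ  120
SQSMNISSQM SGGGAAADED DVSVAADDHH LSYDPHSALE NGIVVVEDVA HDSGYATGGN  180
ELSNSSQLTL SFRGQVYVFD SVTPDKVQAV LLLLGGCELS SSPQGMEVIP HSQRGIADYP  240
AKCTQPQRAA SLDRFRQKRK ERCFDKKVRY SVRQEVALRM QRNKGQFTSA KKCEGGALGW  300
SNAQDPGQDD SPSETSCTHC GISSKSTPMM RRGPSGPRSL CNACGLFWAN KGALRDLGKK  360
MEDQPLTPAE QGEGEVNDSD CGTAAHTDNE LVQAVLLLLG GRDIPTGVPT IEVPYDQSNR  420
GVVDTPKRSN LSRRIASLVR FREKRKERCF DKKIRYSVRK EVAQRMHRKN GQFASLKESS  480
GASPWDSSQD GIQDGTPRPE TVVRRCQHCG VSENNTPAMR RGPAGPRTLC NACGLMWANK  540
ETPMDVKPSI MEGEFSGNQD ELGTPEDPAK AVNQGSDNPS IDPDEEDMHG AAEDLTNSLP  600
MGLVHSSADD DEQEPLVELA NPSDTDIDIP SNFD
"""


def clean_sequence(block: str) -> str:
    """Join the residue groups, skipping the numeric position counters."""
    return "".join(tok for tok in block.split() if tok.isalpha())


def region(seq: str, start: int, end: int) -> str:
    """Slice using 1-based, inclusive coordinates as shown in the record."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
print(len(seq))  # compare with the stated length of 634 aa

# Coordinates of the two GATA domain hits listed in this record
for start, end in [(317, 352), (506, 542)]:
    print(start, end, region(seq, start, end))
```

The same `region` helper works for any other 1-based coordinate pair reported in this entry.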
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    424    429  DTPKRS
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G24470.2  4e-70    GATA family protein