PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID KN540155.1_FGP001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Oryzoideae; Oryzeae; Oryzinae; Oryza
Family GATA
Protein Properties    Length: 3185 aa    MW: 356720 Da    pI: 4.5765
Description GATA family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
KN540155.1_FGP001 | genome | BGI | View CDS
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | GATA | 48.3 | 1.3e-15 | 2896 | 2933 | 1 | 36
               GATA    1 CsnCgttk..TplWRrgpdgnktLCnaCGlyyrkkglk 36  
                         C++Cg +   Tp++Rrgpdg++tLCnaCGl+++ kg++
  KN540155.1_FGP001 2896 CHHCGISAasTPMMRRGPDGPRTLCNACGLMWANKGTM 2933
                         *****88888*************************987 PP

Protein Features
3D Structure
Database | Entry ID | E-value | Start | End | InterPro ID | Description
SuperFamily | SSF57997 | 5.1E-6 | 2211 | 2412 | No hit | No description
Gene3D | G3DSA:2.30.110.10 | 4.4E-27 | 2615 | 2725 | IPR012349 | FMN-binding split barrel
Pfam | PF01243 | 4.1E-6 | 2616 | 2683 | IPR011576 | Pyridoxamine 5'-phosphate oxidase-like, FMN-binding domain
SuperFamily | SSF50475 | 3.09E-18 | 2617 | 2783 | IPR012349 | FMN-binding split barrel
PROSITE profile | PS51017 | 13.384 | 2821 | 2863 | IPR010402 | CCT domain
Pfam | PF06203 | 8.3E-15 | 2821 | 2863 | IPR010402 | CCT domain
PROSITE profile | PS50114 | 9.249 | 2890 | 2938 | IPR000679 | Zinc finger, GATA-type
SMART | SM00401 | 2.2E-12 | 2890 | 2943 | IPR000679 | Zinc finger, GATA-type
SuperFamily | SSF57716 | 8.55E-11 | 2894 | 2946 | No hit | No description
CDD | cd00202 | 5.92E-13 | 2895 | 2939 | No hit | No description
Gene3D | G3DSA:3.30.50.10 | 1.9E-14 | 2895 | 2936 | IPR013088 | Zinc finger, NHR/GATA-type
PROSITE pattern | PS00344 | 0 | 2896 | 2923 | IPR000679 | Zinc finger, GATA-type
Pfam | PF00320 | 1.8E-13 | 2896 | 2933 | IPR000679 | Zinc finger, GATA-type
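
The Start and End columns of the feature table can be compared against the signature domain interval (2896 to 2933) to see which annotations cover the GATA zinc finger. The following is a minimal sketch: the `Feature` type and the `overlaps` helper are hypothetical names introduced for illustration, and only four rows of the table are copied in.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """One row of the Protein Features table (hypothetical helper type)."""
    database: str
    entry_id: str
    start: int  # first residue of the feature, as listed in the table
    end: int    # last residue of the feature, as listed in the table


# A subset of rows copied from the Protein Features table of this record.
features = [
    Feature("Pfam", "PF01243", 2616, 2683),
    Feature("Pfam", "PF06203", 2821, 2863),
    Feature("SMART", "SM00401", 2890, 2943),
    Feature("Pfam", "PF00320", 2896, 2933),
]


def overlaps(feature: Feature, start: int, end: int) -> bool:
    """True if the feature shares at least one residue with [start, end]."""
    return feature.start <= end and feature.end >= start


# Signature domain interval taken from the Signature Domain table.
hits = [f.entry_id for f in features if overlaps(f, 2896, 2933)]
print(hits)
```

With these four rows, only the SMART and Pfam GATA-type zinc finger entries overlap the signature domain; the FMN-binding and CCT domain entries lie upstream of it.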
Gene Ontology
GO Term | GO Category | GO Description
GO:0006355 | Biological Process | regulation of transcription, DNA-templated
GO:0055114 | Biological Process | oxidation-reduction process
GO:0003700 | Molecular Function | transcription factor activity, sequence-specific DNA binding
GO:0004733 | Molecular Function | pyridoxamine-phosphate oxidase activity
GO:0005515 | Molecular Function | protein binding
GO:0008270 | Molecular Function | zinc ion binding
GO:0010181 | Molecular Function | FMN binding
GO:0043565 | Molecular Function | sequence-specific DNA binding
Sequence
Protein Sequence    Length: 3185 aa
MGVQLQQFRK KKEKRGPGKK AEAKADAAEA EEGSSKSGVD AEEAVPEPKS PVGLKLLAGE  60
GGASHRTPFE EAARSQVEQC NGQGPDTAES CVVDNSDVLP VQEGGDGGGN AQDVGVSEHG  120
SLEHVNPGTD DGEGATIPVT GADGSGLLIE GAQPVEMDVD EKLPDNSLKE NMELCTSSQG  180
AIADDNGDSQ AEEHQQVEMY PVERPTSSDS KEITDIIGHS QDIGAGNTNK GEGRARETEI  240
DVSGMPSGAV VECEGELNVR ASHEASESTS REDTDKEADA LGEEAAVQED PGVANATEGV  300
VMVDDLSLHA KSIGAVSLPP HKEIDQALLA SDISQDMAPY HLEDIQRHLY LATMSRDFLQ  360
LQMDESADLN TDGTPESSNE VINLQVLLEE TEKSKLAVCE ELQQCRHELS DMNTVKEELE  420
LTVASLKDRI NTSNNKCEHL EFELQSSKEN TQQILNELSG CRAMLEALQK ENLELTATLT  480
FEKEARKEVE EQREHLCSEN KRVLSNLSDL ELSLASLKEE MNDGSNRCAD LECELRSTKE  540
NMERTLVELA SCRNSLETLQ NDNLELSANS SFEKEAIKKL EEDNLCLSNE KQGLLLDLSE  600
TKEELHLSYA KHEHLESHAR DMETYFGQLT EQLIEENIYT STSVDIYQTI TKELYAKCNV  660
VLGEARNAHQ DNEACLDSSE IIVENVERET TSPELIGHDD NQRPLLVAEN DSCNSTALQS  720
LKGHLKVAKG DLRDLQKLVE RISSRSDGRV LVSKLIQSFE SKGNQEDLGM SEGEHDNLRK  780
LTQEMICRLV EKLKAMTSDI AKTEEYVAEL CNRIELSVKF MSQHEAEIEH TAVLVAKMDG  840
FAGKLSNYKD TIDQLVSQVA NVHQDADNHA GRLIDQAELL QNDVTERIST LEKERTSLTD  900
VLMEVTDKLS ALSKNALPSD LGGSEGLGSL ALSSVECAAK LVQNLQEKLE DAQTDNAELN  960
ASLVELKTAH SDVQERSKHA HGIVKKMYIS LQELLFNSLG NPDESGVEFN AEEPIEALFS  1020
QYGDIIEHLK SLLHERQYLL SKNTDLESRL LSKCEETEAL SSSLTKNMND FSLLNEELKS  1080
VSISRIEAQD ELHGRCLAIA EKMVHRSTSH SSTVLSSMEM SSKANHILTT LLPCIEEGVA  1140
SYIEEFENMA EEIRLSKICL QESNIIGQSS SENWSVSLPV LIKEEIVPIF FDLQGRIDQL  1200
STLNIQLETE VPVLRDGLTK LDSALETSRA ELQKKVFELE QSEQKLSSVK EKLSIAVAKG  1260
KGLIVQRDSL KQTLLEKSGE LEKLAHELQS KDSLLIELEA KIKSYADADR IEALESELSY  1320
IRNSATALRD SFLQKDSVLQ RIEEVLEDLD LPENFHFRDI VEKIELLSKM AVGASFTVPD  1380
GNKQSSVDGN SESGAAIDSI NDEQNSNSNS GAEEIKIKYD ELHRRFYELA EHNNMLEQSL  1440
VERNNLIQKW EEVLGQISIP QQFRMLEPED RIAWLGNRLL EVEHERDALH LKIEHLEDSS  1500
EMLISDLEES HKRISELSAE IVAVKAEKEF FSQSLEKLRF DFLGLSEKAV QDEYVRDNLR  1560
KDLAELQEKL AEKTEESKLY HDMEMEINKL MDLVRDALQD DSNTEIPSGA GVGAAVLCLG  1620
SLLSRLIDGY KTHLSESTVR SSAEMETLSE TKISKDASTS ERGMEEKEMA LNTLSGELEH  1680
TRNSLALLEQ QRDEAVEKTQ SLTIELETLR AQIDQLQGDG AEQMNRYQSL MLELESMTKQ  1740
RDDLQEKLGQ EEQKCTSLRE KLNVAVRKGK GLVQHRDSLK QTMEEMNTMI EKLKVERKQH  1800
IESLESERSS LMGRLAENEK SLHDATQYLS RLLNSLSTVD IGREFDTDPI TKVENFSKFC  1860
LDLQNEVKKS KQATELLLAE LNEVHERADN LQDELVKAEA ALSESFKQNS VVESARADAV  1920
RHLERIMHMQ SQTKRKQIDH LMELNSTSSQ LREIFSELLH HLLNTFSKDV DIINYMESFV  1980
KSSDKWMDST SMVEIPITSN HHLSNSISSK TCSSQMAHIP NVPLEITLDN ADETQILHHL  2040
ATACHAVADC VNDCNDIKSR IHEHGFSVDQ KAADLFNVMS NLQNKFTSQN NELESLRENI  2100
IELQSEIKQR DEEILSMRRN LSLLYEACTS SVAEIEGMTG IESGDHSCSV VQNHLSADDH  2160
IKSVVNQLVA AIKTTQNSNE GNTKELKATV LELQQELQEK HIQISTISAE LASQVREAES  2220
SAKQLSVELA NARMEIHNLE KHSEMLLNQK KNLETQVSEL KDMEAVAHDQ HGRIKDLSDE  2280
LSKKDQEIEG LMQALDEEER ELEVLENKSN DLEKMLQEKE FALKSLEVSR TKALTKLATT  2340
VDKFDELHSL SESLLAEVEN LQSQLQERDS EISFLRQEIT RSTNELLTTE ESNKKYSSQI  2400
NDFTKWLETA LLQFSVHCDS TNDYECTQVP VYMDMLEKKI GSLISESDEL RVTLQSKDSL  2460
LQAERTRMEE LLRKSEVLES SLSQKDSQIG LLRRDRTSGQ PSRFINLPGT SEIEQVNEKV  2520
SPAAVVTQIR GARKVNTDQV AIDVEVEKDK PLDDEDDDKA HGFKSLTMSR IVPKFTRPIS  2580
DRIDGMCCSD LLWRVTDSLM QILLVIMRNL ENCKAKKEGY PFGSLVDFAP DPMGHPIFSL  2640
SPLAIHTRNL LEDPRCTVVV QVPGWSGLSN ARVTIFGDVV PLPADLQVLV DFVNGNSDLK  2700
VNLFFFRNGL INSFGTVAWL DVKEYEALKP DKIATDGGEQ SLKELNAMYS KPLKELLSTE  2760
IEVDDAALIS IDSKGIDIRV RQGAQVQAVL LLLGGRELAP GSGSVPSSSA AYSKKMNFPH  2820
RMASLMRFRE KRKERNFDKK IRYTVRKEVA LRMQRNRGQF TSSKSKAEEA TSAITSSEGS  2880
PNWGAVEGRP PSAAECHHCG ISAASTPMMR RGPDGPRTLC NACGLMWANK GTMREVTKGP  2940
PVPLQIVPAS TNDVGYKDAL VEAQALRVNC SSESERRQAL ESHVADLKSD NERLRRLYTE  3000
TLFKFTNQMK FHTESRNLKE ELEKANTRLL SMEEEYKREI EQLKLGSEMN SNDLENKLSC  3060
ALVQQATNEA VIKQLNLELE AHKAHIDMLS SRLEQVTAAV HQQYKNEIQD LKDVVIVEQE  3120
EKNDMHRKLQ NTENELRIMK MKQAEQQRDS ISVQHVETLK QKVMKLRKEN ESLKRRLATS  3180
ELDCS
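
The sequence listing is printed in blocks of ten residues with a running position counter at the end of each row, so it needs cleaning before coordinates from the Signature Domain table can be applied to it. The sketch below shows one way to do that, assuming the Start and End values for the GATA domain (2896 and 2933) are 1-based and inclusive, which is consistent with the alignment shown for this record. The function names `clean_sequence_block` and `slice_1based` are hypothetical, and only the single row covering residues 2881 to 2940 is used to keep the example short.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Drop spaces, newlines and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_1based(seq: str, start: int, end: int, offset: int = 0) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the number of residues that precede `seq` in the full protein,
    so a fragment can be sliced with full-length coordinates.
    """
    return seq[start - 1 - offset : end - offset]


# The row of the listing that ends at position 2940 (residues 2881-2940).
row = "PNWGAVEGRP PSAAECHHCG ISAASTPMMR RGPDGPRTLC NACGLMWANK GTMREVTKGP  2940"

fragment = clean_sequence_block(row)
gata = slice_1based(fragment, 2896, 2933, offset=2880)
print(gata)  # CHHCGISAASTPMMRRGPDGPRTLCNACGLMWANKGTM
```

The extracted 38 residues match the query line of the GATA alignment in the Signature Domain section once its lowercase insert-state letters are uppercased. To work with the whole protein, pass the full listing to `clean_sequence_block` and leave `offset` at 0.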
Nuclear Localization Signal
NLS
No. | Start | End | Sequence
1 | 8 | 13 | RKKKEK
Cis-element
Source | Link
PlantRegMap | KN540155.1_FGP001
Regulation -- PlantRegMap
Source | Upstream Regulator | Target Gene
PlantRegMap | Retrieve | -
Annotation -- Nucleotide
Source | Hit ID | E-value | Description
GenBank | AP005191 | 0.0 | AP005191.3 Oryza sativa Japonica Group genomic DNA, chromosome 2, PAC clone:P0479D12.
GenBank | AP014958 | 0.0 | AP014958.1 Oryza sativa Japonica Group DNA, chromosome 2, cultivar: Nipponbare, complete sequence.
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_015624718.1 | 0.0 | golgin subfamily A member 4
TrEMBL | A0A0E0N9U9 | 0.0 | A0A0E0N9U9_ORYRU; Uncharacterized protein
STRING | ORUFI02G03810.1 | 0.0 | (Oryza rufipogon)
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT3G21175.1 | 7e-47 | ZIM-like 1