PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID OMO67422
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus
Family GATA
Protein Properties Length: 906 aa    MW: 99864.8 Da    pI: 8.3944
Description GATA family protein
Gene Model
Gene Model ID  Type    Source         Coding Sequence
OMO67422       genome  EnsemblPlants  View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    56.3   4.2e-18  251    285  1          35
      GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
               C +C +tkTp+WR+gp g+ktLCnaCG++yr+ +l
  OMO67422 251 CMHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRL 285
               99*****************************9986 PP

2    GATA    54     2.3e-17  489    523  1          35
      GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
               C +Cg+t+T++WR+gp g+ktLCnaCG++yr+ +l
  OMO67422 489 CMHCGVTETAQWRQGPMGKKTLCNACGVRYRSGRL 523
               99*****************************9986 PP

3    GATA    55.5   7.7e-18  772    806  1          35
      GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
               C +Cg+t+TplWR gp g+ktLCnaCG++yr+ +l
  OMO67422 772 CMHCGVTETPLWRDGPMGKKTLCNACGVRYRSGRL 806
               99*****************************9986 PP
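The three hits above can be cross-checked with a short script. This is only a sketch: the segments are copied from the OMO67422 alignment rows, while the C-X2-C-X17-20-C-X2-C spacing pattern and the helper name check_segment are assumptions introduced here for illustration, not something PlantTFDB reports.

```python
import re

# Domain segments copied from the OMO67422 alignment rows above,
# keyed by their reported 1-based inclusive (start, end) coordinates.
segments = {
    (251, 285): "CMHCEVTKTPQWREGPMGPKTLCNACGVRYRSGRL",
    (489, 523): "CMHCGVTETAQWRQGPMGKKTLCNACGVRYRSGRL",
    (772, 806): "CMHCGVTETPLWRDGPMGKKTLCNACGVRYRSGRL",
}

# Assumption: the commonly cited GATA zinc-finger spacing
# C-X2-C-X17-20-C-X2-C; the captured group is the central spacer.
GATA_FINGER = re.compile(r"C.{2}C(.{17,20})C.{2}C")


def check_segment(start, end, seq):
    """Hypothetical helper: return (length_ok, spacer_length) for one hit.

    length_ok is True when end - start + 1 equals the segment length;
    spacer_length is None when the pattern is not found.
    """
    length_ok = (end - start + 1) == len(seq)
    match = GATA_FINGER.search(seq)
    spacer_length = len(match.group(1)) if match else None
    return length_ok, spacer_length


results = {
    coords: check_segment(coords[0], coords[1], seq)
    for coords, seq in segments.items()
}

for (start, end), (length_ok, spacer_length) in results.items():
    print(start, end, length_ok, spacer_length)
```

Each reported Start/End pair spans 35 residues, matching HMM positions 1 to 35, so the coordinates read as 1-based and inclusive.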

Sequence
Protein Sequence    Length: 906 aa
MNDPWFDKGL NGVSDDLFNF DDVINYFDDL PPEDVDLSGG SILPLDDVEE NNAGVDGGEE  60
WDCNFQNLEP PPANVLASLS SGFYGDFFTD TLPKNVTVSC DGSSQLTERS STIKASSSRS  120
ITQHSESGDV KGSSRFQTSS PVSVLESSSS CSAANSTPIN PKLCFLVKRG RSKRRRASTF  180
NLPFALPFIS STSSTSRGSN SLVGSESESE SHLTEKHAKK RQMKKKNLTL LSGSSETKDS  240
PSQQPGVVRK CMHCEVTKTP QWREGPMGPK TLCNACGVRY RSGRLLPEYR PAASPTFVPS  300
LHSNSHKKEA HDGVLDFLDA PMEEAMEEWT TEVLASPCDD LDSLPCLFHF DSYDSTPREG  360
KEKGKNSSML SGSTEMKNQV DDINPVVAAL PLASKETTNK TEREVALELA ASSSFLPPKK  420
RSHRLAFLRC SYFLETLQTK RTMKHDKEKI KKPSDSEALN SIELPLVSGS GENKDSSSPR  480
PVVVKKCKCM HCGVTETAQW RQGPMGKKTL CNACGVRYRS GRLLPEYRPA ASPTFVPSLH  540
SSSHKKVVEM REKAMLSKSP SDNLDNLPCL FHSDSHHGTP RTIPVEGKKK EKNLSMLSSG  600
MEMKNQVDVS NGGIAPLPLE SKETTNKTEM EVVLASSSSS VAPLPENEGS GPCPLQSSTF  660
LPPRRSHRLA FLRCSDFLAT LQTTRTEKQG KDKIKEQSEA LNSIELRRSH RLAFLRCSDF  720
LETLQTTSSE KQDKEKIKEQ SEALNSIELP LVSGSSENKD SSSPRPVVVK KCMHCGVTET  780
PLWRDGPMGK KTLCNACGVR YRSGRLLPEY RPAASPTFVP SLHSNSHKKV VEMRKRTNLS  840
KSPLDNLDSL PCVFQSDSHH GTPRTIPVEG KKKEKKLSME MKNQVDESNC GIAPLPLRVR  900
RDNRQN
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    419    423  KKRSH
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G08010.2  2e-45    GATA family protein