PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Glyma.20G047500.1.p
Common Name GLYMA_20G047500
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family C2H2
Protein Properties Length: 502aa    MW: 55331.9 Da    PI: 8.31
Description C2H2 family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Glyma.20G047500.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  12.4   0.00048  54     75   2          23
                         EETTTTEEESSHHHHHHHHHHT CS
              zf-C2H2  2 kCpdCgksFsrksnLkrHirtH 23
                         +C++Cgk+Fs+   L  H r+H
  Glyma.20G047500.1.p 54 ECNICGKVFSSGKALGGHRRSH 75
                         5*******************99 PP

2    zf-C2H2  16.7   2e-05    119    140  2          23
                          EETTTTEEESSHHHHHHHHHHT CS
              zf-C2H2   2 kCpdCgksFsrksnLkrHirtH 23 
                          +C +C+k F++k  L  H+r+H
  Glyma.20G047500.1.p 119 VCCICKKEFPTKNALFGHMRSH 140
                          6********************9 PP

3    zf-C2H2  15.7   4.1e-05  374    396  1          23
                          EEETTTTEEESSHHHHHHHHHHT CS
              zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                          ykC+ Cgk Fs+   L rHi++H
  Glyma.20G047500.1.p 374 YKCGACGKIFSTFQGLGRHISVH 396
                          9*******************999 PP
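
The coordinates in the alignment rows above appear to be 1-based and inclusive, so each row should satisfy end - start + 1 = number of aligned residues. The following is a minimal Python sketch of that check; the names `ROWS`, `parse_row`, and `is_consistent` are illustrative and not part of PlantTFDB or HMMER, and the treatment of `-` and `.` as gap characters is an assumption (none of the rows here contain gaps).

```python
import re

# Alignment rows copied from the three zf-C2H2 hits above:
# name, start coordinate, aligned residues, end coordinate.
ROWS = """\
zf-C2H2  2 kCpdCgksFsrksnLkrHirtH 23
Glyma.20G047500.1.p 54 ECNICGKVFSSGKALGGHRRSH 75
zf-C2H2   2 kCpdCgksFsrksnLkrHirtH 23
Glyma.20G047500.1.p 119 VCCICKKEFPTKNALFGHMRSH 140
zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23
Glyma.20G047500.1.p 374 YKCGACGKIFSTFQGLGRHISVH 396
"""

ROW_RE = re.compile(r"^\s*(\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def parse_row(line):
    """Split one alignment row into (name, start, residues, end)."""
    match = ROW_RE.match(line)
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    name, start, residues, end = match.groups()
    return name, int(start), residues, int(end)


def is_consistent(start, residues, end):
    """True if the 1-based inclusive span equals the ungapped residue count."""
    # Assumption: '-' and '.' mark gaps and do not consume a coordinate.
    ungapped = residues.replace("-", "").replace(".", "")
    return end - start + 1 == len(ungapped)


rows = [parse_row(line) for line in ROWS.splitlines() if line.strip()]
for name, start, residues, end in rows:
    print(name, start, end, is_consistent(start, residues, end))
```

All six rows pass the check, which supports reading Start/End and HMM Start/HMM End in the table as inclusive positions.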

Protein Features
Database         Entry ID           E-value  Start  End  InterPro ID  Description
Pfam             PF13912            7.2E-11  52     75   IPR007087    Zinc finger, C2H2
PROSITE profile  PS50157            10.699   53     80   IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           5.67E-5  53     79   No hit       No description
SMART            SM00355            0.2      53     75   IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0        55     75   IPR007087    Zinc finger, C2H2
SMART            SM00355            0.017    118    140  IPR015880    Zinc finger, C2H2-like
PROSITE profile  PS50157            8.767    118    140  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           3.32E-8  119    140  No hit       No description
Pfam             PF13912            3.6E-6   119    142  IPR007087    Zinc finger, C2H2
PROSITE pattern  PS00028            0        120    140  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           3.32E-8  369    396  No hit       No description
PROSITE profile  PS50157            12.321   374    401  IPR007087    Zinc finger, C2H2
SMART            SM00355            0.032    374    396  IPR015880    Zinc finger, C2H2-like
Gene3D           G3DSA:3.30.160.60  2.0E-4   374    399  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE pattern  PS00028            0        376    396  IPR007087    Zinc finger, C2H2
Gene Ontology
GO Term     GO Category         GO Description
GO:0003676  Molecular Function  nucleic acid binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 502 aa
MDQKDKHAVS DHHGWVKSCK KEKMKQQRSL LLGSCQAELE SPSAAKSPSR TVRECNICGK  60
VFSSGKALGG HRRSHFQKHQ KKVKVRFTNH SSKQAGDTSN NIKRARNCDY DTVDDGKRVC  120
CICKKEFPTK NALFGHMRSH PERSWRGVSP PTHFPNKNSS SSSSLSSSFS FSSHNSDSME  180
KNMEGDRDDY DECVGVGAVC DGGGNRAIDL STVTCPSWLK TDVRGRKCIG AYEAAETLAY  240
LNEVRPKSAP PLIKLGKRKI NFSGSSSSKK HEVKKIKFYL KGELKIGKRN DADGEDDDDE  300
DEKLNRCKGL SESEVEDEEG FDNVITEVVA PDRMQDDVDE RKGKRVVDHK GKNIKKLVLK  360
SMAKEKENEK VGGYKCGACG KIFSTFQGLG RHISVHKGKN NNAVIIMDES NHSHSKALGD  420
KENNSSSSSN THKVDEASMN EAPLPTTMND APLPADERKV NDETSETPPL LPLPAHEACQ  480
SSSGVKKLDF DLNELPCAME D*
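
The PROSITE pattern rows (PS00028) in the Protein Features table can be reproduced approximately with a regular expression. The sketch below parses the sequence block above into a plain string and scans it with a regex translation of the PS00028 consensus, C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H. The function names `parse_block` and `scan` are illustrative, and a plain regex scan is only an approximation of PROSITE's own scanning tool (for example, it reports non-overlapping matches only).

```python
import re

# Sequence block as shown above: groups of 10 residues, a running position
# counter at the end of each full line, and '*' marking the stop.
BLOCK = """\
MDQKDKHAVS DHHGWVKSCK KEKMKQQRSL LLGSCQAELE SPSAAKSPSR TVRECNICGK  60
VFSSGKALGG HRRSHFQKHQ KKVKVRFTNH SSKQAGDTSN NIKRARNCDY DTVDDGKRVC  120
CICKKEFPTK NALFGHMRSH PERSWRGVSP PTHFPNKNSS SSSSLSSSFS FSSHNSDSME  180
KNMEGDRDDY DECVGVGAVC DGGGNRAIDL STVTCPSWLK TDVRGRKCIG AYEAAETLAY  240
LNEVRPKSAP PLIKLGKRKI NFSGSSSSKK HEVKKIKFYL KGELKIGKRN DADGEDDDDE  300
DEKLNRCKGL SESEVEDEEG FDNVITEVVA PDRMQDDVDE RKGKRVVDHK GKNIKKLVLK  360
SMAKEKENEK VGGYKCGACG KIFSTFQGLG RHISVHKGKN NNAVIIMDES NHSHSKALGD  420
KENNSSSSSN THKVDEASMN EAPLPTTMND APLPADERKV NDETSETPPL LPLPAHEACQ  480
SSSGVKKLDF DLNELPCAME D*
"""

# Regex form of PROSITE PS00028: C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H
PS00028 = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")


def parse_block(block):
    """Join the residue groups, dropping position counters and the stop mark."""
    residues = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():  # running position counter (60, 120, ...)
                continue
            residues.append(token.rstrip("*"))
    return "".join(residues)


def scan(seq):
    """Return (start, end, motif) per match, 1-based inclusive coordinates."""
    return [(m.start() + 1, m.end(), m.group()) for m in PS00028.finditer(seq)]


seq = parse_block(BLOCK)
for start, end, motif in scan(seq):
    print(start, end, motif)
```

On this sequence the scan reports three hits, at 55-75, 120-140, and 376-396, which agree with the three PROSITE pattern rows listed under Protein Features.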
Cis-element
Source       Link
PlantRegMap  Glyma.20G047500.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID           E-value  Description
Refseq  XP_006606740.2   0.0      uncharacterized protein LOC102667832
TrEMBL  A0A0R0E7W2       0.0      A0A0R0E7W2_SOYBN; Uncharacterized protein
STRING  GLYMA13G04041.1  0.0      (Glycine max)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids  OGEF15181913
Representative plant  OGRP1249135
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G26940.1  2e-16    C2H2 family protein