PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Glyma.04G192000.4.p
Common Name GLYMA_04G192000
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family C2H2
Protein Properties Length: 1117aa    MW: 127317 Da    PI: 8.4833
Description C2H2 family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Glyma.04G192000.4.p  genome  JGI     View CDS
Signature Domain
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-C2H2  13.2   0.00026  1000   1025  1          23
                           EEET..TTTEEESSHHHHHHHHHH.T CS
              zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                           y+C    C++sF +k +L++H r+ +
  Glyma.04G192000.4.p 1000 YQCDidGCTMSFGSKQELMHHKRNiC 1025
                           99********************9877 PP

2    zf-C2H2  13.3   0.00024  1025   1047  3          23
                           ET..TTTEEESSHHHHHHHHHHT CS
              zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                           Cp   Cgk F ++ +L++H r+H
  Glyma.04G192000.4.p 1025 CPvkGCGKKFFSHKYLVQHRRVH 1047
                           9999*****************99 PP

3    zf-C2H2  13.9   0.00015  1083   1109  1          23
                           EEET..TTTEEESSHHHHHHHHHH..T CS
              zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                           y+C   dCg++F+  s++ rH r+  H
  Glyma.04G192000.4.p 1083 YVCAepDCGQTFRFVSDFSRHKRKtgH 1109
                           899999****************99666 PP

Protein Features
3D Structure
Database         Entry ID           E-value  Start  End   InterPro ID  Description
Gene3D           G3DSA:3.30.160.60  1.9E-4   989    1022  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
SuperFamily      SSF57667           7.59E-5  996    1052  No hit       No description
SMART            SM00355            7.3      1000   1022  IPR015880    Zinc finger, C2H2-like
PROSITE profile  PS50157            12.944   1023   1052  IPR007087    Zinc finger, C2H2
SMART            SM00355            0.0045   1023   1047  IPR015880    Zinc finger, C2H2-like
Gene3D           G3DSA:3.30.160.60  1.2E-6   1024   1051  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE pattern  PS00028            0        1025   1047  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           2.74E-9  1039   1081  No hit       No description
Gene3D           G3DSA:3.30.160.60  8.1E-9   1052   1077  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE profile  PS50157            10.741   1053   1082  IPR007087    Zinc finger, C2H2
SMART            SM00355            0.0014   1053   1077  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0        1055   1077  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           5.71E-8  1071   1105  No hit       No description
Gene3D           G3DSA:3.30.160.60  5.2E-9   1078   1106  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE profile  PS50157            11.697   1083   1114  IPR007087    Zinc finger, C2H2
SMART            SM00355            1.1      1083   1109  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0        1085   1109  IPR007087    Zinc finger, C2H2
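The three PROSITE pattern hits (PS00028) listed above can be approximately reproduced with a regular expression. The sketch below is illustrative only: it assumes the PS00028 consensus is C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H, which is not stated in this record and should be checked against PROSITE, and it scans residues 961 to 1116 copied from the Sequence section of this entry. The names PS00028_REGEX, FRAGMENT and find_c2h2 are hypothetical helpers, not part of PlantTFDB.

```python
import re

# Assumed PS00028 consensus translated to a regex (verify against PROSITE):
# C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H
PS00028_REGEX = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

# Residues 961-1116 of Glyma.04G192000.4.p, copied from the Sequence section
# (the trailing stop symbol '*' is left out).
FRAGMENT_START = 961
FRAGMENT = (
    "AQESERKLKD KQTKRKKVKN AAAAKVSVGH AKMKDGEAEY QCDIDGCTMS FGSKQELMHH "
    "KRNICPVKGC GKKFFSHKYL VQHRRVHEDE RPLKCPWKGC KMTFKWAWAR TEHIRVHTGA "
    "RPYVCAEPDC GQTFRFVSDF SRHKRKTGHS AKKNCQ"
).replace(" ", "")


def find_c2h2(seq, first_residue=1):
    """Return (start, end, motif) tuples in 1-based inclusive coordinates.

    finditer reports non-overlapping matches, scanning left to right.
    """
    hits = []
    for m in PS00028_REGEX.finditer(seq):
        start = first_residue + m.start()
        end = first_residue + m.end() - 1
        hits.append((start, end, m.group()))
    return hits


for start, end, motif in find_c2h2(FRAGMENT, FRAGMENT_START):
    print(start, end, motif)
```

Under these assumptions the scan reports 1025-1047, 1055-1077 and 1085-1109, which agrees with the PROSITE pattern rows. The first finger (SMART hit at 1000-1022) does not satisfy this regex, which is consistent with the absence of a PS00028 row for it.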
Gene Ontology
GO Term     GO Category         GO Description
GO:0003676  Molecular Function  nucleic acid binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 1117 aa
MHSSKGFVSD DLAFNRSHGI KQGKSFYSVK DKFSTLCERD RISSFDVNDN ISISSSNPLQ  60
RDTERETCQG DGLSDQRLFS CVTCGILSFS CVAIVQPREP AARYLVSADC SFFNDSVVGS  120
GISKNKFTIA REEAIIPEPN IYTGWMKKNV QDGIHDVPFQ SSQVALNMVS ENGNTALALL  180
ASAYGNSSDS EEDQIAVDSH ESNVINSASE SLLSYTRDSH ASPMTALDRG DYIPSKSSSY  240
EDFIHRRLEC FENTRTVANS TSNCSQDAHN AERSLSNNAM MVPFDNKKAS MVLQSDEDSS  300
RMHVFCLEHA AEAEQQLRPI GGANLLLLCH PDYPKIEAEA KMVAEDLGID YMWKNIEYSH  360
ASKEDEEKIQ SALDSEEAIP GNGDWAVKLG INLFYSANLS RSPLYSKQMP YNSVIYSAFG  420
CSSPASSPVE PKVYQRRVNR QKKIVAGKWC GKVWMSNQVH PLLAKRDFED IEDEKLLIGL  480
ILPDDKIERS ESTPKSEATS RKSGKKRKKT AENGRFRKGS YANKNLLSDN STEDKPNLLP  540
RSILRSKKVR HVERDCAALK GGYSPPYHHR KPSNNQTNFT ESYAVSDDSL DDDDHMQQRR  600
NVKIEKAKFM DNDVVSNDTM DNDSDWQQRE DVSSKQVEDT EGDAISEDSL DVGSLQLQRK  660
TSKGKHPKYI GEEDIISDDQ MESHFQKRQK RIPESRQGKY LTGKDIISDD QLELKMKKQQ  720
RRNPKSRQAK YLNEEDIASD DQLEGHYRRY QRKNPKGRQA TCVAGEDQMS DDQLENHCQK  780
QQTSFYRKRQ NKGIEREVKN EMSDDHLEDH FLKQQQRFPK SRRNKHTDKE DTDDLAENNS  840
HLLRRTPKRK QAKCMEDDDM NSDDEMEDDQ QLRRTLRSKQ AKPKTLQQMK QANSLQAKKQ  900
ASRPIKQCSR MLVKSKAPQQ IKQPSHLPNK QSNNTQEFSL DMEEEEEGGP STRLRKRATK  960
AQESERKLKD KQTKRKKVKN AAAAKVSVGH AKMKDGEAEY QCDIDGCTMS FGSKQELMHH  1020
KRNICPVKGC GKKFFSHKYL VQHRRVHEDE RPLKCPWKGC KMTFKWAWAR TEHIRVHTGA  1080
RPYVCAEPDC GQTFRFVSDF SRHKRKTGHS AKKNCQ*
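To use the sequence programmatically, the position counters and the grouping spaces have to be removed first. The following is a minimal sketch, assuming the block keeps the layout shown above (groups of ten residues, a running counter at the end of each full line, and a terminal `*`). The function name parse_sequence_block is hypothetical.

```python
import re


def parse_sequence_block(lines):
    """Join numbered, space-grouped sequence lines into one string."""
    parts = []
    for line in lines:
        # Drop position counters and whitespace; keep letters and the stop '*'.
        parts.append(re.sub(r"[^A-Za-z*]", "", line))
    return "".join(parts)


# Last three lines of the block above (residues 961 onward).
tail = [
    "AQESERKLKD KQTKRKKVKN AAAAKVSVGH AKMKDGEAEY QCDIDGCTMS FGSKQELMHH  1020",
    "KRNICPVKGC GKKFFSHKYL VQHRRVHEDE RPLKCPWKGC KMTFKWAWAR TEHIRVHTGA  1080",
    "RPYVCAEPDC GQTFRFVSDF SRHKRKTGHS AKKNCQ*",
]
seq_tail = parse_sequence_block(tail)
n_residues = len(seq_tail.rstrip("*"))  # amino acids only, stop symbol excluded
print(n_residues, seq_tail.endswith("*"))
```

The tail holds 156 amino acids, so the listing ends at residue 960 + 156 = 1116. The reported length of 1117 therefore appears to count the terminal `*` as well; this is an inference from the listing, not something the record states.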
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
6a57_A  1e-70   998          1114       21         137      Lysine-specific demethylase REF6
6a58_A  1e-70   998          1114       21         137      Lysine-specific demethylase REF6
6a59_A  1e-70   998          1114       21         137      Lysine-specific demethylase REF6
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    504    508  KKRKK
2    843    849  RRTPKRK
3    843    850  RRTPKRKQ
4    954    976  RKRATKAQESERKLKDKQTKRKK
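When the NLS coordinates are compared with the protein sequence, the Start and End values appear to be zero-based offsets, whereas the domain tables use 1-based residue numbers. This is an inference from this record rather than documented behaviour. The sketch below checks it for the first signal using residues 481 to 540 copied from the Sequence section; the helper name locate is hypothetical.

```python
FRAGMENT_START = 481  # 1-based position of the first residue in the fragment
fragment = (
    "ILPDDKIERS ESTPKSEATS RKSGKKRKKT AENGRFRKGS YANKNLLSDN STEDKPNLLP"
).replace(" ", "")


def locate(motif, seq, first_residue=1):
    """Return ((start0, end0), (start1, end1)) for the first occurrence.

    Both ranges are inclusive; the first is zero-based, the second 1-based.
    Returns None when the motif is absent.
    """
    i = seq.find(motif)
    if i < 0:
        return None
    start0 = (first_residue - 1) + i
    end0 = start0 + len(motif) - 1
    return (start0, end0), (start0 + 1, end0 + 1)


zero_based, one_based = locate("KKRKK", fragment, FRAGMENT_START)
print(zero_based, one_based)
```

For KKRKK the zero-based range is 504-508, matching the table, while the 1-based residue numbers are 505-509. Add 1 to the NLS coordinates before comparing them with the Signature Domain or Protein Features positions.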
Expression -- UniGene
UniGene ID  E-value  Expressed in
Gma.15513   0.0      hypocotyl | pod
Cis-element
Source       Link
PlantRegMap  Glyma.04G192000.4.p
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AC235340  0.0      AC235340.1 Glycine max strain Williams 82 clone GM_WBb0086A04, complete sequence.
Annotation -- Protein
Source  Hit ID           E-value  Description
Refseq  XP_006578680.1   0.0      lysine-specific demethylase JMJ705
TrEMBL  A0A0R0KH93       0.0      A0A0R0KH93_SOYBN; Uncharacterized protein
STRING  GLYMA04G36630.1  0.0      (Glycine max)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G48430.1  1e-108   relative of early flowering 6