PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Glyma.04G192000.1.p
Common Name GLYMA_04G192000, LOC100775813
Organism Glycine max
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family C2H2
Protein Properties Length: 1573 aa    MW: 177616 Da    pI: 8.6096
Description C2H2 family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Glyma.04G192000.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-C2H2  12.7   0.00038  1456   1481  1          23
                           EEET..TTTEEESSHHHHHHHHHH.T CS
              zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                           y+C    C++sF +k +L++H r+ +
  Glyma.04G192000.1.p 1456 YQCDidGCTMSFGSKQELMHHKRNiC 1481
                           99********************9877 PP

2    zf-C2H2  12.8   0.00035  1481   1503  3          23
                           ET..TTTEEESSHHHHHHHHHHT CS
              zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                           Cp   Cgk F ++ +L++H r+H
  Glyma.04G192000.1.p 1481 CPvkGCGKKFFSHKYLVQHRRVH 1503
                           9999*****************99 PP

3    zf-C2H2  13.4   0.00023  1539   1565  1          23
                           EEET..TTTEEESSHHHHHHHHHH..T CS
              zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                           y+C   dCg++F+  s++ rH r+  H
  Glyma.04G192000.1.p 1539 YVCAepDCGQTFRFVSDFSRHKRKtgH 1565
                           899999****************99666 PP

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End   InterPro ID  Description
SMART            SM00545            3.2E-15   20     61    IPR003349    JmjN domain
PROSITE profile  PS51183            13.744    21     62    IPR003349    JmjN domain
Pfam             PF02375            1.0E-13   22     55    IPR003349    JmjN domain
SuperFamily      SSF51197           5.22E-27  107    123   No hit       No description
PROSITE profile  PS51184            34.116    175    351   IPR003347    JmjC domain
SuperFamily      SSF51197           5.22E-27  179    369   No hit       No description
SMART            SM00558            1.8E-49   182    351   IPR003347    JmjC domain
Pfam             PF02373            2.1E-35   215    334   IPR003347    JmjC domain
Gene3D           G3DSA:3.30.160.60  2.8E-4    1445   1478  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
SMART            SM00355            7.3       1456   1478  IPR015880    Zinc finger, C2H2-like
SMART            SM00355            0.0045    1479   1503  IPR015880    Zinc finger, C2H2-like
PROSITE profile  PS50157            12.944    1479   1508  IPR007087    Zinc finger, C2H2
Gene3D           G3DSA:3.30.160.60  1.8E-6    1480   1507  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE pattern  PS00028            0         1481   1503  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           4.18E-9   1495   1537  No hit       No description
Gene3D           G3DSA:3.30.160.60  1.2E-8    1508   1533  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
SMART            SM00355            0.0014    1509   1533  IPR015880    Zinc finger, C2H2-like
PROSITE profile  PS50157            10.741    1509   1538  IPR007087    Zinc finger, C2H2
PROSITE pattern  PS00028            0         1511   1533  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           8.56E-8   1527   1561  No hit       No description
Gene3D           G3DSA:3.30.160.60  7.6E-9    1534   1562  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
SMART            SM00355            1.1       1539   1565  IPR015880    Zinc finger, C2H2-like
PROSITE profile  PS50157            11.697    1539   1570  IPR007087    Zinc finger, C2H2
PROSITE pattern  PS00028            0         1541   1565  IPR007087    Zinc finger, C2H2
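The three PROSITE pattern rows above (PS00028 at 1481-1503, 1511-1533 and 1541-1565) can be cross-checked against the protein sequence with a regular expression. The sketch below is illustrative only. The consensus C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H is assumed to be the PS00028 pattern and is not stated in this record, and the names PS00028, FRAGMENT, FRAGMENT_START and scan_c2h2 are hypothetical. FRAGMENT holds the final 132 residues of the sequence listed in this record (positions 1441 to 1572).

```python
import re

# Assumption: PS00028 consensus C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H,
# transcribed here as a Python regular expression.
PS00028 = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

# Last rows of the protein sequence in this record; the first residue of the
# fragment sits at position 1441 according to the row counters.
FRAGMENT_START = 1441
FRAGMENT = (
    "KVSVGHAKMKDGEAEYQCDIDGCTMSFGSKQELMHHKRNICPVKGCGKKFFSHKYLVQHR"
    "RVHEDERPLKCPWKGCKMTFKWAWARTEHIRVHTGARPYVCAEPDCGQTFRFVSDFSRHK"
    "RKTGHSAKKNCQ"
)


def scan_c2h2(fragment, first_position):
    """Return (start, end, matched text) with 1-based inclusive coordinates."""
    hits = []
    for match in PS00028.finditer(fragment):
        start = first_position + match.start()
        end = first_position + match.end() - 1
        hits.append((start, end, match.group()))
    return hits


hits = scan_c2h2(FRAGMENT, FRAGMENT_START)
for start, end, text in hits:
    print(start, end, text)
```

Under these assumptions the scan reports 1481-1503, 1511-1533 and 1541-1565, the same spans as the PS00028 rows. The first HMM-detected finger (1456-1481) is not reported, because that segment ends in a cysteine rather than a second histidine.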
Gene Ontology
GO Term     GO Category         GO Description
GO:0003676  Molecular Function  nucleic acid binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 1573 aa
MVGVCEGNGE VLAWLKSMPV APEYRPSAAE FQDPIGYIFK IEKEASKYGI CKIIPPFPPS  60
SRKTAIANLN RSLAEAGSTF TTRQQQIGFC PRRPRPVQRP VWQSGDRYTF SEFESKAKSF  120
EKTYLKRHSK KGSGSGSGLG PLETETLFWK ATLDKPFSVE YANDMPGSAF SPKCRRTGDP  180
SSLADTPWNM RAVSRAKGSL LQFMKEEIPG VTSPMVYVAM LFSWFAWHVE DHDLHSLNYL  240
HMGAGKTWYG IPRDAAVAFE EVVRVHGYGG EINPLVTFAI LGEKTTVMSP EVFISAGVPC  300
CRLVQNAGEF VVTFPRAYHT GFSHGFNCGE AANIATPEWL RFAKDAAIRR ASLNYPPMVS  360
HFQLLYDLAL ALCSRIPAGI SAEPRSSRLK DKKKGEGETV IKELFVQDVL QNNDLLHFLG  420
QGSAVVLLPR SSVDISVCSK LRVGSQQSIN VSNSEGMHSS KGFVSDDLAF NRSHGIKQGK  480
SFYSVKDKFS TLCERDRISS FDVNDNISIS SSNPLQRDTE RETCQGDGLS DQRLFSCVTC  540
GILSFSCVAI VQPREPAARY LVSADCSFFN DSVVGSGISK NKFTIAREEA IIPEPNIYTG  600
WMKKNVQDGI HDVPFQSSQV ALNMVSENGN TALALLASAY GNSSDSEEDQ IAVDSHESNV  660
INSASESLLS YTRDSHASPM TALDRGDYIP SKSSSYEDFI HRRLECFENT RTVANSTSNC  720
SQDAHNAERS LSNNAMMVPF DNKKASMVLQ SDEDSSRMHV FCLEHAAEAE QQLRPIGGAN  780
LLLLCHPDYP KIEAEAKMVA EDLGIDYMWK NIEYSHASKE DEEKIQSALD SEEAIPGNGD  840
WAVKLGINLF YSANLSRSPL YSKQMPYNSV IYSAFGCSSP ASSPVEPKVY QRRVNRQKKI  900
VAGKWCGKVW MSNQVHPLLA KRDFEDIEDE KLLIGLILPD DKIERSESTP KSEATSRKSG  960
KKRKKTAENG RFRKGSYANK NLLSDNSTED KPNLLPRSIL RSKKVRHVER DCAALKGGYS  1020
PPYHHRKPSN NQTNFTESYA VSDDSLDDDD HMQQRRNVKI EKAKFMDNDV VSNDTMDNDS  1080
DWQQREDVSS KQVEDTEGDA ISEDSLDVGS LQLQRKTSKG KHPKYIGEED IISDDQMESH  1140
FQKRQKRIPE SRQGKYLTGK DIISDDQLEL KMKKQQRRNP KSRQAKYLNE EDIASDDQLE  1200
GHYRRYQRKN PKGRQATCVA GEDQMSDDQL ENHCQKQQTS FYRKRQNKGI EREVKNEMSD  1260
DHLEDHFLKQ QQRFPKSRRN KHTDKEDTDD LAENNSHLLR RTPKRKQAKC MEDDDMNSDD  1320
EMEDDQQLRR TLRSKQAKPK TLQQMKQANS LQAKKQASRP IKQCSRMLVK SKAPQQIKQP  1380
SHLPNKQSNN TQEFSLDMEE EEEGGPSTRL RKRATKAQES ERKLKDKQTK RKKVKNAAAA  1440
KVSVGHAKMK DGEAEYQCDI DGCTMSFGSK QELMHHKRNI CPVKGCGKKF FSHKYLVQHR  1500
RVHEDERPLK CPWKGCKMTF KWAWARTEHI RVHTGARPYV CAEPDCGQTF RFVSDFSRHK  1560
RKTGHSAKKN CQ*
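The listing above uses a blocked layout: groups of ten residues, a running position counter at the end of each full row, and a trailing "*". A minimal sketch of turning that layout back into a plain string follows. The function name parse_blocked_sequence and its first_position parameter are hypothetical, and the sample is limited to the last three rows of the listing.

```python
def parse_blocked_sequence(lines, first_position=1):
    """Join blocked residue rows and verify the trailing position counters.

    Returns (sequence, position of the last residue). A trailing '*' is
    treated as a terminator and is not counted as a residue.
    """
    residues = ""
    for line in lines:
        tokens = line.split()
        counter = None
        if tokens and tokens[-1].isdigit():
            counter = int(tokens.pop())
        residues += "".join(tokens)
        if counter is not None:
            last = first_position + len(residues.rstrip("*")) - 1
            if last != counter:
                raise ValueError(f"row counter {counter} does not match {last}")
    residues = residues.rstrip("*")
    return residues, first_position + len(residues) - 1


SAMPLE_ROWS = [
    "KVSVGHAKMK DGEAEYQCDI DGCTMSFGSK QELMHHKRNI CPVKGCGKKF FSHKYLVQHR  1500",
    "RVHEDERPLK CPWKGCKMTF KWAWARTEHI RVHTGARPYV CAEPDCGQTF RFVSDFSRHK  1560",
    "RKTGHSAKKN CQ*",
]

sequence, last_position = parse_blocked_sequence(SAMPLE_ROWS, first_position=1441)
print(len(sequence), last_position)
```

For the three sample rows this yields 132 residues ending at position 1572. If the row counters are taken at face value, the reported length of 1573 appears to count the terminal "*" as well; this is an inference from the listing, not something the record states.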
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
6ip0_A  3e-80   12           372        6          353      Transcription factor jumonji (Jmj) family protein
6ip4_A  3e-80   12           372        6          353      Arabidopsis JMJ13
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    960    964   KKRKK
2    1299   1305  RRTPKRK
3    1299   1306  RRTPKRKQ
4    1410   1432  RKRATKAQESERKLKDKQTKRKK
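The Start and End values above can be compared with the position counters of the protein sequence. The sketch below copies three rows from the sequence listing, keyed by the 1-based position of their first residue, and searches for each NLS sequence in them. The names ROWS, NLS_TABLE and locate are hypothetical.

```python
# Rows copied from the protein sequence listing, keyed by the 1-based
# position of their first residue (derived from the row counters).
ROWS = {
    961: "KKRKKTAENGRFRKGSYANKNLLSDNSTEDKPNLLPRSILRSKKVRHVERDCAALKGGYS",
    1261: "DHLEDHFLKQQQRFPKSRRNKHTDKEDTDDLAENNSHLLRRTPKRKQAKCMEDDDMNSDD",
    1381: "SHLPNKQSNNTQEFSLDMEEEEEGGPSTRLRKRATKAQESERKLKDKQTKRKKVKNAAAA",
}

# (Start, End, Sequence) exactly as given in the NLS table.
NLS_TABLE = [
    (960, 964, "KKRKK"),
    (1299, 1305, "RRTPKRK"),
    (1299, 1306, "RRTPKRKQ"),
    (1410, 1432, "RKRATKAQESERKLKDKQTKRKK"),
]


def locate(motif):
    """Return the 1-based inclusive (start, end) of motif in ROWS, or None."""
    for first_position, row in ROWS.items():
        index = row.find(motif)
        if index >= 0:
            start = first_position + index
            return start, start + len(motif) - 1
    return None


offsets = []
for table_start, table_end, motif in NLS_TABLE:
    start, end = locate(motif)
    offsets.append((start - table_start, end - table_end))
    print(motif, "table:", table_start, table_end, "sequence:", start, end)
```

Each motif is found exactly one position to the right of the tabulated coordinates (for example KKRKK at 961-965 against 960-964). That would be consistent with the NLS table counting from zero, although the record does not say so.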
Expression -- UniGene
UniGene ID  E-value  Expressed in
Gma.36465   0.0      leaf, somatic embryo
Cis-element
Source       Link
PlantRegMap  Glyma.04G192000.1.p
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AC235340  0.0      AC235340.1 Glycine max strain Williams 82 clone GM_WBb0086A04, complete sequence.
Annotation -- Protein
Source  Hit ID           E-value  Description
Refseq  XP_006578680.1   0.0      lysine-specific demethylase JMJ705
TrEMBL  I1JXF5           0.0      I1JXF5_SOYBN; Uncharacterized protein
STRING  GLYMA04G36630.1  0.0      (Glycine max)
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Fabids                OGEF5648              32           50
Representative plant  OGRP4401              12           17
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G48430.1  0.0      relative of early flowering 6