PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.14G079100.2.p
Common NameGLYMA_14G079100, LOC100813851
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family B3
Protein Properties Length: 382aa    MW: 42368.6 Da    PI: 8.756
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.14G079100.2.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B339.87.9e-132494697
                         EEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE CS
                   B3 46 WevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvf 97
                         W++ +  + ++++ +++ GW++Fvk++ Lke+Df+vFk++g+s+f   v +f
  Glyma.14G079100.2.p  2 WNIGM--TTRDDTLYFGHGWEQFVKDHCLKENDFLVFKYNGESQF--DVLIF 49
                         99999..999999***************************99999..77666 PP

2B335.12.3e-11245325487
                          E-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS CS
                   B3   4 vltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr 87 
                          v++p +v k+  +++  + + +h ++  +s++++l+  +  +W  +++y++ ++   lt+GWk+F  + +L+egD +vFk  g+
  Glyma.14G079100.2.p 245 VMKPTHVYKRFFVSIRGTWIGKHISP--SSQDVILRMGK-GEWIARYSYNNIRNNGGLTGGWKHFSLDSNLEEGDACVFKPAGQ 325
                          67888999999999999999888655..67789988855.58*******99999999***********************7665 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5086313.126152IPR003340B3 DNA binding domain
PfamPF023624.2E-9248IPR003340B3 DNA binding domain
CDDcd100174.02E-12250No hitNo description
Gene3DG3DSA:2.40.330.101.5E-13250IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.06E-12250IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.16E-11242327IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.101.9E-14242326IPR015300DNA-binding pseudobarrel domain
PfamPF023623.4E-9245325IPR003340B3 DNA binding domain
CDDcd100176.35E-12245338No hitNo description
PROSITE profilePS508638.839283340IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0005773Cellular Componentvacuole
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 382 aa     Download sequence    Send to blast
MWNIGMTTRD DTLYFGHGWE QFVKDHCLKE NDFLVFKYNG ESQFDVLIFN GWSLCEKAGS  60
YFVRKCGHTE IDHAGGSLNK KRDTDNDSLE EGNIPSNAGV ECALHEKSAH VNGTKEPIDV  120
PPETPPTENT FNAGVESSGV EQFTPDGGVT LAAVPSETAN GKRIRNIVSA VKHVHTKRKG  180
RPAKWHVRER TLDWVAALEA EPVSASRSGT YEVYKSNRRP VTDDETRKIE SLAKAACTDD  240
SIYVVMKPTH VYKRFFVSIR GTWIGKHISP SSQDVILRMG KGEWIARYSY NNIRNNGGLT  300
GGWKHFSLDS NLEEGDACVF KPAGQINNTF VIDMSIFRVV PETVPLTPMS RGTRTGTRTG  360
TGRRGRKPAT MKSIQTQLSS P*
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Gma.18790.0pod| somatic embryo
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00468DAPTransfer from AT4G33280Download
Motif logo
Cis-element ? help Back to Top
SourceLink
PlantRegMapGlyma.14G079100.2.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankBT0982360.0BT098236.1 Soybean clone JCVI-FLGm-23C7 unknown mRNA.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006595619.10.0uncharacterized protein LOC100813851 isoform X2
RefseqXP_006595620.10.0uncharacterized protein LOC100813851 isoform X3
RefseqXP_028199088.10.0B3 domain-containing protein REM16-like isoform X2
TrEMBLA0A0R0GHI10.0A0A0R0GHI1_SOYBN; Uncharacterized protein
TrEMBLA0A0R0GHZ70.0A0A0R0GHZ7_SOYBN; Uncharacterized protein
STRINGGLYMA14G08630.20.0(Glycine max)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G33280.15e-23B3 family protein