PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.20G224000.2.p
Common NameGLYMA_20G224000, LOC100804367
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family Trihelix
Protein Properties Length: 644aa    MW: 74256.9 Da    PI: 6.6939
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.20G224000.2.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix61.52e-19150229286
             trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          W ++evlaL+++r++me+++ +       We+vs+k++e g++rs+++Ckek+e+ ++++ +i+ g++++++++ss+++++++le
  Glyma.20G224000.2.p 150 WNNDEVLALLRIRSSMESWFPEL-----TWEHVSRKLAELGYKRSAEKCKEKFEEESRYFNNINYGKNNNNNNNSSNYRFLSELE 229
                          ********************987.....9*****************************************************998 PP

2trihelix95.93.7e-30481569186
             trihelix   1 rWtkqevlaLiearr....emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          rW+k+evlaLi++r     + +++ ++g+ k plWe++s+ m e g++rs+k+Ckekwen+nk+++k+k+ +kkr s +s+tcpyf+ql 
  Glyma.20G224000.2.p 481 RWPKDEVLALINLRCtsvnNNNNEEKEGNNKVPLWERISQGMLELGYKRSAKRCKEKWENINKYFRKTKDVNKKR-SLDSRTCPYFHQLS 569
                          8************9843334444444689*********************************************8.9999********96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138374.6E-12148230No hitNo description
PROSITE profilePS500906.121149201IPR017877Myb-like domain
PfamPF138371.2E-19480570No hitNo description
CDDcd122034.45E-21480549No hitNo description
PROSITE profilePS500906.284481542IPR017877Myb-like domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0045893Biological Processpositive regulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0001158Molecular Functionenhancer sequence-specific DNA binding
GO:0005516Molecular Functioncalmodulin binding
Sequence ? help Back to Top
Protein Sequence    Length: 644 aa     Download sequence    Send to blast
MCERERQKSE RERGWCVVCK LLACSILLIL IYSEHRKEEV KKREVVTQNT NTNMFDGVPD  60
QFHQFITPRT SLPLHLPFPL HTSGGTPNTT TFPSNFDPYN HPHQLPLQPN NLLHPLHHKD  120
EDKEENTTVP MNLEIQRDQR QQLPELIDPW NNDEVLALLR IRSSMESWFP ELTWEHVSRK  180
LAELGYKRSA EKCKEKFEEE SRYFNNINYG KNNNNNNNSS NYRFLSELEQ LYHQGGSGDH  240
HLENTTQPPL QKQDKMGHHA LELEVEGDSR NVVDALVTKQ NEQSDEALAV EKITKDRKRK  300
RPDRFEMFKC FCESIVHKIM AQQEEMHNKL LEDMMKRDDE KFTREEAWKK QEIEKMNKEL  360
EMMAREQAIA GDRQANIIQI LNKFSATSSP ASHTLKKVNN DSNINTHITQ NPNPSQTENP  420
TLSVAQDTLQ VIPSTSSTST PALPQNPSTY SLNIQNNNNN IPVETNSVLN KGNEKDDVGR  480
RWPKDEVLAL INLRCTSVNN NNNEEKEGNN KVPLWERISQ GMLELGYKRS AKRCKEKWEN  540
INKYFRKTKD VNKKRSLDSR TCPYFHQLSS LYNQGKPVLQ SESHLNSPPN QNPEQVTPDQ  600
TTQAHESSSQ VGSGGGFSVQ QQQVDHGGEK TLMQVPSLDF DQF*
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Gma.418930.0flower
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00011PBMTransfer from AT5G28300Download
Motif logo
Cis-element ? help Back to Top
SourceLink
PlantRegMapGlyma.20G224000.2.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankEF2217540.0EF221754.1 Glycine max trihelix transcription factor (GT-2B) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_003556463.20.0trihelix transcription factor GTL2
TrEMBLI1NIP30.0I1NIP3_SOYBN; Uncharacterized protein
STRINGGLYMA20G36680.10.0(Glycine max)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G28300.15e-95Trihelix family protein