PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OPUNC02G00050.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Oryzoideae; Oryzeae; Oryzinae; Oryza
Family Trihelix
Protein Properties Length: 708aa    MW: 75066 Da    PI: 6.1349
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OPUNC02G00050.1genomeOGEView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix100.61.2e-31524608186
         trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                      rW+k+ev aLi++r+ +e+r+++  lk+plWeevs++m++ g++r++k+Ckekwen+nk+++k ke+ kkr + + +tcpyfd+l+
  OPUNC02G00050.1 524 RWPKHEVEALIRVRTGLEDRFQEPGLKGPLWEEVSARMAAAGYRRNAKRCKEKWENINKYFRKAKESGKKR-PAHAKTCPYFDELD 608
                      8********************************************************************98.99999*******98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500906.992517581IPR017877Myb-like domain
PfamPF138373.6E-23523610No hitNo description
CDDcd122031.89E-30523588No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 708 aa     Download sequence    Send to blast
MQSGYGGVSE FQQYIMDPGA FAMSAPPQQA QPASAAAAAA GGQELGAPFR YQPLHHHALP  60
QHHHHHHHPP PQMPPHFAHF GGGVPAASAG GIPFTQQLLH QAAAAGHHHH LQLFHEQHHH  120
QKHQQPPPPA RWAPQHHHHP HHHLGLDVEA AVPESSGAGA GSTASGANAP PGVPPFLAAA  180
MSFKLGVDGG GGSGATDDAL NDGGGAGSGM MLHGGGGGGG GDDEAATESR LRRWPGDEEA  240
SIKEPTWRPL DIDYIHSSSS KRAAGKDKPA TPESPAPPPP ANYFKNKADD NAAAASASAG  300
AVNYKLFSEL EAIYKPGSGG AQTGSGSGLT GDDNAMLAPP LADLPDAGAA DPPQLNTSET  360
SAGEDAHAVV QPQPQPQPSG ADAARRKRKR RRQEQLSASA SFFERLVQRL MEHQESLHRQ  420
FLDTMERRER ERAARDEAWR RQEADKFARE AAARAQDRAS AAARESAIIA YLEKISGETI  480
TLPPPAANPA PGAEDMSQDG VGKELVAYDG EGAERGALQL SSSRWPKHEV EALIRVRTGL  540
EDRFQEPGLK GPLWEEVSAR MAAAGYRRNA KRCKEKWENI NKYFRKAKES GKKRPAHAKT  600
CPYFDELDRL YSRSGGGGSS SAGGNGGAGE EAKGSSELLD AVVKYPDVRC GPPGFPFDGE  660
QNEEGRKEDG DEAHDDGDGE GDEEDVGVGV GRATGDREDP VDESHDDH
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1384389RRKRKR
2385390RKRKRR
3385391RKRKRRR
4386391KRKRRR
5387391RKRRR
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00651PBMTransfer from LOC_Os02g01380Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankCP0126100.0CP012610.1 Oryza sativa Indica Group cultivar RP Bio-226 chromosome 2 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_025879174.10.0trihelix transcription factor GTL1
TrEMBLA0A0E0JUF00.0A0A0E0JUF0_ORYPU; Uncharacterized protein
STRINGOPUNC02G00050.10.0(Oryza punctata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP63993547
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.29e-38Trihelix family protein