PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OPUNC05G01900.2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Oryzoideae; Oryzeae; Oryzinae; Oryza
Family MYB_related
Protein Properties Length: 1857aa    MW: 200865 Da    PI: 8.0492
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OPUNC05G01900.2genomeOGEView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding23.11.7e-0710021044347
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47  
                       +W +eE e++ +  + +G++ +++I++ ++ ++t  +c+++++k+
  OPUNC05G01900.2 1002 PWIQEEKEIFMEKLATFGKD-FSKISSFLQ-HKTTADCIEFYYKH 1044
                       899***************99.*********.************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466893.28E-139851047IPR009057Homeodomain-like
PROSITE profilePS5129314.5479981049IPR017884SANT domain
SMARTSM007174.8E-79991047IPR001005SANT/Myb domain
PfamPF002491.1E-510021044IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.601.8E-510021046IPR009057Homeodomain-like
PROSITE profilePS5129313.72812061257IPR017884SANT domain
SMARTSM007176.6E-412071255IPR001005SANT/Myb domain
SuperFamilySSF466891.79E-812111257IPR009057Homeodomain-like
CDDcd001670.0014412111249No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0016021Cellular Componentintegral component of membrane
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1857 aa     Download sequence    Send to blast
MWEEPDRART WASTAGASRP ATRRRRRGRG FGVGGEVVVV VVFFLGLPVR GSASSERSWP  60
LIVGAGAASS VFLPLLSLPH LSPLLLSFLL LPCAAPASVS ASTAAVLPAR RSARLRRRCR  120
LADVGFSAPA PLPASTPVAS PPPPPLVRVA VKGGLDEYVS GGGGDRRPAT TDKRRRRRRR  180
RLCWGRREGG GGAAELASSP RLCDLRRRPE LGIRAGWTTT TTTTTTPWMP PPPPDRRDFL  240
YRDGRRHDGD PLPPPAPTPP RWRDSPYHPP PPPPPLRDHS RPSPRRTPSS ASSDGYYRQG  300
GGSYDRSYPD ESLGYTPSRS DRYWLDDDGG GGGYKGFSRY GGGGGGGSRR DGRDIRGSYR  360
RSPFRGYGSD FSRNHQEHPP PPLRRSPLRS VAVPMSYDPP GDRADRGDRD HHHRVTPWRL  420
LRRRESRSDA ADAAGAGPVP VGQAATVAAS EKDVSARSSA VAAPQVSHEE APRKKPRLGW  480
GQGLAKYEKQ KVQGPAESAE AVAEGSPTAT EQKGVTHTPA PAPCASPVAA PSPAPCASPV  540
AVPSPAPCAS PVVAPSPAPP CKSPVPEDKS CELTANTVTE SNKNIPGPDV QACNNEVPTK  600
LDQLEGDPID SLAKVLSELV QHEDSCSGDS KRLSNVSKLL LLKESISKEL EKTELEIDSL  660
EGELKSVNAE ARNRTLKDPP SAVTYAQNPS PSPVKEQGEL TPSPKIPVEQ DAYVKGSDLM  720
EVETAQAHNA KAVSSEESVA CPGDAPGQVP AAADIIPSDP CGKTGSGIDV DIEQHEENPC  780
QDNFNAMKAD GSSDLTTRPC SYRNVKYNLM DQIIAANRSE ANKNSQLLFK PVPADRSNLD  840
LLASSYLSSQ MKNDVIIKKK HAILKNRQRF KEQILTFKFR VLRHLWKEDV RLLSVRKQRS  900
KSHKRTDQSN RASQSGSQRQ RSSNRSRLAV PAGNLSTFPI TEMSDVANKL FSEFQLKRCR  960
NYLKMPALII DEKEKACAKF VSKNGLVEDP VSVEKERALI NPWIQEEKEI FMEKLATFGK  1020
DFSKISSFLQ HKTTADCIEF YYKHHKSDSF REVKKLLDLR QQQQPASNYL GAVSGKKWNP  1080
EANAASLDML GVATEVAAQG LEYVNEVKKN SAKSILRTVC GADNSTKGSE DCVGDVSLHE  1140
KESVAADVLA GICGTLSPEG MGSCITSSAD PGQKIGIISR MEHLLTPEAD KNFDDDGTLS  1200
DQECEVDIVD WNDDEKSSFI EAMNRYGKDF ARISSYVKSK SFEQCKVFFS KARKSLGLDM  1260
IHQGAADAGF PTGDANGGRS GTDGACVAEM DSAICSAQSC PKMEIDACPV SGEIQGHNPL  1320
SDIASRQPEA DKSNEPDVVD INVKEGGSKA EKDCSILVDH KQLREDTHQT SYARIDINCP  1380
ESTDKLQDIE DVTPVDMHGD DLMATSIEQP LVEQPVAAHV ETRSSLHSEG IGMDVSRIEG  1440
CSHESAIGKG GKSTPSVCLP ANGVSKENII HFSNMDGASS ISPAFTSNYQ QSKLADSIQS  1500
KPKPLTPKDL MPVQFSSSLP DPTSICFEGI AAITTPNFED HGNRASIASG AKDVNMFQTF  1560
KDQCSNRHDA LFSNVDGYMQ QRRNSHFGTE VCGLSESTDI SQSDQFAVSK FQNGRSSSLG  1620
LSNGNLGVLS TGRREEAREG LFRPSSVKAS AGNEEQQKRP GDVKLFGQIL SHQSSLQSSG  1680
SSVHVSKSKP PSPKVDKSAS SRLLSNPRER LVYSSRPPSI VNLGLEERAM RSFDHMDGRT  1740
IQPEPMVMVA KCQRSSAGVP VYSTKNGALS VFAEFQQPSM PPHTSDHKLL ENFADLHKRN  1800
GIELLSGFQQ PGRLGGAGVL VSGVSDPVAA LKAQYGSGSK MLSSSNDVDT WKDIGSR
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C7e-169601051494NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D7e-169601051494NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1165181RRPATTDKRRRRRRRRL
2171181DKRRRRRRRRL
3172177KRRRRR
4172180KRRRRRRRR
5172181KRRRRRRRRL
6173180RRRRRRRR
7174180RRRRRRR
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAK0706530.0AK070653.1 Oryza sativa Japonica Group cDNA clone:J023055D14, full insert sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_015639126.10.0uncharacterized protein LOC9268524
RefseqXP_015639127.10.0uncharacterized protein LOC9268524
RefseqXP_015639128.10.0uncharacterized protein LOC9268524
RefseqXP_015639129.10.0uncharacterized protein LOC9268524
TrEMBLA0A0E0KY300.0A0A0E0KY30_ORYPU; Uncharacterized protein
STRINGOPUNC05G01900.20.0(Oryza punctata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP62993549
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.13e-78MYB family protein