PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Eucgr.G02815.1.p
Common NameEUGRSUZ_G02815, LOC104454203
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Myrtales; Myrtaceae; Myrtoideae; Eucalypteae; Eucalyptus
Family MYB
Protein Properties Length: 1760aa    MW: 191517 Da    PI: 5.9
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Eucgr.G02815.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding28.73.1e-09826867346
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
   Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                       +WT+eE e+++d  + +G++ +++Ia+ +  ++t  +c+ +++k
  Eucgr.G02815.1.p 826 PWTAEEREIFLDKLATFGKD-FSKIASFLD-HKTTADCVQFYYK 867
                       8*****************99.*********.***********98 PP

2Myb_DNA-binding254.6e-0810451085345
                        SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
   Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                         WT  E   + +av  +G++ +++I+r +g +R+  qc+ ++ 
  Eucgr.G02815.1.p 1045 DWTDREKSAFMRAVSSYGKD-FALISRFVG-TRSVDQCRVFFS 1085
                        5*****************99.*********.********8776 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.22E-15810871IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.1E-5820868IPR009057Homeodomain-like
PROSITE profilePS5129317.927822873IPR017884SANT domain
SMARTSM007171.1E-9823871IPR001005SANT/Myb domain
PfamPF002491.2E-6825867IPR001005SANT/Myb domain
PROSITE profilePS5129313.69610411092IPR017884SANT domain
SMARTSM007173.2E-710421090IPR001005SANT/Myb domain
SuperFamilySSF466893.28E-1010431093IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.607.1E-510441086IPR009057Homeodomain-like
PfamPF002496.0E-710451085IPR001005SANT/Myb domain
CDDcd001673.57E-610461084No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1760 aa     Download sequence    Send to blast
MPPEPLPWDR KDFFKERKHE RSESLGSVAR WRDSSSSSSH HAGSRGFTRW GSTDFRRPPG  60
HGKQGGWHLF PEESGYGYTP SRSADKLLED ESGRRSGFRG DGKYSRSYRD NRGSFGQRDW  120
KVSHSWEASN GSYTPGRPTD NNCQRPADVL TQTSSPHSDL TNPRDYHQLH AKDHHDKSSD  180
VKPIDAKTSD ADGLTSCQKP ESSLVAVEWK PLKWPRSGSL SSRGSSLSHS GSSKSMGGVD  240
SGETKVGIST KDVTSVPSPS QDGAACVTSA APSEEMTSKK KPRLGWGEGL AKYEKKKVGG  300
LDENAPKNVM EPAHSLGLSV ADKSPRLAAF SDCASPATPS SVACSSSPGM EDKSFLKGSH  360
GDTDLNNLCR SPYPCSQDQQ EGFSFNLETM DANSIANMGS SLGELLQLDD PSSMDSGFVR  420
SSALNKLFLL RGDVLKALEM TELEIDSLET ERKTLISQSG DHGPCTASSS SLAAEANAKP  480
GEHDEISNVV SRPSPLEIFS SGNMDKGGLP LGEGRSEEIH AANNAEVDSP GTATSKFAEP  540
FCLSKGAVTS DKAKESECAE DMDATQVAEV GRGECAEDMD APQVAEVGRG KLSLEISVEK  600
TDASPSRDKT VVLFCEEDGA NNSAEMFINS EMMLPAFIVA SNKETASKAC EVFDKLLPRD  660
QSSFDISRVA DVASREIDAL IINNFVMRKR ALRFKERAVT LKFRALHYRW KEDLHLLSLS  720
KVRAKSQKKN ESSLRVTLNA HQKNRSSIRS RFSSPVRNLS PVPSSNVLSY TSKLLSDPRM  780
KSYRKYLKMP ALILDDREKN ASRFISTNGL IEDPCAFEKE RTMINPWTAE EREIFLDKLA  840
TFGKDFSKIA SFLDHKTTAD CVQFYYKNHK SDTFERTMKK LDFGKQGKSL AANNYLVTSE  900
KRWSREVNAV SLDVLGAASA MAAQQDDIML ERQGCDVGVF VNGFCNSRSS RADCGDLERS  960
SSFDVLGSER ETVAADVLAG ICGSMSSEAM SSCVTSSVDP GEGGRELKCS KVDFVRKRPF  1020
TPEFMENVEE ETCSGESSGE MDASDWTDRE KSAFMRAVSS YGKDFALISR FVGTRSVDQC  1080
RVFFSKARKC LNLDSLLPIC RSRGTPVSED ANGDGDMEDA CGLEISSAIC GDKLSSKMDE  1140
DLVVKDVNQV ESDCKSLNFG DDLNKSDGNA GSGELSCEYV KDMEIAVPDV CQMRDGDEVV  1200
SKGVGSISAS VGTQKSIADS DVTRVEATVA VGASLVESSV REDTDHGSCF SASADGEHAI  1260
RACTEGFKNI TWGQEALLPQ KCNGDTTHPS SLLSSSGESD TQINSSRLPA DRSPCFGFSH  1320
RSESEHQVSL VLDPVEISSS NSRHDEKFQG ATRSLSSDSA PNYCKISRTQ DRMSSSDGPP  1380
TKEKHKQFVS NDDYYKHVSG SSLLSQMESP ILPGYPIQQL HAQENSGSTS GEFSDVQKHL  1440
KPENSITCRY VVPDFHLKKC TTSKSHSSVA ELPLLPHKSE QLTGLSKIDM PSLPDSQKQS  1500
KSGDFKLFGQ ILSHPPSQPT PKVSVQDNDE NGAPQRKFGS KESDLKPSSN HSEGGNSAML  1560
KFDCNNYASL ENVPLRTYGF WDGNRIQISS LPDSALLLAK YPAAFSNFPL PTTKLEQHAM  1620
QPVPKSNECG LNGVSVFPSR EVANGNGIVD YRLYRSRDDG KVQPFAIDVK QRQDACPELQ  1680
RYNGFEAVSA GIQHQGRGMI AMNVMGRGGM LGGGPCNGVS DPVTAIKMQF AQSDPLVGHQ  1740
NGNIVREEES WRGKGDVGR*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C9e-157938751294NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D9e-157938751294NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Cis-element ? help Back to Top
SourceLink
PlantRegMapEucgr.G02815.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010067289.10.0PREDICTED: uncharacterized protein LOC104454203
TrEMBLA0A059BI880.0A0A059BI88_EUCGR; Uncharacterized protein
STRINGXP_010067289.10.0(Eucalyptus grandis)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM52602744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-103MYB family protein