PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PK11574.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Cannabaceae; Cannabis
Family MYB
Protein Properties Length: 1817aa    MW: 199200 Da    PI: 5.4697
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PK11574.1genomeCCBRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding29.71.6e-09805846346
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT+eE e++ d  +  G++ +k+Ia+ +  ++t  +c+++++k
        PK11574.1 805 PWTPEEKEIFMDKLASCGKD-FKKIASFLD-HKTTADCVEFYYK 846
                      8*****************99.*********.***********98 PP

2Myb_DNA-binding331.4e-1010221062345
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                        WT eE  ++v+av+ +G + +++I++++   R++ qck ++ 
        PK11574.1 1022 DWTDEEKAIFVQAVTSYGRD-FAKISQCVR-SRSRDQCKVFFS 1062
                       5*****************77.*********.********8776 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.2E-13789850IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.604.1E-6798850IPR009057Homeodomain-like
PROSITE profilePS5129315.321801852IPR017884SANT domain
SMARTSM007171.5E-8802850IPR001005SANT/Myb domain
PfamPF002498.2E-7804846IPR001005SANT/Myb domain
CDDcd001677.90E-7805847No hitNo description
PROSITE profilePS5129310.86410181069IPR017884SANT domain
SMARTSM007178.4E-910191067IPR001005SANT/Myb domain
SuperFamilySSF466893.59E-1010201069IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.2E-610221063IPR009057Homeodomain-like
PfamPF002495.5E-810221062IPR001005SANT/Myb domain
CDDcd001675.52E-710231061No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1817 aa     Download sequence    Send to blast
MPPEPLPWDR KDFFRERKQE RSESLGSVAR WRDSSHHGSR ELNRWGSTDF RRPLGHGKQG  60
GWRFFSEVPG HGYASRSSEK VLEDDGFRPS NSRGEGKYGR SSRENRGSYN QREWRGHTWE  120
TNNGFPSTPG RVHDMNNEQK SREDLPPYSP YSNGSFGNTW DQAQFKDQHE KAGGSMGLGT  180
AQRCDRKHSL GINDWKPMKW TRSGSLSSRG SGFSHLSCSK TTGAVDSIDT KVESQMKNAT  240
PVQSPSADAN ACVASSAPSE EMTSRKKPRL GWGEGLAKYE KKKVEVPEVT FNRDGVFAVS  300
NAEPSHSLSS SLIDKSPRVT SFSDCASPAT PSSVACSSSP GVEEKSFGKA VSIDNDGSNL  360
CGSPGPVSLN HGPVSLNHSE GFSFNLEKMD SNSISTLGSS LVELLQSDYP SSVDSTYVRS  420
TAINKLLIWK AEISKTLEVT ETEIDSLENE LKSLKSITIG SSPSVSCSLP AEDRLISSEE  480
EVITHLVPRP ALLQIRSTDD AVAEELPISN SDKEEACANV KAEDVDSPGT VTSKFVEPLS  540
LAKAVSSSDL LNHATGNSDG IPSTNQDVQH SVPGSGGEET VPDTYEDCSM LTEGEITAPI  600
IDTLGSCTDG EVKLQSVILS SNKELAKGAH DVFNKLLPQN EFSLDSSEGL NASSQQDYTL  660
VKEKFLMRKQ FKRFKERVIT LKFRAFQSLW KEDMRLLSIR KHRAKSQKKL ELSLRSVHNG  720
YQKHRSSIRS RFSSPGNLSL VPTAEVVNFT SKLLLDSQVK LHRNNLKMPA LILDEREKIV  780
SRFISNNGLV EDPCVVEKER AMINPWTPEE KEIFMDKLAS CGKDFKKIAS FLDHKTTADC  840
VEFYYKNHKS DSFEKKKKLD CGKQVKSLAN ATYLKLNKKW NREMNAASLD ILDVASVMAA  900
NADDCMRNQK ACSGRLIFGG LSESKASRGD YGTDERSSSI DIVGNDRETF AADILAGLCG  960
SLSSEAMSSC ITSSVDPVEG YREWRSQKVD SVVRRPLTPD VTQYVDDGTC SDESCGEMDP  1020
TDWTDEEKAI FVQAVTSYGR DFAKISQCVR SRSRDQCKVF FSKARKCLGL DLIHPGPENE  1080
RTFTGDDANG SGSGSENVCA REMGSGICSD KSGSKMDEDL PLSAVKMKND ETDPAENFTS  1140
LTAQSRSEGK NEREQLNSKR NIDASESQLS DSCPTQSRPN VVSDGDSKRN IDVQSRLEEK  1200
NGREQLSSKR NIGASETQES RPNVVSDGDS KRNIDASESQ ESRPNVVSDG DSIITQGVGE  1260
AKVIPIQETL PVLSSIDTGR DDGDEQGTSV AELESVSEGN ENDKLFNTES VVGKKPVDEV  1320
SSDELANLMD RLDEKCNAST SGQSGLDSVV QDSNSTGNAS HMTSDRNSCS GFSLNPDYQR  1380
QVSIELNSKE KSCVIPLAQE IPLVSANSIS LNSGAIRCEK NGNEDKMSSS LDFQESRDVC  1440
HISVCKDESN AHVTGLPILT DAQSSQVLRA CPVRMNVKVE VNGDVRCRNS SEVQGLSSSE  1500
TSSNASLLQN CYLQRCSNVN PSCSTTQFPL MSQNTEQASD RRKSRSQSLS DSEKPSRIGG  1560
DVKLFGKILT APSLPKADSN YRDNEENEGS NNHKLSNQSN MKFANLHNSD GNSTLLNFDH  1620
KNYLGIENVQ MRNYSYWDGN RLQATFPSVP DSAILLAKYP AAFSNFSAPS STMEQQSLQS  1680
VVNSNELDVN GVSAYPTREI SNSKGMVDYQ VYRSREAAKV QPFTVDVNQR QDMFSEVQRR  1740
NGIETISSFQ HQGRGMVGIN VLGRGGIVVG GGCTTGVTDP VVALKRHFAK TEQQYGGQSS  1800
SIIREDESWR GNGDMSR
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C5e-15768854994NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D5e-15768854994NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_024023461.10.0uncharacterized protein LOC21396603 isoform X2
TrEMBLA0A2P5CPT90.0A0A2P5CPT9_TREOI; Octamer-binding transcription factor
STRINGXP_010099638.10.0(Morus notabilis)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF49863352
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-167MYB family protein