PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_023889168.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family MYB
Protein Properties Length: 2095 aa    MW: 229922 Da    pI: 10.255
Description MYB family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
XP_023889168.1   genome  NCBI    View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  32.9   1.5e-10  1042   1084  3          47
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47  
                        +T+ E ++++ a+k+ +++ W  Ia+ ++ gRt k+c++++++ 
   XP_023889168.1 1042 TFTENERQIFIAAFKETPKK-WGEIASLLP-GRTYKDCIHHYYTN 1084
                       79******************.*********.***********986 PP

2    Myb_DNA-binding  24.8   5.1e-08  1297   1336  3          44
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrw 44  
                        W+  E+  ++++++ +G++ +++Ia++mg ++t  ++k+++
   XP_023889168.1 1297 YWSVPEQTDFAKYIAHFGTD-FASIAAHMG-TKTQTMIKNHY 1336
                       59999***************.*********.**********9 PP

Sequence
Protein Sequence    Length: 2095 aa
MSSRYPPSDY RVPRDRSRSP SRFDRRASTA NAFENRPQPP PRISSDAPRG PRTQFDAPRP  60
TPGAGPSPTP SARPTFSSLR DAPPLSSGDR GRPFRERDDY RRDRPPSPRE RSPVRPFKAS  120
RDYPPRDADV VRPRRGSRDG PLSAGATYPD NPPFASTSFS RGGFTGRGRG RVDFEPSRGR  180
GARRNLDDRD LFRRERSPPP RWGARDLSRD GREPERRDER RFDRRDEDRR SEWAERERDS  240
VDRGRREQPP LRPEPRQLNG SPHSATGSHG GSQAPPIDPG RLALIEGSGA DVAARKQSVS  300
QNAAAAPPRR EQPETPSYLN GRADATANRY NQRGSSPPTQ APPVPAFTLS FAPTGPAVAT  360
TSGTSATKPS QPKPPAPSDD IPVQDHTKVE RRVEPPVDAP IAPKAQARAP LAPKAQLAEP  420
PPTAPRAPRA LELETSSGPH GRLQGVRSLE SLAGSRAPPS GPSFPRASVP PRSSSIAQTT  480
PSSASSLSTL TQPVSPMVQQ TREFAAPTGP RASRISPALA SVSPRPAFTS PRSETTGVQG  540
YAGGPRAQTP PLPSGPRKRS ISVSPKVSTS TVPTAPKADR VPPTAPRAGP IAPGRTTDRP  600
NGPPLWGVPS APRNLQWNQW KRPPPHAESK IVPAKRDSTG EERDRQAEIG SDDPTRSATG  660
ASPKVQHGSR QARTQDEDRM QIDNETHPPA HSAKQSFFGQ PMETAEDDEA SDVAGEEEEE  720
EIMSSSEDEG LDVEDEALFN AKFEKRKREL EAKLVDLSAR EYRATTPLEC IARLARISFS  780
DLQRVNEHQD MDVDDTSAKG HRLRPPPTQN SDSDGNADAI TPKHDEDARV QIRHGDDSPE  840
HIRYVRRPSP EIINLPYLTK ASQPLFKSEP LLESSRDEES TQVAVLDALR DGMEAIADAQ  900
ADCERDFLER LRTWRAQCAR LDREQEEHEK VERQKSMEPP PEIDAALTST SNVGPEGRRL  960
HKFSSEYIIE QVLKQSEETA RMEQEKLDRK AKKVQADMEK EASLPDLLSE EEVRSSIYID  1020
SNRYRDPESL PMVFSYLPPD DTFTENERQI FIAAFKETPK KWGEIASLLP GRTYKDCIHH  1080
YYTNKWDSRF KDRTKGKSRG GPGRGRGRGG KSALRGRGAA AMADLNRNEP VTAPNVSDSG  1140
RPKRAAAPTT FGERETDLKT APVGSSPVKK VNAVTKLDGA GEPEKKKRKG EKPGRKPKAQ  1200
QPLAALAAAP PITSPPKPFL PAMHSKEETV RAQNLADANL LAGLHTGHHH SMIVPADGQI  1260
SYAVHDTFAQ PPLPIDEPTR PKPLGPAVMS KSNASSYWSV PEQTDFAKYI AHFGTDFASI  1320
AAHMGTKTQT MIKNHYIRQV ESGNKAELEA SAKDADQRRK NGEDMGAPPT PTPIVKRKYD  1380
NPPTNAPRPL APHKNANAMD VDEASPQSRI IPSKHTSPQQ FVQQPRFTTT AQSTPVTAPS  1440
PLPAAAVPQH TPHIQPVPLA RPMQHSLSSQ MSFLADAMPE PRSNLQQSSQ SLRMISDAGN  1500
DPSVPTPLPP RSQPSLEMIR DLKAEQDRAF RVQQEQSQQD RIGLHRQIPM HQESMLASPA  1560
TQPRLAPTLE RQSSKEESAA TAPSRSVFAA TGLTGLSRPQ SSQPIFGLGG MGLGSPLSNR  1620
SPFRQAQIKR EEPRPSSVPA VPLPAIAPPA PAPEPPKRSN LMSILNSDPE PVAPPPKRES  1680
LSVAPLRTSS PAPRGSAYAQ ASTPSTLLDA SSRRDMLVQP PMAQSPFAPR PSYQHPGEKV  1740
GQPLSQPTTL KHELSSGGLP VSQPPKPDWA SHVHQRSSQP SQPTRSTLDS GDLRSSIFSH  1800
RPSALSSLNQ PVRGNPSPPP NAMLGHSRTS SLTAPTSQAP RDPQRPGLAM QQPPHQSHSS  1860
VPPLQPGSYG NQSSPSAFHQ HPQDLRNHAP HSHSASLGAG FGLAGAHHRG VSRDDHMRHE  1920
QAFAQREREE QEWRRRQQED AERRESQYRH DQERQQQQQQ QQQQQQTQQQ SQPVFGRSLP  1980
PAMQPAFEAA APPFGQGRPA YGLREASLRE VQSIVAEQSY MEEASRRRHE ALMLDRERDA  2040
AADYRRRQHD EPGFRRTPLG AGNGYAGPPP LPPSQQQQQQ QQQQQQQPPP PPSRR
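The sequence above is displayed in space-separated groups of ten residues, with a running position counter at the end of each full line. As a minimal sketch, assuming the block has been copied as plain text in that layout, it can be collapsed into one contiguous string as follows. The function name `clean_sequence` is illustrative and is not part of PlantTFDB.

```python
import re


def clean_sequence(block: str) -> str:
    """Collapse a formatted protein sequence block into one string.

    Assumes the layout shown on this page: residues in space-separated
    groups, optionally followed by a numeric position counter.
    """
    # Keep only letters; this drops spaces, newlines, and counters.
    return re.sub(r"[^A-Za-z]", "", block).upper()


# First display line of the sequence on this page.
first_line = (
    "MSSRYPPSDY RVPRDRSRSP SRFDRRASTA NAFENRPQPP PRISSDAPRG PRTQFDAPRP  60"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Applied to the whole block, the resulting length can be compared with the stated length of 2095 aa as a quick check that nothing was lost in copying.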
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1099   1109  RGGPGRGRGRG
2    1169   1189  KKVNAVTKLDGAGEPEKKKRK
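
The Start and End values on this page are 1-based and inclusive, whereas Python slices are 0-based and end-exclusive. The sketch below shows one way to convert between the two, checked against the first NLS entry. The helper `region` and its `offset` parameter are hypothetical names introduced for illustration, and the fragment is the display line covering residues 1081 to 1140 of the sequence section.

```python
def region(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    offset is the position of seq[0] in the full protein, so a
    fragment can be sliced using full-length coordinates.
    """
    if start < offset or end < start or end - offset >= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - offset : end - offset + 1]


# Residues 1081-1140 of XP_023889168.1, copied from the sequence section.
fragment = "YYTNKWDSRFKDRTKGKSRGGPGRGRGRGGKSALRGRGAAAMADLNRNEPVTAPNVSDSG"

print(region(fragment, 1099, 1109, offset=1081))  # RGGPGRGRGRG
```

With the full 2095-residue sequence the default `offset=1` applies, and the same call shape works for the Myb_DNA-binding domain coordinates.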