PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_023880040.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family MYB
Protein Properties Length: 2075aa    MW: 227763 Da    PI: 10.4017
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_023880040.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding28.34e-0910341074345
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                        +T  E +++v a+k+ +++ W  Ia+ ++ gRt  +c+++++
   XP_023880040.1 1034 TFTDNERQIFVAAFKETPKK-WGEIASLLP-GRTYRDCIHHYY 1074
                       699*****************.*********.***********8 PP

2Myb_DNA-binding27.76.4e-0912881329346
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46  
                        W+  E+  ++++++ +G++ +++Ia++mg ++t  ++k+++q 
   XP_023880040.1 1288 YWSVPEQTDFAKYIAHFGTD-FASIAAHMG-TKTQTMIKNHYQR 1329
                       59999***************.*********.***********96 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2075 aa     Download sequence    
MSSRYPPSDY RPPRDRSRSP PRFDRRASSA NVFENRPPPP PRITSDAPRG PRSQFDASRA  60
PPGAGPSPTT PASRPAFSSL RDAPPLGSSD RARPFRERDD YRRDRPPSPR ERSPARPFKN  120
SRDYPPRDID VGRPRRGSRD GPLSAGATYP DSSSFAPGSF GGRGGFNGRG RGRVEFEAGR  180
GRGVRRNLDD RDLFRRDRSP PPRWGRDLSR DGREPERRDD RRFERRDDDR RPEWAERDRE  240
TTDRGRRDQP PLRPEPRQLN GSPHSATGSH AGSQAPPIDP GRLALIEGSS ADVAARRQSV  300
PQNAAPPPRR EVPETPSYLN GRADATANRY NQRGSSPPTQ APPVPAFTLS FAPTGPAAAS  360
STSNTSSSKP PRPKSAGPSD DVPVHEHTRV EKRVEPPVDA PVAPKAQARA PLAPKAQLAE  420
PPPSAPRAPR ALEVEPAPGP HGRLQGVRSM ESLAGSRAPP SGPSFPRASM PPRSSSTAQA  480
VAVSTSSLAT LTQPVSPNVQ QVREFSAPTG PRATRISPAP ASVSPRPAFA SPRSEAASFQ  540
RNQTPPPPSG PRRRSISVSP KVPASSVPTA PKADRIPPTA PRGGPMTSAR ATDRPGGPQS  600
WAVPSAPRNL QWNQWKRPLP PESKLVPAKR DSAGEEKDRQ VEVMSSDVTM SEASPKMQHD  660
SRQAQIQDEN KMQIDSDTNP PAHSAKQSFF GQPMETTEPD EVSDAADVED IMSSSEDEGL  720
DVEDEALFNA KFEKRKRELE AKLVDLSARE YRATTPLECI ARLARISIAD LQRANENQEM  780
DVDETNVAKG HRRKKPPTQN SDSDANPDAI TPTQEEDDTR VQIRDGGDDG PEHIRYVRRP  840
SPEIINLPYL TKTPQPLFKS EPLVASSLDE VTMQAAVYDA LRDDMGAISS AQADCEQDFL  900
EKLRTWRAQC AKLDREQEEH EKVERQKSME PLPEMDAPLA SINNAGSESR RLHKFSSEYI  960
IEQVLKQSEE TARMEQEKLD RKAKKVQADM EKEALLPDLL SVKEVRSGVY IDSNRYRDPD  1020
SLPMVFSYLP PDDTFTDNER QIFVAAFKET PKKWGEIASL LPGRTYRDCI HHYYANKWDN  1080
RFKDRTKGKA RGGPGRGRGR GGKSALRGRG AAAMADLNRN EPVAPPSVSD SGRPKRAAAP  1140
TTFGERETDL KTTPAGPSPV KKLGLAAKSD GTGEPEKKKR KGEKPGRKPK SQQPLAALAA  1200
APITSPSKSY LPTMQSKEEI VRAQNLADAN LLAGLHSGHH HPMMVPAEGQ LSYAVHDTFA  1260
QPPLPIEEAA RSKPIGPGAL SKSNASSYWS VPEQTDFAKY IAHFGTDFAS IAAHMGTKTQ  1320
TMIKNHYQRQ VESGNKAELE ASAKDADHRR KNGEDMGAPP TPTPIVKRKY ENPQSNAPRT  1380
LAPHKNATAM EIDETPPQSR IIPSKHTSPP QFVQQPRFTT TAQSAPVTAP SPLPTAAVLQ  1440
HTPHAQPITS ARPMQHSLSS RISFLSDTMP DARTTLQQTG QPLRMLSEAN NEAVVPTPLP  1500
ARSQPSLEMM RDLKAEQDRA FRVQQEQSQQ ERIGLHRQMP MHHETMQLSP ATQPRIAPPL  1560
ERQGSTEESA SSASSRAAFA GSGLASLSRP QSSQPIFGLG SMGLGSSLSN RSPFRQAQIK  1620
REDSRPSSVP AVPLPTIAPP VPAPEPPKRS NLMSILNSEP EAAAPPPKRE TPSVAPLRTH  1680
SPAPRLTAFS QSSTPAPLLD HSSRRDTRLQ SPMAQHAFAA RPPFQHSGDK AASHHSQPPP  1740
LNHELSSGGL PNPQPSKPDW ASHVHQRPSQ SGQSARPPLD GGDLRSSIFS HRSSALSPLH  1800
QPMRGNPSPP PNAMLGHSRT SSLNAQSSQA PRDSQRPGLA IQQPQHQAHS SAQPMQPGSY  1860
GSQSGGPFHQ HPPEMRNHAQ QSHNGAITAG FALPGGHHRG VSRDDLMRHE QAFAQRDREE  1920
HEWRRRQQED AERRESQFRH DQERQQHQQH QQQQSQQQSQ QAFGRSLPPA MQPSFEAPAP  1980
FGQNRPSFGL RETSLREVQS MVAEQTYIEE ANRRRLERIM MEREQEAAAD YRRRHDEPGF  2040
RRTPLGGGNG YGGPPPPPPQ QQQQQQPGPP PPSRR
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
110911101RGGPGRGRGRG
211611181KKLGLAAKSDGTGEPEKKKRK