PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0006080.1_g010.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family MYB_related
Protein Properties Length: 1967aa    MW: 220148 Da    PI: 9.0724
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0006080.1_g010.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding24.66e-08855891341
                                SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHH CS
            Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqck 41 
                                 W++ E e+++da +++G++ W+++a+ +  +R+ +++ 
  Pav_sc0006080.1_g010.1.mk 855 EWSKGELERFYDAYRKYGKD-WRKVAAAVR-NRSVEMVE 891
                                7*****************99.*********.***98875 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1967 aa     Download sequence    
MKSKLLTFVV QHLSPPLIHG GFNTIVENSL SLSDSFPEMA QTPHSLLLLK RLSRCSPNLN  60
PPPAFLSLLK PFSTSPTPIS SLEPKPSSLS ARLSFVFDQI DAIEKERSEK DQTLQKIRAW  120
RESKKSQNNP ELGLGVVSDS ENNAKLISSD SVSSESERFE TPVAAKKEVE VVHPWPEWIE  180
LMERLVQQNY FDHRRNDEDR MIQDIGFNAS EAVLAAEEDA QGVDFKDFKT VQTACLNFGK  240
DRFDIMRSLS RQDIQVLVGF GCPSTDKKVV FSSKLLRKHT HLDEGDVCSS CSLRNSCERA  300
YLITNKEDEA RTIDIMRVLL AYGFDPVNGS VVNKSLLKQK SVKTVVRKLL HQVVKLSSVP  360
IDPNLPPPVI KKPPPKVKQP PPPPRRRVGR DDIEMKKGDW LCSKCDFMNF AKNTICLQCD  420
AKRPKRQLLP GEWECPGRNM ACFHCDCKRP PDEYLENKVQ EMQRGPRTRM EKTAVLHGDS  480
NAWNFDFDDN ESDGADVAAF EYADSSVIGE GSLGNQAQGQ NFGRPKNSRV PRVHNEEYSD  540
GDTVRPGRGF DDFNDEDDDI DNYELDTNNK NSAQSGSIDF SEFEGSESED IEGSDNSSHG  600
RRRTKSSYNK GLSFGSDDEL GLSSDVDDVD QTFGSRQGKL SKLSSGRRDF HRRGNFDIED  660
DSVSGSESDN DDFQSNKNRL RRPKENNFKG RGSLNFTRDT QFESSGMKGG RRNSFNNDFD  720
RSARGSHGSN KGFRGNDFDS QRMSNRGGDK QGFKGGPRRE GFGKSGGRNS FNDDFDKSAG  780
GSRGNNKGFR GNDFDGQRRS NRGADAHNFK GPRREGFGKQ QRGGVNEYGR DKDRGFDDYR  840
NSRRKRKLSD KLGPEWSKGE LERFYDAYRK YGKDWRKVAA AVRNRSVEMV EALYNMNRAY  900
LSLPEGTASV VGLKAMMTDH YNVMEGSDSE RESNDALGFS RKPQKRKLGK DQLSASKDAF  960
QSHSSASHEG CLSLLKRRRL DGGQPRAVGK RTPRFPVSYA YKKDDRDTYV SPIKKGRRSE  1020
GDNDDEVAHV AALLTEASQR GGSPQISQTP YRRPVHVKSS SVQSSERMHP PRGKARANLR  1080
DPSMDEDWLE GSIGSKGAET GDYARDSLEG VGTVEINWKG KKFYGKKEKA KDIGNHQFDD  1140
GGEACSGTEE GLNVSSRGKD DIEVSNTKGE RFSPQSQRKR SKKLYFGDES SCLDALQTLA  1200
DLSLMMPEST MESGSSVQLK EEGTNLDVED KFSVPEATST SQSRNKNKIP SAKHRVPFAI  1260
SGVEGTNSKK SKLGREPAFD ITAVSESEQQ LQSTTKTWKR KRKSSVSKIS NADAPIDSNL  1320
NEPLKTEAFG EEENKPVTKG KRTNQSSTPS KQWKSTRSLE GSLNSDYRRT GTDLTVTTAQ  1380
APTSNHVNLP TKRISRRKMY IPRTLHPKEK SSEKKLKNQL NIRSSSAQDR ALYLKEKTSC  1440
CLSSHLVRRW CTFEWFYSAL DYPWFAKREF EEYLNHVGLG HIPRLTRVEW GVIRSSLGKP  1500
RRFSEHFLHE EREKLKQYRE SVRKHYAELR TGDREGLPTD LARPLSVGQR VIALHPKTRE  1560
VHDGSVLTVD HDKCRVQFDR PDIGVEFVMD VDCMPLNPLD NMPEALRRQN FAFDKFSLTS  1620
MEANKNGNLN FGGPHLEKAT SPVNTSVKQG KGDSNHTTSQ PKAASADIGR AQAQQTTYSQ  1680
PGMVVAHNQA RDADIRALSE LTRALDKKEA LLMELRNTNN NILENQNSGE CSLKDSEPFK  1740
KHYATVSSAL LNLRQRNTYP ANSLPPWLKQ PANSTVYGGL PSSFDSSISQ ESGSSVAEIV  1800
EVSRSKAHMM VNAAIQAMSS RKGGEDAYVR IREALDSIDN QHLPSDSRLS LNRSQEQVNG  1860
NLGHRNQLIS STSDPNFTSD SPGPKPNTDT EKTEAQVLSD VISACVMAVH MIQTCTERQY  1920
PPAVVAQVLD YAVTSLHPRC PQNVGIYREI QMCMGRIKTQ ILALVPT
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1941948RKPQKRKL
2976980KRRRL
313961417RRKMYIPRTLHPKEKSSEKKLK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G21430.20.0MYB_related family protein