PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PHT47422.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family MYB
Protein Properties Length: 1031aa    MW: 115314 Da    PI: 8.308
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PHT47422.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding22.33e-07331428147
                      TSSS-HHHHHHHHHHHHHTTTT...................................................-HHHHHHHHTTTS-HHHHHHH CS
  Myb_DNA-binding   1 rgrWTteEdellvdavkqlGgg...................................................tWktIartmgkgRtlkqcksr 43 
                      r+ W++eE e l+++vkq   +                                                   +W+ +a++   gR++ +c+sr
       PHT47422.1 331 RKDWSKEESENLAKGVKQQFQEmllqrsvnlpvnllsdedgcsresgdlddviasirdhkitpetmrsfipevNWDQVASMYLPGRSGGECQSR 424
                      788****************999**********************************************************9888********** PP

                      HHHH CS
  Myb_DNA-binding  44 wqky 47 
                      w+++
       PHT47422.1 425 WLNW 428
                      **98 PP

2Myb_DNA-binding25.82.4e-08438480446
                      S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      W   E++ l+++v+q  + +W  Ia+ +g  Rt+ qc s++q 
       PHT47422.1 438 WDLSEEKNLLQVVQQKRMSNWVDIAASLGVSRTPFQCLSHYQR 480
                      9999*************************************96 PP

3Myb_DNA-binding43.11e-13488532146
                      TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      rg WT eEd +l  av+ +G  +W  +a+ +  gRt+ qc +rw k
       PHT47422.1 488 RGDWTDEEDSRLCAAVETFGESNWQVVASVIE-GRTGTQCSNRWIK 532
                      899*****************************.***********87 PP

4Myb_DNA-binding38.92e-12542600247
                      SSS-HHHHHHHHHHHHHTTTT..............-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGgg..............tWktIartmgkgRtlkqcksrwqky 47 
                      g+W+++Ed++l+ av ++ ++               Wk++a++++ gRt  qc++rw + 
       PHT47422.1 542 GKWSADEDKRLKVAVMLFYPKcwrnigqsvpwrtpVWKKVAQYVP-GRTHVQCRERWVNT 600
                      89*****************99************************.***********987 PP

5Myb_DNA-binding51.52.3e-16609651347
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                       WT+eEd++l+ a++++G   W+++a++++  Rt++qc+ rw  +
       PHT47422.1 609 EWTEEEDLKLKSAINEHGYS-WSKVAACVP-SRTDNQCRRRWMVL 651
                      7*****************99.*********.***********865 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1031 aa     Download sequence    
MAFDSDDDDF SGTDSDEGFQ ADMEALKKAC LLSGKDADDL QPSSSSDDGH VAGANDVTLS  60
DTDADEDEDD DFELVRSIQE RFALSTEVHD PISMKPLCSI LPPGSEGDED DDFETLRVIQ  120
RRFTAYGDDN GNGRGGSLFD KFEQVGVTNI TSEKETSNNL FVETTNAEEG FPACVDRTIQ  180
ISEECSNDVA RSKNLTDGHD SGAETTAISV NSSRFPKSAH AFVDAIKKNR ACQKLIREKM  240
MQTEARLEEL KKLKERVKIL KSFQLTCKKK MGRALSQKRD ARVQLISLPK QRFSAKLPGK  300
KLSAIHSGPP ENSHVASFKE ALTQFAVSLS RKDWSKEESE NLAKGVKQQF QEMLLQRSVN  360
LPVNLLSDED GCSRESGDLD DVIASIRDHK ITPETMRSFI PEVNWDQVAS MYLPGRSGGE  420
CQSRWLNWED PLIKHEGWDL SEEKNLLQVV QQKRMSNWVD IAASLGVSRT PFQCLSHYQR  480
SLNASIIRGD WTDEEDSRLC AAVETFGESN WQVVASVIEG RTGTQCSNRW IKSLHPARRR  540
CGKWSADEDK RLKVAVMLFY PKCWRNIGQS VPWRTPVWKK VAQYVPGRTH VQCRERWVNT  600
LDPSLKLDEW TEEEDLKLKS AINEHGYSWS KVAACVPSRT DNQCRRRWMV LFPDEVPMLK  660
EAKKKRREAF ISNFVDREEE RPALKPNDIV LAHKLSKAGC ETTTANKKRK RRPRTAKDDK  720
TPKCDAVREM EKQHSEGSEG PESSDLINNQ ESDKILDGEE AVEQGCNDAR KNKRRSRRHP  780
RNTKKVKPND KVPEASASSA GESTVADDNI CKRRRRTSSL VKKKSRTIAS ASITVDSTIA  840
DGNSCKRRQR TSSQVKKKSS DKNARQLETS AVPGTLGGGT SGASVRHKLN KCNRLKNDDR  900
SCVGNLPETA DDCMTLASFV RKSRAKGCSL SFTKVARLHP DKAQGKAMAG DHSSRSCISG  960
GHDETEKRTS QECTSSNQIS GTEVGDDMPL SLFMGKVKKE KIEVGDDMPL AFFLGKVKRG  1020
QPSGAQDRKR K
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1645667RRRWMVLFPDEVPMLKEAKKKRR
2663667KKKRR
3707712KKRKRR
4708713KRKRRP
5770778RKNKRRSRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18100.11e-163MYB family protein