PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CA03g33040
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family MYB
Protein Properties Length: 724 aa    MW: 81987.3 Da    pI: 6.306
Description MYB family protein
Gene Model
Gene Model ID  Type    Source
CA03g33040     genome  PEP
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 Myb_DNA-binding 24.3 7.5e-08 331 421 1 47
                      TSSS-HHHHHHHHHHHHHTTTT............................................-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding   1 rgrWTteEdellvdavkqlGgg............................................tWktIartmgkgRtlkqcksrwqky 47 
                      r+ W++eE e l+++vkq   +                                            +W+ +a++   gR++ +c+srw+++
       CA03g33040 331 RKDWSKEESENLAKGVKQQFQEmllqrsvnlpvnllresgdlddviasirdhkitpetmrsfipevNWDQVASMYLPGRSGGECQSRWLNW 421
                      788****************999***************************************************9888************98 PP

2 Myb_DNA-binding 26.4 1.6e-08 431 473 4 46
                      S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      W   E++ l+++v+q  + +W  Ia+ +g  Rt+ qc s++q 
       CA03g33040 431 WDLSEEKNLLQVVQQKRMSNWVDIAASLGVSRTPFQCLSHYQR 473
                      9999*************************************96 PP

3 Myb_DNA-binding 43.7 6.5e-14 481 525 1 46
                      TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      rg WT eEd +l  av+ +G  +W  +a+ +  gRt+ qc +rw k
       CA03g33040 481 RGDWTDEEDSRLCAAVETFGESNWQVVASVIE-GRTGTQCSNRWIK 525
                      899*****************************.***********87 PP

4 Myb_DNA-binding 39.5 1.3e-12 535 593 2 47
                      SSS-HHHHHHHHHHHHHTTTT..............-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGgg..............tWktIartmgkgRtlkqcksrwqky 47 
                      g+W+++Ed++l+ av ++ ++               Wk++a++++ gRt  qc++rw + 
       CA03g33040 535 GKWSADEDKRLKVAVMLFYPKcwrnigqsvpwrtpVWKKVAQYVP-GRTHVQCRERWVNT 593
                      89*****************99************************.***********987 PP

5 Myb_DNA-binding 52.1 1.5e-16 602 644 3 47
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                       WT+eEd++l+ a++++G   W+++a++++  Rt++qc+ rw  +
       CA03g33040 602 EWTEEEDLKLKSAINEHGYS-WSKVAACVP-SRTDNQCRRRWMVL 644
                      7*****************99.*********.***********875 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50090           7.12      326    422  IPR017877    Myb-like domain
SMART            SM00717           1.6E-5    330    424  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  7.9E-11   333    349  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  7.9E-11   395    436  IPR009057    Homeodomain-like
SuperFamily      SSF46689          8.99E-14  404    475  IPR009057    Homeodomain-like
PROSITE profile  PS51294           8.15      423    475  IPR017930    Myb domain
SMART            SM00717           3.4E-6    427    477  IPR001005    SANT/Myb domain
Pfam             PF00249           2.9E-7    431    473  IPR001005    SANT/Myb domain
CDD              cd00167           2.65E-4   431    475  No hit       No description
Gene3D           G3DSA:1.10.10.60  7.6E-12   437    483  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.38E-18  461    530  IPR009057    Homeodomain-like
PROSITE profile  PS51294           19.54     476    531  IPR017930    Myb domain
SMART            SM00717           4.6E-13   480    529  IPR001005    SANT/Myb domain
Pfam             PF00249           2.2E-12   481    525  IPR001005    SANT/Myb domain
CDD              cd00167           1.03E-9   484    527  No hit       No description
Gene3D           G3DSA:1.10.10.60  3.2E-16   484    530  IPR009057    Homeodomain-like
PROSITE profile  PS50090           8.862     529    594  IPR017877    Myb-like domain
SuperFamily      SSF46689          6.86E-5   530    569  IPR009057    Homeodomain-like
SMART            SM00717           1.6E-9    533    596  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  3.4E-13   535    584  IPR009057    Homeodomain-like
Pfam             PF00249           4.2E-11   535    593  IPR001005    SANT/Myb domain
CDD              cd00167           1.48E-7   536    594  No hit       No description
SuperFamily      SSF46689          1.12E-25  574    646  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  1.3E-21   585    641  IPR009057    Homeodomain-like
PROSITE profile  PS51294           21.985    595    649  IPR017930    Myb domain
SMART            SM00717           3.8E-15   599    647  IPR001005    SANT/Myb domain
CDD              cd00167           3.44E-12  602    645  No hit       No description
Pfam             PF00249           1.6E-15   602    643  IPR001005    SANT/Myb domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 724 aa
MAFDSDDDDF SGTDSDEGFQ ADMEALKKAC LLSGKDADDL QPSSSYADGH VAGANDVTLS  60
DTDADEDEDD DFELVRSIQE RFALSTEVHD PISMKPLCSI LPPGSEGDED DDFETLRVIQ  120
RRFAAYGDDN GNGREESLFD KFEQVGVTNI TSEKETSNNL FVETTNAEEG FPACVDRTIQ  180
ISEECSNDVA RSKNLTDGHD SGAETTAISV NSSRFPKSAH AFVDAIKKNR ACQKLIREKM  240
MQTEARLEEL KKLKERVKIL KSFQLTCKKK MGRALSQKRD ARVQLISLPK QRFSAKLPGK  300
KLSAIHSGPP ENSHVASFKE ALTQFAVSLS RKDWSKEESE NLAKGVKQQF QEMLLQRSVN  360
LPVNLLRESG DLDDVIASIR DHKITPETMR SFIPEVNWDQ VASMYLPGRS GGECQSRWLN  420
WEDPLIKHEG WDLSEEKNLL QVVQQKRMSN WVDIAASLGV SRTPFQCLSH YQRSLNASII  480
RGDWTDEEDS RLCAAVETFG ESNWQVVASV IEGRTGTQCS NRWIKSLHPA RRRCGKWSAD  540
EDKRLKVAVM LFYPKCWRNI GQSVPWRTPV WKKVAQYVPG RTHVQCRERW VNTLDPSLKL  600
DEWTEEEDLK LKSAINEHGY SWSKVAACVP SRTDNQCRRR WMVLFPDEVP MLKEAKKIRR  660
EAFISNFVDR EEERPALKPK DIVLAHKLSK AGCETTSANK KRKRRYAFHN CAPDSFATCV  720
VLI*
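The Start and End columns of the Signature Domain table can be checked against the protein sequence. The Python sketch below is illustrative only and is not part of the PlantTFDB record. The helper names `parse_sequence` and `domain` are hypothetical, and the sketch assumes that the domain coordinates are 1-based and inclusive, which is consistent with the alignments shown in the Signature Domain section.

```python
import re

# Numbered sequence block copied verbatim from this record.
RAW_BLOCK = """
MAFDSDDDDF SGTDSDEGFQ ADMEALKKAC LLSGKDADDL QPSSSYADGH VAGANDVTLS  60
DTDADEDEDD DFELVRSIQE RFALSTEVHD PISMKPLCSI LPPGSEGDED DDFETLRVIQ  120
RRFAAYGDDN GNGREESLFD KFEQVGVTNI TSEKETSNNL FVETTNAEEG FPACVDRTIQ  180
ISEECSNDVA RSKNLTDGHD SGAETTAISV NSSRFPKSAH AFVDAIKKNR ACQKLIREKM  240
MQTEARLEEL KKLKERVKIL KSFQLTCKKK MGRALSQKRD ARVQLISLPK QRFSAKLPGK  300
KLSAIHSGPP ENSHVASFKE ALTQFAVSLS RKDWSKEESE NLAKGVKQQF QEMLLQRSVN  360
LPVNLLRESG DLDDVIASIR DHKITPETMR SFIPEVNWDQ VASMYLPGRS GGECQSRWLN  420
WEDPLIKHEG WDLSEEKNLL QVVQQKRMSN WVDIAASLGV SRTPFQCLSH YQRSLNASII  480
RGDWTDEEDS RLCAAVETFG ESNWQVVASV IEGRTGTQCS NRWIKSLHPA RRRCGKWSAD  540
EDKRLKVAVM LFYPKCWRNI GQSVPWRTPV WKKVAQYVPG RTHVQCRERW VNTLDPSLKL  600
DEWTEEEDLK LKSAINEHGY SWSKVAACVP SRTDNQCRRR WMVLFPDEVP MLKEAKKIRR  660
EAFISNFVDR EEERPALKPK DIVLAHKLSK AGCETTSANK KRKRRYAFHN CAPDSFATCV  720
VLI*
"""

# Start/End pairs taken from the Signature Domain table of this record.
SIGNATURE_DOMAINS = [(331, 421), (431, 473), (481, 525), (535, 593), (602, 644)]


def parse_sequence(block: str) -> str:
    """Remove position counters, whitespace, and the trailing '*' stop marker."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


sequence = parse_sequence(RAW_BLOCK)
domains = [domain(sequence, start, end) for start, end in SIGNATURE_DOMAINS]

if __name__ == "__main__":
    print("residues:", len(sequence))
    for (start, end), residues in zip(SIGNATURE_DOMAINS, domains):
        print(start, end, residues)
```

Under that assumption, the third slice reproduces the ungapped query line of alignment 3 (RGDWTDEEDSRLCAAVETFGESNWQVVASVIEGRTGTQCSNRWIK). The parsed sequence contains 723 residues, so the listed length of 724 aa appears to count the trailing `*`.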
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
1h88_C  1e-26    400          641        9          155      MYB PROTO-ONCOGENE PROTEIN
1h89_C  1e-26    400          641        9          155      MYB PROTO-ONCOGENE PROTEIN
Nuclear Localization Signal
NLS
No. Start End Sequence
1 699 704 KKRKRR
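The NLS coordinates can be compared with the protein sequence in the same way as the domain coordinates. The sketch below is illustrative only: the function name `locate` and the constants are hypothetical, and the fragment is residues 661 to 723 copied from the last lines of the Protein Sequence block, counting positions from 1 at the initial methionine.

```python
# C-terminal residues 661-723 of CA03g33040, copied from the sequence block.
C_TERMINAL = (
    "EAFISNFVDR" "EEERPALKPK" "DIVLAHKLSK"
    "AGCETTSANK" "KRKRRYAFHN" "CAPDSFATCV" "VLI"
)
FRAGMENT_OFFSET = 660  # number of residues that precede the fragment
NLS_MOTIF = "KKRKRR"   # Sequence column of the NLS table


def locate(motif, fragment, offset):
    """Return (zero_based_start, one_based_start, one_based_end), or None."""
    index = fragment.find(motif)
    if index < 0:
        return None
    zero_based_start = offset + index
    one_based_start = zero_based_start + 1
    one_based_end = zero_based_start + len(motif)
    return zero_based_start, one_based_start, one_based_end


nls_position = locate(NLS_MOTIF, C_TERMINAL, FRAGMENT_OFFSET)

if __name__ == "__main__":
    print(nls_position)  # (699, 700, 705)
```

With 1-based numbering the motif spans residues 700 to 705, so the listed Start and End of 699 and 704 appear to follow a zero-based convention, unlike the Signature Domain coordinates.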
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975442  1e-167   HG975442.1 Solanum pennellii chromosome ch03, complete genome.
Annotation -- Protein
Source  Hit ID          E-value  Description
RefSeq  XP_016563173.1  0.0      PREDICTED: uncharacterized protein LOC107862199
TrEMBL  A0A1U8G3B3      0.0      A0A1U8G3B3_CAPAN; uncharacterized protein LOC107862199
STRING  XP_009770188.1  0.0      (Nicotiana sylvestris)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA108762124
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G18100.1  1e-170   myb domain protein 4r1