PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID SMil_00002802-RA_Salv
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Lamiales; Lamiaceae; Nepetoideae; Mentheae; Salvia
Family MYB_related
Protein Properties Length: 1633aa    MW: 178483 Da    PI: 6.6445
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
SMil_00002802-RA_SalvgenomeNDCTCMView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding24.84.9e-0810131053345
                             SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
        Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                              W+ eE   +++av  +G++ +  +++++  +++ +qck ++ 
  SMil_00002802-RA_Salv 1013 DWSDEEKSVFIQAVSSYGKD-FGMVSQCVR-TKSTNQCKVFFS 1053
                             5*****************99.*********.********8876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.39E-13780840IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.5E-4787837IPR009057Homeodomain-like
PROSITE profilePS5129317.31792843IPR017884SANT domain
SMARTSM007173.3E-8793841IPR001005SANT/Myb domain
PROSITE profilePS5129310.00310091060IPR017884SANT domain
SuperFamilySSF466893.12E-1010091060IPR009057Homeodomain-like
SMARTSM007171.3E-610101058IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.601.6E-410121054IPR009057Homeodomain-like
PfamPF002493.6E-610131053IPR001005SANT/Myb domain
CDDcd001673.12E-510141052No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1633 aa     Download sequence    Send to blast
MPPEPLPWDR RDFRKHDRSA SNARAVAGGL GRGGPPKWRD QQQQQRHHRA PPPYHPQQRW  60
YSDFRSPGQS KQGGWPMYSG DAGQGFMPFG SRCDRNLEDE GCQPFGSRGN GRHFRTDRGS  120
FTLKDWKGPS WEAAAVHNST GETITEVNKL RSIENTQPCH NRSSRSDCAS HAPLGSISLS  180
DQSQTESLSR EKNDNTTDET TGKGEESEKE NCLEPVEWKP LKWSRVGSLS SRSSCTSHPT  240
SSKSPGRDSI EVLLEVKPKN VALDKSLSVD VSCVSSNAPA QSEETGSRKK PRLGWGEGLA  300
KYEKKKVEGP EDGTTKSEPS LSAISTESPL SQSVNLLEKS PRIESLLECA SPATPSSVAC  360
SYSPGVKLIL FVNEFILIYF HETGVLEEKP VQDANLEFDA TNVSCSPSIA SQSHYGGSPI  420
NLQNLELESI ANLSSLINEL LQSNDACSAE NGDTQRMSMN KLLVWKVDVL KALEMTESEI  480
ESLEIELKSS IAKPRICCFH PAGPRSLAGE QQSRPCEVLD TASRYGVGHV LSKDDDIDIP  540
GSATSKFVDV LPAIFPSEKA EFSKGSRNLD VDKSCNLDKK FLKNVFSSSE SHGYVNSHVL  600
IGNTSHKDCN VDNIWGSILS SNRDVASRAL EQLNKLLPEQ CGFDGSVESI ISSLPRISAV  660
VKDRFLKRKQ FIQFKEKVLA LKFKVFQHFW KEGRVVSIRT LQGKTRKKFD PSRNSQKRNR  720
SRVSSYAGGC QTVPADEVIN FVNNLLSQLA FKPYRNTLKM PALILDKHVK MSRFISNNGL  780
VEDPCTVEKE RSIMNPWAAE ETEIFIEKLA AFGKDFMKIA SFLDHKTVAD CIEFYYKNHK  840
SKWFVEARKN SGFIKQRKSQ TTTYLVGSGK RRNREFNAAS LDMLGAASVI AANIGNGMDV  900
RQKCISRSSF GASSSCGGPR AADHLLKGSD STNMENNERE TEAADVLASI CGSLSSETTS  960
SSITSSIDLG DGYQDPSCPR ITLCIKRPLT PEVTQDTDGE CSDESCGQMN PTDWSDEEKS  1020
VFIQAVSSYG KDFGMVSQCV RTKSTNQCKV FFSKARKCLG LDLVQPGPGA ASDDVDGGGS  1080
DIEDDCNMGT YSGIDNNSSE FKMKENSPPP RKNSNHESEI VDTAPDFRIF KGNNGLGPLD  1140
STTNELVLEN SSILGGCGDD KPVTDVQDPQ TAAVASNMES EQEIKEEVPD WSNEVGKGIL  1200
VKASNGHCAE EKQCQVPVLP EDNLSIRDAD SIDVNDTRCG ISWKKFEPQL IGNVSHATVD  1260
AHSSTQTNQK SDVPKEADVG TCTAEKSCVS SLLQNGRLAS VASSTIFSVP IKYKKTSNNT  1320
PLLPVEASGN DGTHLPSYSL SKSTGSSQIL QGYPLSLQTM KGTNGDVSSA SPNAPRRGEN  1380
LNSDWRTDFS LQKCNEARQS NASFRPLERV RDRGTPPSCC SSAGDNLPRN GDVKLFGKIL  1440
SSSQQNPVEQ ADDNSSSPNH RTKSLNLIVS SEQKDSAQSK FDYNSYAPPE KTAVRRFALW  1500
DGSRIQTTLI PPIPDSARLL AKYPSAFSNF ALPSLCEPVD VAPVFQSREM SSSIGVKQLQ  1560
DDTLAEMQRR SASNAASGTS GVDIGGRGRM VFGGQYHSLT DPVAAIKMHY ARMQGGNGVD  1620
GNDRWISSRD VGR
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C4e-16755842491NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D4e-16755842491NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1716721KRNRSR
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_020548929.10.0uncharacterized protein LOC105161106
TrEMBLA0A4D9BK670.0A0A4D9BK67_SALSN; Uncharacterized protein
STRINGMigut.F01767.1.p0.0(Erythranthe guttata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA59042332
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-107MYB family protein