PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sp_139300_cpzh.t1
Common Name SOVF_139300
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; Caryophyllales; Chenopodiaceae; Chenopodioideae; Anserineae; Spinacia
Family MYB
Protein Properties Length: 1600 aa    MW: 173639 Da    PI: 6.4467
Description MYB family protein
Gene Model
Gene Model ID      Type    Source  Coding Sequence
Sp_139300_cpzh.t1  genome  TBVR    View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  28.5   3.6e-09  794    835   3          46
                        SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
    Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                        +WT eE e++ d  + +G++ +++Ia+ +  ++t  +c+++++k
  Sp_139300_cpzh.t1 794 PWTFEEKEIFMDKLATHGKD-FRKIASFLD-HKTTADCVEFYYK 835
                        8*****************99.*********.***********98 PP

2    Myb_DNA-binding  25.7   2.7e-08  1006   1046  3          45
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
    Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                          WT +E  l+v+a+  +G++ +  I+r++  ++++ qck ++ 
  Sp_139300_cpzh.t1 1006 DWTDDEKSLFVQAFSSHGND-FLMISRCVR-TKSRDQCKVFFS 1046
                         5*******************.*********.********8776 PP
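
The Start and End columns of the two hits can be cross-checked against the alignment rows. The following is a minimal sketch, assuming that coordinates are 1-based and inclusive and that "-" marks a gap in the query row; the helper names are hypothetical and not part of PlantTFDB.

```python
# Query rows copied from the two Myb_DNA-binding alignments above.
# Each tuple: (query start, query end, aligned query row with '-' gaps).
hits = [
    (794, 835, "PWTFEEKEIFMDKLATHGKD-FRKIASFLD-HKTTADCVEFYYK"),
    (1006, 1046, "DWTDDEKSLFVQAFSSHGND-FLMISRCVR-TKSRDQCKVFFS"),
]


def ungapped_length(aligned):
    """Count residues in an aligned row, ignoring gap characters."""
    return sum(1 for ch in aligned if ch != "-")


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


for start, end, aligned in hits:
    # The residue count of the row should equal the reported span length.
    print(start, end, ungapped_length(aligned), span_length(start, end))
```

Under these assumptions both rows agree with their reported coordinates, which is a quick way to confirm that a flattened table row was split into the right columns.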

Protein Features
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          2.02E-14  778    840   IPR009057    Homeodomain-like
PROSITE profile  PS51293           15.24     790    841   IPR017884    SANT domain
SMART            SM00717           7.9E-9    791    839   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  1.7E-5    792    835   IPR009057    Homeodomain-like
Pfam             PF00249           1.2E-6    794    835   IPR001005    SANT/Myb domain
PROSITE profile  PS51293           11.065    1002   1053  IPR017884    SANT domain
SMART            SM00717           3.0E-6    1003   1051  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          1.87E-9   1004   1053  IPR009057    Homeodomain-like
Pfam             PF00249           8.6E-6    1006   1046  IPR001005    SANT/Myb domain
CDD              cd00167           2.88E-5   1007   1045  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1600 aa
MPPEPLPWDR KELFRDRKHS HEFGGGPPPS RWRDTSSSSS YGGSRNNNNN HHHHNNNDFS  60
PRWGFPSSEF RRPPGHGKQG GWHIYPEDHG YSTPRSGDRF MDDSNMFRQS GMRGDGRYGR  120
FYREGRPFGQ RDWRGHSWEA NHHHSGPANG VGRPHSVNDQ RSVGDSPVRA SHPHSDPRDQ  180
FHTKDQIDKM SDANGSGMGQ RVDRDSLVGS LDWKPLKWSR SGSLTSRGSG FSHSSSSKSI  240
GGESSDVKGD LPPKNVSPVQ SPSGDAVACG PSVTPEETSS RKKPRLGWGE GLAKYEKKKV  300
DGPEDNVNND DNAEPSHSNP SNLIVKSPKI TGFSDCSSPA TPSSFACSSS PGLEDRTYAK  360
TVNAGGDTSN FSVTSMPIPE SHLEESHFRL EKLELKSISN LGSLLTELLQ ADDQCSMDSG  420
FMRSSAFNKL LLWKGEISKT LELTETEIDS LENELTTLKS GNARSFHCPA SSNSGPADYR  480
DGPIDGQTVF PRPHPLEVVS NGDVILEDGL PCDNAGGVQA ESKDEDIDSP GTATSKFVEP  540
PIQKPVSCDV CKNPSYGETG FVESTQSTAD VGTSCQGHDR SLVQLNNSTA IGIEHPGVCD  600
YKEGNICDTI LASNKEFANR AAEGILKLLP NCDYSIEANR VSCMSADSTI KEKFLRRKRA  660
LRFKERVISL KYRAFHHLWK EDLRLLSLRS HRTKPLKKPD LSSRTLQIGS QKHRSSIRAR  720
FTSPAGSLSL VPTTEIINFT SKLLSDSNAK VYRSGLKMPA MILDEKEKMS KFVSSNGLIE  780
DPCAVEEERT MINPWTFEEK EIFMDKLATH GKDFRKIASF LDHKTTADCV EFYYKNHKSE  840
SFEKIKKLEV KKIGKPLSAD TYLVTSGKKW SREMNAASLD ILGAASVIAA EADQALETSK  900
LLSGKSSKTK SRGPAVIIEA SNSFYGAEDE RETAAADVLA GICGSLSSEA MSSCITSSVD  960
PGEGNQDWKH QKVGSSIRRP LTPEVTQSVD DGTCSDESCG EMDPTDWTDD EKSLFVQAFS  1020
SHGNDFLMIS RCVRTKSRDQ CKVFFSKARK CLGLDTLPTG SGSRGTHASN DTNEGGSDTE  1080
DAGIVETGSI VCSDKSGSRI DEDLLHVNQD VSKPEVTVAE SNILEESCGP GQLDHKDAGL  1140
ELAGVIDPIA RGRVVDVLVK DSATVVNTQR SDVQLMIQDD KVEVNPAIYR SGAEVAPHDS  1200
GAAKDVESSG ISSAPAEFAS PGDGLLKPST AGDNVHHLQS GSSVMNRGGE LCRDLEGFPL  1260
DLSIKADLHD VITHTVGAES IEHSFQDSFL RKCSRSISHN SVAELPLLEQ SKDEFGSSCS  1320
SKADKPCRNG NVKLFGKILS KSSSLDKPTC NTVLNDEKAK HHSSSGGKFD VKLAVNQDLK  1380
GNSTCFAEDH NSFMGLENVP IRSYGFWDGT KIQTGFPTLP DSALLLAKYP SAFSNYTISS  1440
PKPEKQTVAA VAVSSGNECN MNGACVFSPR EFSSSNGGSV IDYNNGSRKQ EGLQHPFRLE  1500
MKQRSKDVVV FPEVQRNGFE ALQQQGNGIV KLNVVQGGDN IIGGSCSGIS DPVAALKLHY  1560
ATQQYGNGKN GNGNGNGHAV NGIVREEPWR SQGGGGDVGR
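
The sequence above is laid out in blocks of ten residues, with the position of the last residue printed at the end of each full line. The sketch below assumes that layout and 1-based, inclusive coordinates, and shows how the first Myb_DNA-binding hit (794 to 835) can be sliced out of the line ending at position 840. The function names are hypothetical helpers, and the final sequence line, which carries no trailing number, would need separate handling.

```python
def parse_block_line(line):
    """Split one numbered sequence line into (residues, end position).

    Assumes whitespace-separated residue blocks followed by a trailing
    integer giving the position of the last residue on the line.
    """
    *blocks, end = line.split()
    return "".join(blocks), int(end)


def slice_1based(residues, first_pos, start, end):
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at sequence position first_pos."""
    return residues[start - first_pos : end - first_pos + 1]


# Line copied verbatim from the protein sequence above.
line = "DPCAVEEERT MINPWTFEEK EIFMDKLATH GKDFRKIASF LDHKTTADCV EFYYKNHKSE  840"

residues, end_pos = parse_block_line(line)
first_pos = end_pos - len(residues) + 1  # position of the first residue on the line

domain1 = slice_1based(residues, first_pos, 794, 835)
print(first_pos, domain1)
```

The extracted segment can then be compared with the ungapped query row of the first alignment in the Signature Domain section.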
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
4a69_C  2e-18   758          843        9          94       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D  2e-18   758          843        9          94       NUCLEAR RECEPTOR COREPRESSOR 2
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_021864415.1  0.0      uncharacterized protein LOC110803228 isoform X1
TrEMBL  A0A0K9QWA8      0.0      A0A0K9QWA8_SPIOL; Uncharacterized protein
STRING  XP_010669645.1  0.0      (Beta vulgaris)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  1e-146   MYB family protein