PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Lj5g3v1839400.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Loteae; Lotus
Family MYB
Protein Properties Length: 1406 aa    MW: 152272 Da    pI: 6.4067
Description MYB family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Lj5g3v1839400.1  genome  Kazusa  View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  28.2   4.5e-09  797    838   3          46
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT eE e++++ ++ lG++ +++Ia+ +  ++t  +c+++++k
  Lj5g3v1839400.1 797 PWTSEEREIFLEKFAALGKD-FRKIASFLD-HKTTADCVEFYYK 838
                      8*****************99.*********.***********98 PP

2    Myb_DNA-binding  31.7   3.5e-10  1013   1054  3          46
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46  
                        WT +E   l++av  +G++ +++Iar++g +++ +qck ++ k
  Lj5g3v1839400.1 1013 DWTDDEKAALLQAVSSFGKD-FAKIARCVG-TKSQEQCKVFFTK 1054
                       5*****************99.*********.********88766 PP
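
The Start and End values of the two Myb_DNA-binding hits can be cross-checked against the protein sequence. The sketch below assumes the coordinates are 1-based and inclusive, and that each sequence row holds 60 residues (as the running totals at the row ends suggest). The names `CHUNKS`, `ALIGNED`, `slice_1based` and `check_domains` are illustrative only and are not part of PlantTFDB; the residue strings are copied from the Sequence section of this record.

```python
# Sequence rows copied from this record, keyed by the 1-based position of
# their first residue (assumption: 60 residues per row).
CHUNKS = {
    781: "LVEDPLAVEKEKAMINPWTSEEREIFLEKFAALGKDFRKIASFLDHKTTADCVEFYYKNH",
    961: ("CITSSVDPVDGNTDRKFLKANPLCRKPLTPDVTQNVEDETCSDQSSDEMDLSDWTDDEKA"
          "ALLQAVSSFGKDFAKIARCVGTKSQEQCKVFFTKARKCLRLDLMHPIPGNGRSLGNDDAN"),
}

# Query rows of the two domain alignments, gap characters included.
ALIGNED = {
    (797, 838): "PWTSEEREIFLEKFAALGKD-FRKIASFLD-HKTTADCVEFYYK",
    (1013, 1054): "DWTDDEKAALLQAVSSFGKD-FAKIARCVG-TKSQEQCKVFFTK",
}


def slice_1based(chunk_start, chunk, start, end):
    """Return residues start..end (1-based, inclusive) from a chunk whose
    first residue sits at position chunk_start."""
    lo = start - chunk_start
    hi = end - chunk_start + 1
    if lo < 0 or hi > len(chunk):
        raise ValueError("requested range lies outside this chunk")
    return chunk[lo:hi]


def check_domains():
    """Compare each sliced region with its ungapped alignment row."""
    results = {}
    for (start, end), aligned in ALIGNED.items():
        # Use the chunk with the largest start position not beyond the domain start.
        chunk_start = max(s for s in CHUNKS if s <= start)
        region = slice_1based(chunk_start, CHUNKS[chunk_start], start, end)
        results[(start, end)] = region == aligned.replace("-", "")
    return results


if __name__ == "__main__":
    for coords, ok in check_domains().items():
        print(coords, "matches alignment" if ok else "MISMATCH")
```

Both regions are 42 residues long, which agrees with the 44 alignment columns (HMM positions 3 to 46) minus the two gap columns in each query row.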

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          1.35E-13  781    841   IPR009057    Homeodomain-like
PROSITE profile  PS51293           15.635    793    844   IPR017884    SANT domain
SMART            SM00717           9.9E-9    794    842   IPR001005    SANT/Myb domain
Pfam             PF00249           1.9E-6    796    838   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  1.1E-5    797    838   IPR009057    Homeodomain-like
PROSITE profile  PS51293           14.067    1009   1060  IPR017884    SANT domain
SuperFamily      SSF46689          2.99E-10  1009   1060  IPR009057    Homeodomain-like
SMART            SM00717           4.0E-9    1010   1058  IPR001005    SANT/Myb domain
Pfam             PF00249           1.2E-8    1013   1054  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  4.5E-7    1013   1054  IPR009057    Homeodomain-like
CDD              cd00167           7.51E-8   1014   1054  No hit       No description
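
The hits above from different member databases cluster around two stretches of the protein. As a rough illustration, the following sketch merges overlapping Start/End intervals into regions. The `HITS` list is copied from the table, while `merge_hits` is a hypothetical helper and not something PlantTFDB or InterPro provides.

```python
# (database, start, end) triples copied from the Protein Features table.
HITS = [
    ("SuperFamily", 781, 841),
    ("PROSITE profile", 793, 844),
    ("SMART", 794, 842),
    ("Pfam", 796, 838),
    ("Gene3D", 797, 838),
    ("PROSITE profile", 1009, 1060),
    ("SuperFamily", 1009, 1060),
    ("SMART", 1010, 1058),
    ("Pfam", 1013, 1054),
    ("Gene3D", 1013, 1054),
    ("CDD", 1014, 1054),
]


def merge_hits(hits):
    """Merge overlapping intervals.

    Returns a list of (start, end, sorted database names), one per region.
    """
    regions = []  # each entry: [start, end, set of database names]
    for db, start, end in sorted(hits, key=lambda h: (h[1], h[2])):
        if regions and start <= regions[-1][1]:
            # Overlaps the current region: extend it and record the database.
            regions[-1][1] = max(regions[-1][1], end)
            regions[-1][2].add(db)
        else:
            regions.append([start, end, {db}])
    return [(s, e, sorted(dbs)) for s, e, dbs in regions]


if __name__ == "__main__":
    for start, end, dbs in merge_hits(HITS):
        print(f"{start}-{end}: supported by {', '.join(dbs)}")
```

With the values in this record the merge yields two regions, 781-844 and 1009-1060, each enclosing one of the Myb_DNA-binding signature domains.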
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1406 aa
MPPEPLPWDR KDFFKERKHE RSESVGSVAR WRDSSHHRDF HRWGSAEFRR PPGHGKQGGW  60
HLFSEESGHG YAGSRSGDKM VEEDSRPSIS RGDGKYGRSG SRENRGSSGP RDWRGHSWEN  120
SNGSQNFSRR PPDVNNDQRS VDDTLAHSSH PHSDFANNSW DQHHLKDQND KMGGFNGTGT  180
GPRCDRENSL GAADWKPLKW TRSGSLSSRA SGFSHTSSSR SMGGADSYEV KAELQPKNAT  240
ASKSHSGEAA TARVTSSAPS EDTTSRKKPR LGWGEGLAKY EKKKVEGPEL SVNKDGPVSS  300
TSNMEPCNVF SPNLVDKSPK LTGFSDCASP ATPSSVACSS SPGVDDKLFS KSANVDSDFS  360
NLTGSPAPGS QNHLQKFSFN LHNLDIYSLN NLGSSIVDLV QSDDPSSVDS GLGRSSAINK  420
LLIWKADISK VLEMTETEID SLENELKSLK SEPGDKCPPA PGSQVVGNNE IFCEDLAGVN  480
KVTRPVPLKI VSSDEPDVEK MPQSNNLHSI HENANEEGID SPGTATSKFV EPLPLIKAVS  540
SCEARGYDNF SADLNARQSA AVKSLVPCTT RRNASVSACG DSNTSMEVKV SLDASSGASL  600
SYSSEDIYNT IISSNKEIAD RAHDAFAKLL PKECCKIDNV GANNGSCTGG LIVEKIAEKK  660
QFARFKERVI ALKFKALHHV WKEDMRLLSM KKCRQKSHKK NELSVRTTYN GNQKNRSSIR  720
SRFTSPGNHL SMVPTSEIIN FTSTLLSESQ VKAQRNALKM PALILDEKEK LISKFISSNG  780
LVEDPLAVEK EKAMINPWTS EEREIFLEKF AALGKDFRKI ASFLDHKTTA DCVEFYYKNH  840
KSDCFVKLKK EGLCKLGKSF SAKTSLVASG KKWNREVNAA SLEILSAASV MADGMSGNKK  900
MRSRSFLLGG YGNVKKSKGE DSFIDRSSSF DIHGDERETA AAADVLAGIC GSLSSEAMSS  960
CITSSVDPVD GNTDRKFLKA NPLCRKPLTP DVTQNVEDET CSDQSSDEMD LSDWTDDEKA  1020
ALLQAVSSFG KDFAKIARCV GTKSQEQCKV FFTKARKCLR LDLMHPIPGN GRSLGNDDAN  1080
GGGSDTDDAC VVETGSAVGT DKSGTKTDED LPSMNAFHDE SNPVEPRNLS AELNESKENN  1140
LTAVDLEDIN LISDACAIKV ESKLGSDDDS EVVLASPDKS ASVSERAAMI MSGSIEGGKE  1200
RANKLGGAEL ISAPEVVELR ECNSVPADRL TSEVSLGGLG NKLKTQRVLS PHCFDDRDDK  1260
HEANTGVVEL KSSVQDSSTS LNVSLSSVGS SYSGLCFDSE NKHVFVGKPP ISALSLKEFH  1320
PTANSLLQNA AADVQSEKAA SQVQLSSTSD IQGIRDMHCH NPISNGDHQL PLPGNHLMAS  1380
ILQGYPLQVP IKKEVNVNMN CSGSED
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4a69_C  5e-17    753          845        2          93       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D  5e-17    753          845        2          93       NUCLEAR RECEPTOR COREPRESSOR 2
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    690    696  KKCRQKS
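
The NLS coordinates can be checked against the protein sequence in the same way as the domain coordinates. The sketch below uses the sequence row covering residues 661-720 (copied from the Sequence section, assuming 60 residues per row); `residues` and `locate` are illustrative helper names. Read as 1-based inclusive positions, 690 to 696 gives MKKCRQK, whereas the listed motif KKCRQKS sits at 691 to 697. One possible reading is that the NLS table counts from 0, but that is only an inference from this single record.

```python
# Residues 661-720 of the protein, copied from the Sequence section.
ROW_START = 661
ROW = "QFARFKERVIALKFKALHHVWKEDMRLLSMKKCRQKSHKKNELSVRTTYNGNQKNRSSIR"

NLS = "KKCRQKS"                        # motif as listed in the NLS table
REPORTED_START, REPORTED_END = 690, 696


def residues(start, end):
    """Return residues start..end of the protein (1-based, inclusive)."""
    return ROW[start - ROW_START:end - ROW_START + 1]


def locate(motif):
    """Return the 1-based inclusive (start, end) of the first occurrence
    of motif within ROW, or None if it is absent."""
    i = ROW.find(motif)
    if i < 0:
        return None
    start = ROW_START + i
    return (start, start + len(motif) - 1)


if __name__ == "__main__":
    print("1-based slice of reported range:", residues(REPORTED_START, REPORTED_END))
    print("shifted by one residue:         ", residues(REPORTED_START + 1, REPORTED_END + 1))
    print("motif found at (1-based):       ", locate(NLS))
```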
Cis-element
Source       Link
PlantRegMap  Lj5g3v1839400.1
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID           E-value  Description
Refseq  XP_027367113.1   0.0      uncharacterized protein LOC113873262 isoform X2
TrEMBL  A0A371EYZ9       0.0      A0A371EYZ9_MUCPR; Nuclear receptor corepressor 1 (Fragment)
STRING  GLYMA20G31871.1  0.0      (Glycine max)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  1e-161   MYB family protein