PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID NNU_023894-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; stem eudicotyledons; Proteales; Nelumbonaceae; Nelumbo
Family MYB
Protein Properties Length: 1420aa    MW: 158413 Da    PI: 8.3643
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
NNU_023894-RAgenomeCASView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding26.31.7e-08442486246
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      ++W+  Ed++l+  v+q G  +W  Iar++g+gRt+ qc  r+q 
    NNU_023894-RA 442 NPWSNNEDKKLLFIVQQSGLYNWIDIARELGTGRTPFQCLARYQR 486
                      69*****************************************96 PP

2Myb_DNA-binding43.48e-14495540248
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                        WT+++d +l  av+ +G ++W +Ia+ +  gRt+ qc +rw+k l
    NNU_023894-RA 495 RDWTEDDDAQLRAAVETFGEDDWQLIASNLE-GRTGTQCSNRWRKTL 540
                      57*****************************.************975 PP

3Myb_DNA-binding50.83.9e-16548591246
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      grWT++Ed++l+ av ++G++tW +Ia+ ++ gRt  qc++rw +
    NNU_023894-RA 548 GRWTADEDKRLKVAVMLFGPKTWMKIAQFVP-GRTQVQCRERWVN 591
                      8******************************.***********87 PP

4Myb_DNA-binding48.81.6e-15600642246
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      g+WT+eEd +l+ a+ q+G   W+++a+ ++  Rt++qc+ rw  
    NNU_023894-RA 600 GPWTEEEDSRLKAAILQHGYC-WSKVAASVP-PRTDNQCRRRWKV 642
                      79*****************99.*********.9**********86 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500905.053337435IPR017877Myb-like domain
SMARTSM007170.028341437IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.607.4E-7343360IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.607.4E-7408448IPR009057Homeodomain-like
SuperFamilySSF466895.47E-15418485IPR009057Homeodomain-like
PROSITE profilePS500909.059436488IPR017877Myb-like domain
SMARTSM007172.8E-5440490IPR001005SANT/Myb domain
PfamPF002491.1E-6442486IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.601.9E-13449496IPR009057Homeodomain-like
SuperFamilySSF466892.83E-19474543IPR009057Homeodomain-like
PROSITE profilePS5129420.565489544IPR017930Myb domain
SMARTSM007174.5E-15493542IPR001005SANT/Myb domain
PfamPF002491.5E-12495539IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.609.8E-17497543IPR009057Homeodomain-like
CDDcd001671.14E-12497540No hitNo description
Gene3DG3DSA:1.10.10.604.7E-17544593IPR009057Homeodomain-like
SuperFamilySSF466892.11E-26545640IPR009057Homeodomain-like
PROSITE profilePS5129412.369545593IPR017930Myb domain
SMARTSM007176.6E-13546595IPR001005SANT/Myb domain
PfamPF002492.1E-14548591IPR001005SANT/Myb domain
CDDcd001673.09E-11549593No hitNo description
PROSITE profilePS5129424.013594648IPR017930Myb domain
Gene3DG3DSA:1.10.10.602.0E-18594642IPR009057Homeodomain-like
SMARTSM007175.3E-13598646IPR001005SANT/Myb domain
PfamPF002491.3E-13600642IPR001005SANT/Myb domain
CDDcd001676.83E-12601643No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1420 aa     Download sequence    Send to blast
MASPRGDIDD EEEISGTDKD DALDEDMEAL RRACMITGTD LNDIKAASAD AADASDAEGN  60
SEDVDDIELV RSIRERFSIP QVDNPPIFMK PLRSIPPVTS DDEEDDFEIL RAIQMRFSRY  120
QSDAHKNKKN DHLHSSEWGT PNDLSVDICN SEDLGSGMLN SEGTHNVSQP LESFGGTGPG  180
NQPSCSFEWH QPEARKLSPL PLKYSSFPKS GQMFIDTIKK NRSCQKFIRS KLIQIEARIE  240
ENKKLMERVR ILKDFQISCK KRTGRALSQK KDPRVQLISL PKPRSSQNLK TASCHGAILI  300
RYFWCLQVND KKVSALSLGP AENSHVAEYK VVLSMLPHSL NRQPWTNVEK ENIRKGIKQQ  360
FQEMLLQKSM ELYGDLEGSG DSNAFDESIA SITDLEITPE KIRSFLPNVD WERLASMYVL  420
GHSGAECEAR WLNFEDPLIN HNPWSNNEDK KLLFIVQQSG LYNWIDIARE LGTGRTPFQC  480
LARYQRSLNA HIMKRDWTED DDAQLRAAVE TFGEDDWQLI ASNLEGRTGT QCSNRWRKTL  540
HPARQRVGRW TADEDKRLKV AVMLFGPKTW MKIAQFVPGR TQVQCRERWV NSLDPSLNLG  600
PWTEEEDSRL KAAILQHGYC WSKVAASVPP RTDNQCRRRW KVLYPHEVPL AQAARMIQKA  660
ALISNFVDRE AERPALGPHD FLPLPGMDSK SKTMNGNNTQ KEKKNSREKL KPKKKDTTTC  720
DAGKKINSRK SRTKAQVFTE EVLRLANVND VEASMRDDTI SKKQEKVPKR HAKVSKCTKP  780
AEENQGLSSP NDPEFLRIMD GNVQSSKGDN ATLKKKNPRP KRNKHVQPVE DHQLLLLSPD  840
NSSLLRITNG DDVDPSGGNY AISNKEKMPL VFLEGSKSTG PSKDQDIPLS PNHSPVSRMT  900
TDVNIVETLG KESRASKKVP NPCPERKKCI ITNGNNVQNS SGDNRVLRKS RKSKLSSKRE  960
KSIDSAEKHQ ELPLDTENSA DLRVNNGETL DNVNGETIPN KKRKVLNSCR KRNKHIELAG  1020
KAQDLSLPQE QSTFDAIAAA LEKPTAVVVA QQDGSKVSGT ENGLCSQGNL ETMDVDDASL  1080
TSLLHDALKK KVMPKPCPKR KKCTITNGDN RVLKNSRKSK LSSKREKSIE PAEKNEELLL  1140
DPVHSGDLRV SNDETLANVN GDTVPHKKRK VLSSCRKRNK HIELAGKAQD LSLPREQSAF  1200
DVTDVSLEKP TAIDSALQDS SKVSGTEDGV SQENPVTIDT DDVTLVSLLS DALKKRKLKL  1260
VCNGSQAVSF PRRKRNKILN MLSERQHPGH STLNTMDAND VSLHGMAGRE DCCEQGDLGK  1320
STNNDSVLGT AVSNDVLSVF LLSKDLSKRE PAASLQGNDV STSTTLTEEA VSLRNLKNGL  1380
HQKSLITSSS DVLGSECGEK HQHDGNGVVP PVMHDCTTEK
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1h88_C4e-324106402155MYB PROTO-ONCOGENE PROTEIN
1h89_C4e-324106402155MYB PROTO-ONCOGENE PROTEIN
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
112541274KRKLKLVCNGSQAVSFPRRKR
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010243797.10.0PREDICTED: uncharacterized protein LOC104587774 isoform X1
TrEMBLA0A1U7YTW80.0A0A1U7YTW8_NELNU; uncharacterized protein LOC104587774 isoform X1
STRINGXP_010243797.10.0(Nelumbo nucifera)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18100.10.0myb domain protein 4r1