PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID NNU_012420-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; stem eudicotyledons; Proteales; Nelumbonaceae; Nelumbo
Family MYB
Protein Properties Length: 1747 aa    MW: 191933 Da    pI: 6.0576
Description MYB family protein
Gene Model
Gene Model ID  Type    Source
NNU_012420-RA  genome  CAS
Signature Domain
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  25     4.5e-08  765    806   3          46
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT  E e++ +    +G++ +k+Ia+ +  ++t  +c+++++k
    NNU_012420-RA 765 PWTSKEKEIFMEMLSTFGKD-FKRIASFLD-HKTTADCIEFYYK 806
                      8*****************99.*********.***********98 PP

2    Myb_DNA-binding  33     1.4e-10  983    1023  3          45
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                        WT eE  ++++a +++G++ +++I+r++  +R+  qc+ ++ 
    NNU_012420-RA  983 DWTDEEKSIFIQALRLYGKD-FSKISRYVS-TRSKDQCRIFFS 1023
                       7*****************99.*********.********8775 PP
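
The Start and End values in the domain rows are 1-based, inclusive positions in the protein sequence. As a minimal sketch (not part of PlantTFDB), the snippet below maps such coordinates onto a Python slice and checks the result against the first alignment row. The helper name `slice_1based` is hypothetical, and the fragment is residues 721-840 copied from this record's protein sequence listing.

```python
def slice_1based(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive).

    fragment_start is the sequence position of the first residue in fragment.
    """
    lo = start - fragment_start          # 0-based index of the first residue
    hi = end - fragment_start + 1        # exclusive upper bound for slicing
    if lo < 0 or hi > len(fragment) or lo >= hi:
        raise ValueError("requested range lies outside the fragment")
    return fragment[lo:hi]


# Residues 721-840 of NNU_012420-RA, with the spaces between blocks removed.
fragment = (
    "ICRSSLRMPALVVDEKEKRLLRFVTSNGLVEDPCAVEKERALINPWTSKEKEIFMEMLST"
    "FGKDFKRIASFLDHKTTADCIEFYYKNHKSESFGKIKKKLEFSNQGTNIPSSMYLVTSGK"
)

# Domain 1: Start 765, End 806.
domain = slice_1based(fragment, 721, 765, 806)

# Query row of the first alignment; "-" marks gaps and is not a residue.
aligned = "PWTSKEKEIFMEMLSTFGKD-FKRIASFLD-HKTTADCIEFYYK"

print(domain)
print(domain == aligned.replace("-", ""))
```

The slice has `end - start + 1` residues, so the two gap characters in the alignment row have to be removed before comparing.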

Protein Features
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          8.97E-14  749    811   IPR009057    Homeodomain-like
PROSITE profile  PS51293           14.482    761    812   IPR017884    SANT domain
SMART            SM00717           7.9E-8    762    810   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  2.4E-5    765    809   IPR009057    Homeodomain-like
Pfam             PF00249           8.6E-6    765    806   IPR001005    SANT/Myb domain
PROSITE profile  PS51293           14.647    979    1030  IPR017884    SANT domain
SMART            SM00717           2.5E-9    980    1028  IPR001005    SANT/Myb domain
Pfam             PF00249           3.4E-9    982    1022  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          6.55E-11  983    1030  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  3.6E-6    983    1024  IPR009057    Homeodomain-like
CDD              cd00167           6.72E-8   984    1022  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1747 aa
MPPEPLPWDR KDFFKEKKYE RSDALGSVSR WRDSHHGSRE FARWGSDEFR RPPGHGKQGA  60
YQLLSEESGH GCTPSRSSDR MVEDDFCRQS VSRAEGKYSR NSRENKGSVK GHLWDTGDAS  120
VSSFGRQHDI SAQRSVDDLL TYASHPHSDI ENSSLDQLHL KDHHDKMDSV HGLATGHRYN  180
KDHSLGSMAW KPLKWTRSSS LSSRSSGFSH SSSSKSIRAN LDDSKPELQP RKTTPVQSSS  240
GDAAEGVTTL TPFEDTYSKK KQRLGWGQGL AKYEKEKVEG PEETTGRIGL IACSNSPRTS  300
SGPVPSLADK SPRITGLSEC TSPATPSSVA CSSSPGMDDK HYNKVLNIEN DACNLGGSPS  360
HACQNCVEGF SVVLENLEPN KLDDLNSKFA DLLQAEDASS GDSSFMKSAA LNKLMLLKSD  420
VLKALEKTEC EIDLYESELK SLCSEPKKAG SSLTMSKFLQ GALEPCEEAD VASKEFVRPS  480
PLQLVSSDDM LVEVPLLCDG RLDAVNAETK DEDIYSPGTA SSKSVEPVSS MSQISVSDMV  540
KHDECSMQCE AIRPLADVPH YDDAMPLSDA ESVLHSSIMA FNRESARKAY EVFNNLLPSD  600
RHPTFSVGCS NLSSEHNNLI KEKLAMKKRL LKFKERVLTL KLRAFQHLWK EDLRLLSIRK  660
HRAKSQKRFE VSSRTSHSGS QKHRSSIRSR FTSPAGNLTL VPTTEIVDFA GKLLLDSQIK  720
ICRSSLRMPA LVVDEKEKRL LRFVTSNGLV EDPCAVEKER ALINPWTSKE KEIFMEMLST  780
FGKDFKRIAS FLDHKTTADC IEFYYKNHKS ESFGKIKKKL EFSNQGTNIP SSMYLVTSGK  840
KWNREVNAAS LDLLGAASVI AASADISSRV PQYCGGKLFL GYDHDMPRHD DCILEGSSSI  900
DIIGNEKEAA AADVLAGICG ALSSEAMSSC VTSSVDPGDG SQEWKCQKVS STKGRPLTPE  960
VSHTIDDDET CSDESCEEMD SMDWTDEEKS IFIQALRLYG KDFSKISRYV STRSKDQCRI  1020
FFSKARKCLG LDLLYSGPGN EEVPVSCTNG GRSDTEDACV VEMESAICST QSCSRMEVDL  1080
QASVTNINSE VSGHAEPTHL QTDHDRSSEK HVTEHLDQED SEIKVENVVP DDCWALKEPV  1140
SILGSGNNSA DPDVKIDATP EVVSSEDAAR VDAALSAEPS VLLSGTVAFI GDRETGGKVE  1200
IHQTVIFKEE SPSVGGQKEL KQSKLNAAVE LPVQCGSSEE PKIDSEERQH WSEKGLNDRQ  1260
EASSGAEPIS SASTSCCLIP DSSVKENCLP VTATDKRVKE DLISPATYQH QISLELLTSM  1320
QKPQAISWQQ KENCPVSVGL DLPDSSVHYE KSRRGASSSA LDLEVHDDKQ QQKSATTDIY  1380
QQYMLSHNSL NRVDPVQILR GYPLQVLNKK EINGNAETKS SEKSAIVQNF SKMDRNSHCN  1440
QYLVQDLYNE KCTSSRFPHS VAELPLLPKS LEQSSIDHTR SHSLNGSETE EQSRRTGDVK  1500
LFGQILSHPS VPKPNPTSPE NNEKGTSCKP SSNSLNFKFA PNHGIDGNAV TLKLDPNNHS  1560
GLEDIPTRSY GFWDGNRIQT GLSSLPDSAI LLSKYPAAFI DYATSSCRME KQPLPAVAKR  1620
NDRNMGCVSV FPTKDVNGTG GLTDYQVYRS YDGMKLQPFT VDVQRHDILT ELQKRNGLDG  1680
LSSFQHQGRG AVGMNVVGGG ILVGGSCTGV SDPVAAIKMH YATSERYGGQ SGSTRDDKSW  1740
HGGDIGR
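
The listing above groups residues in blocks of ten and ends each full line with a running position counter. As a minimal sketch (not a PlantTFDB tool), the snippet below strips the spaces and counters and rewraps the residues as FASTA. The function names `clean_sequence` and `to_fasta` are hypothetical, and only the first two lines of the listing are used as input.

```python
import re


def clean_sequence(block_text: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block_text).upper()


def to_fasta(seq_id: str, sequence: str, width: int = 60) -> str:
    """Format a sequence as a FASTA record with fixed-width lines."""
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return f">{seq_id}\n" + "\n".join(lines) + "\n"


# First two lines of the listing, copied verbatim with their counters.
excerpt = """
MPPEPLPWDR KDFFKEKKYE RSDALGSVSR WRDSHHGSRE FARWGSDEFR RPPGHGKQGA  60
YQLLSEESGH GCTPSRSSDR MVEDDFCRQS VSRAEGKYSR NSRENKGSVK GHLWDTGDAS  120
"""

seq = clean_sequence(excerpt)
print(len(seq))  # 120, matching the counter on the second line
print(to_fasta("NNU_012420-RA", seq), end="")
```

Running the same cleaning step over the whole listing should give a string whose length equals the stated 1747 aa, which is a quick check that no line was lost in copying.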
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4a69_C  4e-16    728          813        9          93       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D  4e-16    728          813        9          93       NUCLEAR RECEPTOR COREPRESSOR 2
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_010264747.1  0.0      PREDICTED: uncharacterized protein LOC104602664 isoform X1
TrEMBL  A0A1U8APE5      0.0      A0A1U8APE5_NELNU; uncharacterized protein LOC104602664 isoform X1
STRING  XP_010264747.1  0.0      (Nelumbo nucifera)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  1e-147   MYB family protein