PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Dusal.0370s00002.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Dunaliellaceae; Dunaliella
Family MYB_related
Protein Properties Length: 2721aa    MW: 282864 Da    PI: 9.0333
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Dusal.0370s00002.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding23.81.1e-0714341477347
                            SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
       Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47  
                             W++ E + + d++ q++++ ++tIa+ ++ +Rt+ +c+ +++k+
  Dusal.0370s00002.1.p 1434 QWSEAERKVFMDLFLQHPKE-FRTIASALPGNRTAGDCVAFFYKH 1477
                            6*******************.**********************98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.25E-914261480IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.608.7E-514291479IPR009057Homeodomain-like
PROSITE profilePS5129310.8114301482IPR017884SANT domain
SMARTSM007171.1E-514311480IPR001005SANT/Myb domain
CDDcd001670.0084914351477No hitNo description
PROSITE profilePS512935.35617431791IPR017884SANT domain
SMARTSM007171217441789IPR001005SANT/Myb domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2721 aa     Download sequence    Send to blast
MGDRRFPLRG GFHGPGPMFG PPGMRGGGGG GRGGPGWRHE PDFGPPVGLG RFGQPPPFLP  60
FPERERARAF SPPGLGRYGP PSNGPPAGFR RPDDGFGGRR FIGGRDDHLP HHRDYGPPFE  120
PRMGRGGSRG PGGGGGGWMP GAGGGSTYQE RSRSAMLMRP RSREPSPPYP VASDLLHPLA  180
RPQLPMSASR VGGSSFSDRE RLRELERARD RSRDRELNPP SSISHSSRSR SSSHHVSRAH  240
SMDGHHHHHH HHSSRVHSPS RSGDGGNTTA EAGELLPPPL VSRKPSGSHK AAAIASTGSP  300
PPSLSSTPRE CTSAWDLRPP PPASAAAKEP PLQPHVRSTN SGSKHGMPPP PSTTLPPDAP  360
HAPSASKPGA PTALSRSTSV PSQNGPSSSS ADPSSTLPLP PHMLGIGGRR EQSTARRPPS  420
PPLPPPPPPV PLPPAPSNRS SSRGQLHGSS CSSAPHKTPP SEPTSPRRSH STRQHQPPLP  480
SHIPPQAGPT SESVAAGAVS HLPLPSSSAR AQAPCLPRPV EPPDAGLPNM VWDIRTLADA  540
PDALGAAANA QQQQQQQEPL QHHAWEKAEI GFGSNKGHPH EEGPLHGPPT AESEKLVTEG  600
ALACAPQPPS AAPPACLPPA MRAASGVMDH TLPPPPPPPP SHLQHRPSTT ARAESGNAER  660
PLPSASPWLG APANTLLLKP QRHLSSSMDP LSDSAAAAQR SSPAGLSRSG GQGAWEHANA  720
SSSMLGLPGS LPWQHSLSQL PPQQQQQQQQ QQQQQVPLQP QPQPTPAPQQ NRRFGFGISR  780
RGSSMAKVGD SLGATGEGSN GSNTAAAAAA ANTQQRPAVQ SKQQQQQQQQ QHHHHHHQQQ  840
QQHQHQQQQH SGTAAWAAEQ QKGEGVLEKA SDQAANQPGA AHQQAPHTQG AATTRASTPK  900
HTSGADAARA QGGELPQVKE EWQGVEGKGV ATASPSVGGM KPEGKCAVAK KAVQELQHKM  960
QQQQEELEEK QRQQRQKEEE DRQQQVRKEE EEKERQLQRQ RHREVQQQEE KRREHLEQQQ  1020
QLLQEEHARQ QQQQQQRHQA ALELSARIEL LEAEISGLDT DVSSMQHQRT CMGSRMEELE  1080
RIAAASNAQQ FAAQQLSLEQ GPGGRGSSSS SSSSSGDASG SEEEDEEGHV SSGGSSSSAS  1140
SSDGSVSEQE AAAPEPVFHE GRGRGRGRGR GRGGRGRSVA APVRAEKRRS RAHKVLLPAK  1200
VEAASYMQLS SSQTESRLVE SSRETARASR EKVLALLPDD MAEQVRTSLE SGKPFTPLYT  1260
QPQSIPSWEA SEAAHEAQAP ALGAHLSMRA AAVRAKQSAL AALYTQHVAT LKRTLAESKA  1320
GANAGSEDDS QPQPPSTTPG GPAGLPQTPT NITRSSSRLC SAGSLAGMFG VARSDFDEQR  1380
ILSNLQAVEL LKEVTVMPDM ELDPWQARWQ AFDNRNGLNP DPLAELEARQ KSRQWSEAER  1440
KVFMDLFLQH PKEFRTIASA LPGNRTAGDC VAFFYKHQKL DEWANVRRKQ QLKKRKMNCD  1500
AKRHMVSPLV LAPIAKARGM QQQQHQQQQQ QQQQQQGTYA CNPPLDANHH SAFNSPLTRP  1560
ATAPPTAFTD FTPAPPPPPP SNSYPHPHST TLHHNHTPPH ASALANGVLH AAFPPPPTST  1620
RGRGRGRGRT RAPPQVAPPD HGPPLPPQPP VQGGGTAAHH AGSSTVPSTV QAGLGHLQGM  1680
PAAGAGAGGG GGVGGGAGGV VGGLTGLLMG EGDGNSQPAA AYVNSLPPRS TPDTAGALGR  1740
SGNSNGGWTE GEYVECVRAH GRDFRLISQQ LNMRTESTAK HFYYKNRHRL GLEQVLRDRE  1800
EGLPGAAKQA SRGGNALASP PAGSCGAGVQ LMRTLAQIQC VWYVASAQGL PHNSLRHSSL  1860
LSSLHGTLDD TSAALLLSLH PPQSSPSPAL HPQLFVQGLG LHTPQQLPNA HLPSLPHSHH  1920
PHHLLQQQQQ APAAAAAAAA LGSAAATPWM LPHLLLPGAA FMEQPQLQQQ QQQQPTSFQG  1980
LLGHLSAPPG AVLHPPHSQS QGPGGPGSSS SDCLASPPGI PLHQNSQGLS QERGSSHIRQ  2040
GPVVGLGEGI GGWGGAAGSS VGNQVPGLGV GEGVGGSAGA GSWGSKPGGE GGNAGGKGSS  2100
SAASQSQGQS SCNGPAMGIP TVLDLPLLDA GSIARHAAAA AASSGGPRKP LSSGVMPPLF  2160
QPTSSSPGQP TGSQPTPSAQ QPPQAAGAAS PSPQPHVLEA VLHQQACLAA LAPPTLLPHG  2220
LFGHHTNAPT NLLQPPQRKL PSPSVPPAHH ASTPLMNSLL ELSMLSPTLP SHAHIHPPPL  2280
FAASSSNRPL SHSGDSTAPP LTSAHKTGGL ECGGTALGVP RPPTATTLPS APDPAAAAAA  2340
AAAAAAAAAA FADSSGGHVA PDKMSCMGEG GSGNASVSGS AMLPTPPPSA ASGTAAAGGE  2400
ESRAKVRARS REEDLGGTDA GQEDGRPLKR QALSLPSTTL ADQTHLSPPT QPSLTQLLLG  2460
PQLGGSSSAP HLKAPPAPSA VPAPVACDTG MPSISNTPLP LDVASLPGLL HASLAPESQQ  2520
HQLQQLRDLL KGQRGGDETP CAGVDLLLKG QCSGDGTLRA VVDPLLKGQR DDVVPCAGKD  2580
LLLNAVPPSA SAPGGILLPA APVPHHDSQH LLTGPPRPES SNFPSGHQGP LAGAGGVPGS  2640
HPLFTADHPL LATGHPLIGS HQLASCVPGH QAGRDVPSAP LSMGSQALSV LDALECSSAT  2700
PESMAAAVLK DLHAGRQSAN *
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
111631172RGRGRGRGRG
216201630RGRGRGRGRTR
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP28831010
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.12e-12MYB family protein