PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID WALNUT_00007314-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Juglandaceae; Juglans
Family MYB
Protein Properties Length: 1691aa    MW: 185401 Da    PI: 5.9453
Description MYB family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
WALNUT_00007314-RA   genome  JHU     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  28.3   4.1e-09  792    833   3          46
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         +WT+eE e + d  + lG++ +++Ia+ +  ++t  +c+++++k
  WALNUT_00007314-RA 792 PWTPEERETFMDKLATLGKD-FRKIASFLD-HKTTADCVEFYYK 833
                         8*****************99.*********.***********98 PP

2    Myb_DNA-binding  29.4   1.8e-09  1006   1046  3          45
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
     Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                           WT eE   +++av  +G++ +  I+r++  +R++ qck ++ 
  WALNUT_00007314-RA 1006 EWTDEEKSMFIQAVSSYGKD-FVMISRCVR-TRSRDQCKVFFS 1046
                          7*****************99.*********.********8776 PP
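
The Start and End columns above appear to be 1-based, inclusive residue positions in the full protein. The following minimal Python sketch checks that reading for domain 1 by slicing residues 792-833 out of the 781-840 line of the Sequence section and comparing the result with the ungapped query row of the alignment. The helper name slice_region is hypothetical and is not part of any PlantTFDB tool.

```python
def slice_region(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment whose
    first residue sits at position fragment_start in the full protein."""
    lo = start - fragment_start
    hi = end - fragment_start + 1
    if lo < 0 or hi > len(fragment):
        raise ValueError("region falls outside the supplied fragment")
    return fragment[lo:hi]


# Residues 781-840 of WALNUT_00007314-RA, copied from the Sequence section
# with the spaces between groups removed.
fragment = "CAVEKERAMINPWTPEERETFMDKLATLGKDFRKIASFLDHKTTADCVEFYYKNHKSDCF"

# Query row of the first Myb_DNA-binding alignment; '-' marks alignment gaps.
aligned_query = "PWTPEERETFMDKLATLGKD-FRKIASFLD-HKTTADCVEFYYK"

domain = slice_region(fragment, 781, 792, 833)

# The sliced region should equal the aligned query once gaps are stripped.
assert domain == aligned_query.replace("-", "")
print(domain, len(domain))
```

The same call with the 1006-1046 coordinates would need the sequence lines covering those positions as the fragment.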

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          1.27E-13  776    836   IPR009057    Homeodomain-like
PROSITE profile  PS51293           14.931    788    839   IPR017884    SANT domain
SMART            SM00717           3.3E-8    789    837   IPR001005    SANT/Myb domain
Pfam             PF00249           9.0E-7    791    833   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  7.6E-6    792    840   IPR009057    Homeodomain-like
PROSITE profile  PS51293           10.665    1002   1053  IPR017884    SANT domain
SMART            SM00717           1.5E-8    1003   1051  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          8.58E-11  1004   1053  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  6.6E-6    1005   1047  IPR009057    Homeodomain-like
Pfam             PF00249           6.5E-7    1006   1046  IPR001005    SANT/Myb domain
CDD              cd00167           3.76E-7   1006   1045  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1691 aa
MPPEPLPWDR KDFFKERKHD RSSESPGSIA RWRDSSHHGS REFNRWGSAD CRRPPGHGKQ  60
GGWHLFSEES GHGYVPSRAG DKMLEDDSCR PYVSRGDVKY GRSSRENRFS QRGWKFHSWE  120
TSNVSPNTPA RLLDVSNNDL RSVDDTIPCP SHPSSDFVNT WDQLHLKDQH DKMGGVNGLG  180
TGQRCDRENS LGSTDWKPLK WSRSGSLSSR GSSFSHSSSS KSMGGVDSNE TKTDIQLKNS  240
TPVQSPSGDA AACVTSAAPS DETTSKKKPR LGWGEGLAKY EKKKVEGPDI SMDKDGAVFS  300
TSITEPIHSF ISNMADKSPR VAVFSDCASP ATPSSVACSS SPGVEEKSFG KAVNMDIDVS  360
NICVSPSAGS INHLEGFPLD LEKVDMTLMA NLGSSLIELL QSDDSSSVDS SFVRSTAINK  420
LLICKSEISK LLEVTESEID SLENELKFLK SESESGDPYP AASSSVLAEK TATPCVEQDV  480
ASNLFHRPEP LQIVSSGEAV TEKMPFSNGD LEDVHAAIKD EDIDSPGTAT SKFVEPLSLA  540
KMVPLSDKVE HGDSSGNCNA IQIKSQNEYV KCLVPGSVGE KTVAPVSSEV SLSTDGQYML  600
CDSIVASNRK CANRACGVFD KLLPREQHMT DVSRTVNSSS CQSASSVKEK FAKRKQFLRF  660
KERVITLKFK VFQHLWKEDM RLLSVRKHRP KSQKKFDLSL RTALTGNQKP RSSIRSRFSS  720
PAGNLSLVPT AEMINFTSKL LSDSQVKLCR NALKMPALIL DKREKLLSRF ISSNGLVEDP  780
CAVEKERAMI NPWTPEERET FMDKLATLGK DFRKIASFLD HKTTADCVEF YYKNHKSDCF  840
ERTKEKEAKA FCTNTYLVTS EKKWSREVNA ASLDILGTAS MMAACADDYE RNQHSSAEQV  900
VLGGYGDSKT SWGDDGILER SNHLDIIRDE RETVAADVLA GICGSLSSEA MISCITSSVD  960
PGESYREWKC QKVDSGIKWP SIPDVMHNFD DETCSDESCG EMDPSEWTDE EKSMFIQAVS  1020
SYGKDFVMIS RCVRTRSRDQ CKVFFSKARK CLGLDLIHPG PRNVGTPVTD DANGGGSDAE  1080
DACVVEAVET GSVICGNKLG CKLDEDLPLI TMNKNDDESD PAKIVNFESD RNRSEENNGM  1140
GHMDYEDFEA VETSVSDACQ AENIPELIVH GDSNIMNSVE KHSDSVHTRR STVVLAATET  1200
GGDQVIEQST SILEMASVRE GIKPVSSSPE ALMENKGLAS VGFENELSGQ ELLLPKCSLI  1260
RTHEKCGPSG LQSSVQDSNT IGNCSHPAAE SSCSGLHLNP EYQHKVSLEL DSMEKPYVIS  1320
LPLQNSPPTA TSPSQDTASI LCDKTLNQDR LSSTLDFRGN VPKQSPKSIS RDDFHQNLCS  1380
HSILSHDESS QILGGYPLQI SNKKEMNGDV SSRKLSEVQT LSQSESNVST RSVAQDCYLQ  1440
KCNSSKPHSS VAELPRLSQK IEKTILHSRA HSRSLSDTDK PCRNGDVKLF GQILSHPSST  1500
QKSNSNTHEN EEKGIHNSNL SSKLSNLKFS GYHDVDGNSS LLKEMSSSNG VVDCQLYRNR  1560
EGSKVQPFTV DMKQRQDIFS EIQRRNGFEA VSSLQPQGRG MVGMNVVGRK VIVGGPCTVV  1620
SDPVAAIKMH YAKSDQYGGQ TGSIIGEEES WRGKGDLGRA FVIPLQVLAF GEIPKVNTVQ  1680
VLEIALPVVF N
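
The sequence above is laid out in groups of ten residues with a running position count at the end of each full line, so it has to be flattened before it can be used elsewhere. A minimal Python sketch of that clean-up, assuming the block has been copied into a string; the function name parse_sequence_block is hypothetical, and only the first two lines of the record are used here for brevity.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a grouped, position-numbered sequence block into one string."""
    residues = []
    for line in text.splitlines():
        # Remove the spaces between groups and the trailing position count.
        residues.append(re.sub(r"[\s\d]", "", line))
    return "".join(residues)


# First two lines of the protein sequence, exactly as displayed in the record.
block = """\
MPPEPLPWDR KDFFKERKHD RSSESPGSIA RWRDSSHHGS REFNRWGSAD CRRPPGHGKQ  60
GGWHLFSEES GHGYVPSRAG DKMLEDDSCR PYVSRGDVKY GRSSRENRFS QRGWKFHSWE  120
"""

seq = parse_sequence_block(block)
print(len(seq))  # 120
```

Applied to the full block, the resulting length can be compared with the stated 1691 aa as a quick check that no line was lost in copying.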
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4a69_C  4e-15    750          841        4          94       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D  4e-15    750          841        4          94       NUCLEAR RECEPTOR COREPRESSOR 2
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_018828683.1  0.0      PREDICTED: uncharacterized protein LOC108997053 isoform X1
TrEMBL  A0A2I4FAL9      0.0      A0A2I4FAL9_JUGRE; uncharacterized protein LOC108997053 isoform X1
STRING  EMJ21510        0.0      (Prunus persica)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF4986              33           52
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  0.0      MYB family protein