PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa19g015550.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family MYB
Protein Properties Length: 1975aa    MW: 225510 Da    PI: 8.5536
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa19g015550.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding22.92e-07588631445
                      S-HHHHHHHHHHHHHTTTT...-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding   4 WTteEdellvdavkqlGgg...tWktIartmgkgRtlkqcksrwq 45 
                      W+t  +  lv+a k ++++   +W+++a+ ++ g+t  qck ++ 
   Csa19g015550.1 588 WSTVQERALVQALKTFPKEtsqRWERVAAAVP-GKTMIQCKKKFA 631
                      ********************************.********9986 PP

2Myb_DNA-binding22.92e-0712531296445
                       S-HHHHHHHHHHHHHTTTT...-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    4 WTteEdellvdavkqlGgg...tWktIartmgkgRtlkqcksrwq 45  
                       W+t  +  lv+a k ++++   +W+++a+ ++ g+t  qck ++ 
   Csa19g015550.1 1253 WSTVQERALVQALKTFPKEtsqRWERVAAAVP-GKTMIQCKKKFA 1296
                       ********************************.********9986 PP

3Myb_DNA-binding22.92e-0719181961445
                       S-HHHHHHHHHHHHHTTTT...-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    4 WTteEdellvdavkqlGgg...tWktIartmgkgRtlkqcksrwq 45  
                       W+t  +  lv+a k ++++   +W+++a+ ++ g+t  qck ++ 
   Csa19g015550.1 1918 WSTVQERALVQALKTFPKEtsqRWERVAAAVP-GKTMIQCKKKFA 1961
                       ********************************.********9986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.287.1101.0E-1995190IPR001623DnaJ domain
SuperFamilySSF465658.64E-2096177IPR001623DnaJ domain
SMARTSM002711.5E-1698173IPR001623DnaJ domain
PROSITE profilePS5007617.99999181IPR001623DnaJ domain
CDDcd062571.18E-1199170IPR001623DnaJ domain
PfamPF002262.9E-1699178IPR001623DnaJ domain
PRINTSPR006258.7E-7104122IPR001623DnaJ domain
PRINTSPR006258.7E-7122137IPR001623DnaJ domain
PRINTSPR006258.7E-7153173IPR001623DnaJ domain
PROSITE patternPS006360158177IPR018253DnaJ domain, conserved site
PROSITE profilePS500906.272458513IPR017877Myb-like domain
SuperFamilySSF466898.58E-7459506IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.8E-4462505IPR009057Homeodomain-like
SMARTSM007175.1E-6462515IPR001005SANT/Myb domain
CDDcd001670.00411465505No hitNo description
PROSITE profilePS512937.972583638IPR017884SANT domain
SMARTSM007172.1E-7584636IPR001005SANT/Myb domain
SuperFamilySSF466895.31E-8585633IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.608.0E-4588632IPR009057Homeodomain-like
CDDcd001676.48E-5588634No hitNo description
SuperFamilySSF465659.16E-20760842IPR001623DnaJ domain
Gene3DG3DSA:1.10.287.1101.0E-19760855IPR001623DnaJ domain
SMARTSM002711.5E-16763838IPR001623DnaJ domain
PROSITE profilePS5007617.999764846IPR001623DnaJ domain
CDDcd062571.18E-11764835IPR001623DnaJ domain
PfamPF002262.9E-16764843IPR001623DnaJ domain
PROSITE patternPS006360823842IPR018253DnaJ domain, conserved site
PROSITE profilePS500906.27211231178IPR017877Myb-like domain
SuperFamilySSF466898.58E-711241171IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.8E-411271170IPR009057Homeodomain-like
SMARTSM007175.1E-611271180IPR001005SANT/Myb domain
CDDcd001670.0041111301170No hitNo description
PROSITE profilePS512937.97212481303IPR017884SANT domain
SMARTSM007172.1E-712491301IPR001005SANT/Myb domain
SuperFamilySSF466895.31E-812501298IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.608.0E-412531297IPR009057Homeodomain-like
CDDcd001676.48E-512531299No hitNo description
SuperFamilySSF465659.16E-2014251507IPR001623DnaJ domain
Gene3DG3DSA:1.10.287.1101.0E-1914251520IPR001623DnaJ domain
SMARTSM002711.5E-1614281503IPR001623DnaJ domain
PROSITE profilePS5007617.99914291511IPR001623DnaJ domain
PfamPF002262.9E-1614291508IPR001623DnaJ domain
CDDcd062571.18E-1114291500IPR001623DnaJ domain
PROSITE patternPS00636014881507IPR018253DnaJ domain, conserved site
PROSITE profilePS500906.27217881843IPR017877Myb-like domain
SuperFamilySSF466898.58E-717891836IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.8E-417921835IPR009057Homeodomain-like
SMARTSM007175.1E-617921845IPR001005SANT/Myb domain
CDDcd001670.0041117951835No hitNo description
PROSITE profilePS512937.97219131968IPR017884SANT domain
SMARTSM007172.1E-719141966IPR001005SANT/Myb domain
SuperFamilySSF466895.46E-819151963IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.608.0E-419181962IPR009057Homeodomain-like
CDDcd001676.48E-519181964No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1975 aa     Download sequence    Send to blast
MPSRRSDSAI KLITYSEELV DGKPFYAFSN CLPVKALNRE PAGHAFHSAA LKLHGCAEEP  60
TDNEDGDKKV GDDKEKEYVP SFNSYANKGK KKSGTQQQDH YALLGLSNLR YLATEDQIRK  120
SYREAALKHH PDKLATLLLL EETEEAKEAK KDEIESRFKA IQEAYEVLMD STRRRIFDST  180
DEFDDEVPSD CLPQDFFKVF GPAFKRNARW SVNQRIPDLG DENTPLKDVD KFYNFWYAFK  240
SWREFPDEEE HDLEQADSRE ERRWMEKENA KKTVKARKEE HARIRTLVDN AYRKDPRIVK  300
RKEEEKAEKQ QKKEAKLLAK KKQEEDAAIA AEEEKKRKEE EEKRAAESAQ QQKKNKEREK  360
KLLRKERNRL RTLSAPLVAQ RLLDISEEDI ENLCMSLNTE QLQNLCDRMG NKEGLELAKV  420
IKDGCNTSRN DEAESKEKES KKTNGGTEPT TRVSQLDSST VKKQPWSKEE IDMLRKGMVK  480
YPKGTSRRWE VVSEYIGTGR SVEEILKATK TVLLQKPDSA KAFDSFLEKR KPSASIASPL  540
STREELGESL PTATTTTKAS PSKETVVANS QSSDNNGEAG GSSDTDGWST VQERALVQAL  600
KTFPKETSQR WERVAAAVPG KTMIQCKKKF AELKEIIRNK KTGVFCYQIS LALPITSGAP  660
VHFYIMPSRR SDSAIKLITY SEELVDGKPF YAFSNCLPVK ALNREPAGHA FHSAALKLHG  720
CAEEPTDNED SDKKVGDDKE KEYVPSFNSY ANKGKKKSGT QQQDHYALLG LSNLRYLATE  780
DQIRKSYREA ALKHHPDKLA TLLLLEETEE AKEAKKDEIE SRFKAIQEAY EVLMDSTRRR  840
IFDSTDEFDD EVPSDCLPQD FFKVFGPAFK RNARWSVNQR IPDLGDENTP LKDVDKFYNF  900
WYAFKSWREF PDEEEHDLEQ ADSREERRWM EKENAKKTVK ARKEEHARIR TLVDNAYRKD  960
PRIVKRKEEE KAEKQQKKEA KLLAKKKQEE DAAIAAEEEK KRKEEEEKRA AESAQQQKKN  1020
KEREKKLLRK ERNRLRTLSA PLVAQRLLDI SEEDIENLCM SLNTEQLQNL CDRMGNKEGL  1080
ELAKVIKDGC NTSRNDEAES KEKESKKTNG GTESTTRVSQ LDSSTVKKQP WSKEEIDMLR  1140
KGMVKYPKGT SRRWEVVSEY IGTGRSVEEI LKATKTVLLQ KPDSAKAFDS FLEKRKPSAS  1200
IASPLSTREE LGESLPTATT TTKASPSKET VVANSQSSDN NGEAGGSSDT DGWSTVQERA  1260
LVQALKTFPK ETSQRWERVA AAVPGKTMIQ CKKKFAELKE IIRNKKTGVF CYQISLALPI  1320
TSGAPVHFYI MPSRRSDSAI KLITYSEELV DGKPFYAFSN CLPVKALNRE PAGHAFHSAA  1380
LKLHGCAEEP TDNEDSDKKV GDDKEKEYVP SFNSYANKGK KKSGTQQQDH YALLGLSNLR  1440
YLATEDQIRK SYREAALKHH PDKLATLLLL EETEEAKEAK KDEIESRFKA IQEAYEVLMD  1500
STRRRIFDST DEFDDEVPSD CLPQDFFKVF GPAFKRNARW SVNQRIPDLG DENTPLKDVD  1560
KFYNFWYAFK SWREFPDEEE HDLEQADSRE ERRWMEKENA KKTVKARKEE HARIRTLVDN  1620
AYRKDPRIVK RKEEEKAEKQ QKKEAKLLAK KKQEEDAAIA AEEEKKRKEE EEKRAAESAQ  1680
QQKKNKEREK KLLRKERNRL RTLSAPLVAQ RLLDISEEDI ENLCMSLNTE QLQNLCDRMG  1740
NKEGLELAKV IKDGCNTSRN DEAESKEKES KKTNGGTEPT TRVSQLDSST VKKQPWSKEE  1800
IDMLRKGMVK YPKGTSRRWE VVSEYIGTGR SVEEILKATK TVLLQKPDSA KAFDSFLEKR  1860
KPSASIASPL STREELGESL PTATTTTKAS PSKETVVANS QSSDNNGEAG GSSDTDGWST  1920
VQERALVQAL KTFPKETSQR WERVAAAVPG KTMIQCKKKF AELKEIIRNK KTGV*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5dje_A6e-14195162820123Zuotin
5dje_B6e-14195162820123Zuotin
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
116491667KKKQEEDAAIAAEEEKKRK
Cis-element ? help Back to Top
SourceLink
PlantRegMapCsa19g015550.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0081530.0AC008153.5 Arabidopsis thaliana chromosome 3 BAC F24K9 genomic sequence, complete sequence.
GenBankCP0026860.0CP002686.1 Arabidopsis thaliana chromosome 3, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010486796.10.0PREDICTED: dnaJ homolog subfamily C member 2-like
TrEMBLA0A178VPF00.0A0A178VPF0_ARATH; Uncharacterized protein
STRINGXP_010486796.10.0(Camelina sativa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM23612755
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G06110.20.0DnaJ domain ;Myb-like DNA-binding domain