PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cagra.2117s0051.1.p
Organism Capsella grandiflora
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family MYB
Protein Properties Length: 678 aa    MW: 77150.1 Da    pI: 8.3417
Description MYB family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Cagra.2117s0051.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  22.5   2.7e-07  494    534  2          39
                          SSS-HHHHHHHHHHHHHTTTT...-HHHHHHHHTTTS-HHH CS
      Myb_DNA-binding   2 grWTteEdellvdavkqlGgg...tWktIartmgkgRtlkq 39 
                          ++W++eE + l ++++++++g   +W+ I++++g+gR+  +
  Cagra.2117s0051.1.p 494 KSWSKEEIDMLRKGITKFPKGtsqRWEVISEYIGTGRSVDE 534
                          58**********************************99765 PP

2    Myb_DNA-binding  27.6   6.8e-09  621    664  4          45
                          S-HHHHHHHHHHHHHTTTT...-HHHHHHHHTTTS-HHHHHHHHH CS
      Myb_DNA-binding   4 WTteEdellvdavkqlGgg...tWktIartmgkgRtlkqcksrwq 45 
                          W++  +  l++a+k ++++   +W++Ia  ++ g+t +qck ++ 
  Cagra.2117s0051.1.p 621 WSAVQERALIQAFKTFPKEtnqRWERIATAVP-GKTMNQCKKKFA 664
                          ********************************.********9986 PP
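
The Start and End values in the domain table appear to be 1-based, inclusive positions in the protein sequence, which is easy to get wrong when slicing in a 0-based language. The following is a minimal Python sketch, not part of PlantTFDB; the function name extract_region and its first_position parameter are hypothetical. It takes residues 481-540 of this entry's protein sequence as input and recovers the first Myb_DNA-binding hit (494-534).

```python
def extract_region(sequence: str, start: int, end: int, first_position: int = 1) -> str:
    """Return residues start..end (1-based, inclusive) from a sequence.

    first_position is the residue number of sequence[0]; it lets the
    function work on a fragment rather than the full-length protein.
    """
    lo = start - first_position          # 0-based index of the first residue
    hi = end - first_position + 1        # exclusive upper bound for slicing
    if lo < 0 or hi > len(sequence) or lo >= hi:
        raise ValueError("region lies outside the supplied sequence")
    return sequence[lo:hi]


# Residues 481-540 of Cagra.2117s0051.1.p, copied from the Sequence section.
fragment = "EARVDTATHQKKEKSWSKEEIDMLRKGITKFPKGTSQRWEVISEYIGTGRSVDEILKATK"

# Coordinates of domain hit no. 1 from the Signature Domain table.
domain_1 = extract_region(fragment, 494, 534, first_position=481)
print(domain_1)
print(len(domain_1))
```

The recovered string can be compared against the query row of the first alignment block, ignoring the lowercase letters that mark residues aligned to insert states.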

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46565            1.31E-19  75     178  IPR001623    DnaJ domain
Gene3D           G3DSA:1.10.287.110  2.1E-19   96     200  IPR001623    DnaJ domain
SMART            SM00271             2.5E-14   99     174  IPR001623    DnaJ domain
Pfam             PF00226             1.5E-16   100    179  IPR001623    DnaJ domain
PROSITE profile  PS50076             17.512    100    182  IPR001623    DnaJ domain
CDD              cd06257             1.33E-12  100    171  IPR001623    DnaJ domain
PRINTS           PR00625             2.6E-7    105    123  IPR001623    DnaJ domain
PRINTS           PR00625             2.6E-7    123    138  IPR001623    DnaJ domain
PRINTS           PR00625             2.6E-7    154    174  IPR001623    DnaJ domain
PROSITE pattern  PS00636             0         159    178  IPR018253    DnaJ domain, conserved site
PROSITE profile  PS50090             6.133     488    531  IPR017877    Myb-like domain
Gene3D           G3DSA:1.10.10.60    1.2E-4    489    535  IPR009057    Homeodomain-like
SMART            SM00717             7.2E-4    492    544  IPR001005    SANT/Myb domain
SuperFamily      SSF46689            6.8E-7    494    537  IPR009057    Homeodomain-like
Pfam             PF00249             1.6E-5    494    535  IPR001005    SANT/Myb domain
CDD              cd00167             0.00305   495    535  No hit       No description
SMART            SM00717             2.9E-9    617    669  IPR001005    SANT/Myb domain
SuperFamily      SSF46689            2.62E-9   618    666  IPR009057    Homeodomain-like
PROSITE profile  PS50090             7.166     621    667  IPR017877    Myb-like domain
Pfam             PF00249             1.7E-7    621    665  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60    4.9E-6    621    665  IPR009057    Homeodomain-like
CDD              cd00167             1.99E-6   621    667  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 678 aa
MQSWGINSAI KLLTYSNELS GGQALYASSN CHPVKALNRE PAGHAFHAAA LKLRGCAKEA  60
TGKNEDTDKK VPKEKDNEFI PSYDSHNNKG KKKSGKQQHD HYALLGLGNL RYLATEDQIR  120
KSYREAALKH HPDKLATLLL AEETEEAKQA KKDEIESHFK LIQEAYEILM DPLKRRIFDS  180
TDEFDDEVPT DCAPQDFFKV FGPAFKRNAR WSSASHVPDL GDENTPLEEV DRFYSYWYGF  240
KSWREFPEEE EHDIEQAESR EEKRWMEREN AKKSNKARKE EHARIRILVD NAHKKDIRIL  300
KRKEEEKAMK LQMKEAKVMA KKKLEEEAAA AIEEEKRRKE EEAKRAAEAA QLHKRAKEKN  360
KKLLQKERSR LRTLSAPVLS QKLLGISVDH VEDLCMSLNT EQLRKLCDKM KNKEGLALAK  420
VLKNGNSIDD DETESKEEEV QVAVKQNGHI EAKVETNGHV EARVETNGHV EARVETNGLV  480
EARVDTATHQ KKEKSWSKEE IDMLRKGITK FPKGTSQRWE VISEYIGTGR SVDEILKATK  540
TVLLHKPDSA KAFDSFLEKR KPAASIASPL STREELGEPI ILTKPHAEDN STKTETTEKN  600
GKGEENNSEQ DAAAVSDPEG WSAVQERALI QAFKTFPKET NQRWERIATA VPGKTMNQCK  660
KKFAELKDII RTKKPTA*
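
The listing above is laid out in blocks of ten residues with a running position counter at the end of each full line and a trailing `*` as the stop marker. To reuse it in other tools it usually has to be collapsed into one continuous string first. The following is a minimal Python sketch of that clean-up, not a PlantTFDB utility; the function name parse_numbered_sequence is hypothetical, and the sample input is the first two lines of the listing.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Collapse a block-formatted protein listing into a single string.

    Everything that is not a letter is dropped: spaces, line breaks,
    the position counters at line ends, and a trailing '*' stop marker.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


# First two lines of the protein sequence listing for Cagra.2117s0051.1.p.
sample = """
MQSWGINSAI KLLTYSNELS GGQALYASSN CHPVKALNRE PAGHAFHAAA LKLRGCAKEA  60
TGKNEDTDKK VPKEKDNEFI PSYDSHNNKG KKKSGKQQHD HYALLGLGNL RYLATEDQIR  120
"""

seq = parse_numbered_sequence(sample)
print(len(seq))   # should agree with the counter on the last full line
print(seq[:10])
```

Comparing the length of the cleaned string with the last position counter is a quick check that no residues were lost or duplicated during copying.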
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5dje_A  1e-14    196          299        20         123      Zuotin
5dje_B  1e-14    196          299        20         123      Zuotin
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    320    338  KKKLEEEAAAAIEEEKRRK
Cis-element
Source       Link
PlantRegMap  Cagra.2117s0051.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AP002030  0.0      AP002030.1 Arabidopsis thaliana genomic DNA, chromosome 5, TAC clone:K16F4.
GenBank  CP002688  0.0      CP002688.1 Arabidopsis thaliana chromosome 5 sequence.
Annotation -- Protein
Source  Hit ID               E-value  Description
Refseq  XP_006287223.1       0.0      dnaJ homolog subfamily C member 2
TrEMBL  R0FE25               0.0      R0FE25_9BRAS; Uncharacterized protein
STRING  Cagra.2117s0051.1.p  0.0      (Capsella grandiflora)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM2361              27           55
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G06110.2  0.0      DnaJ domain; Myb-like DNA-binding domain