PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cagra.1655s0025.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family MYB_related
Protein Properties Length: 647 aa    MW: 73722 Da    pI: 8.7071
Description MYB_related family protein
Gene Model
Gene Model ID          Type      Source    Coding Sequence
Cagra.1655s0025.1.p    genome    JGI       View CDS
Signature Domain
Signature Domain
No.    Domain             Score    E-value    Start    End    HMM Start    HMM End
1      Myb_DNA-binding    24.4     6.6e-08    590      633    4            45
                          S-HHHHHHHHHHHHHTTTT...-HHHHHHHHTTTS-HHHHHHHHH CS
      Myb_DNA-binding   4 WTteEdellvdavkqlGgg...tWktIartmgkgRtlkqcksrwq 45 
                          W+t  +  lv+a k ++++   +W+++a+ ++ g+t  qck ++ 
  Cagra.1655s0025.1.p 590 WSTVQERALVQALKTFPKEtsqRWERVAAAVP-GKTMIQCKKKFA 633
                          ********************************.********9986 PP

Protein Features
3D Structure
Database           Entry ID              E-value     Start    End    InterPro ID    Description
Gene3D             G3DSA:1.10.287.110    7.8E-21     95       192    IPR001623      DnaJ domain
SuperFamily        SSF46565              5.63E-21    95       177    IPR001623      DnaJ domain
SMART              SM00271               2.3E-17     98       173    IPR001623      DnaJ domain
Pfam               PF00226               2.0E-17     99       178    IPR001623      DnaJ domain
CDD                cd06257               2.36E-13    99       170    IPR001623      DnaJ domain
PROSITE profile    PS50076               18.349      99       181    IPR001623      DnaJ domain
PRINTS             PR00625               1.4E-8      104      122    IPR001623      DnaJ domain
PRINTS             PR00625               1.4E-8      122      137    IPR001623      DnaJ domain
PRINTS             PR00625               1.4E-8      153      173    IPR001623      DnaJ domain
PROSITE pattern    PS00636               0           158      177    IPR018253      DnaJ domain, conserved site
PROSITE profile    PS50090               6.156       458      513    IPR017877      Myb-like domain
SuperFamily        SSF46689              3.14E-7     460      506    IPR009057      Homeodomain-like
Gene3D             G3DSA:1.10.10.60      1.0E-4      462      505    IPR009057      Homeodomain-like
SMART              SM00717               3.1E-6      462      515    IPR001005      SANT/Myb domain
Pfam               PF00249               8.2E-6      463      505    IPR001005      SANT/Myb domain
CDD                cd00167               5.58E-4     465      505    No hit         No description
SMART              SM00717               2.3E-7      586      638    IPR001005      SANT/Myb domain
SuperFamily        SSF46689              1.72E-8     589      642    IPR009057      Homeodomain-like
Pfam               PF00249               8.0E-6      590      634    IPR001005      SANT/Myb domain
Gene3D             G3DSA:1.10.10.60      2.5E-4      590      634    IPR009057      Homeodomain-like
PROSITE profile    PS50090               6.841       590      636    IPR017877      Myb-like domain
CDD                cd00167               1.37E-5     590      636    No hit         No description
Gene Ontology
GO Term       GO Category           GO Description
GO:0003677    Molecular Function    DNA binding
Sequence
Protein Sequence    Length: 647 aa
MPSRRSDSAI KLIAYSEELV DGKPFYAFSN CLPVKALNRE PAGHAFHSAA LKLHGCAEEP  60
ADNEDSDKKV GDDKEKEYVP SFNSYANKGK KKSGTQQQDH YALLGLSNLR YLATEDQIRK  120
SYREAALKHH PDKLATLLLA EETEEAKEAK KDEIESRFKA IQEAYEVLMD PTRRRIFDST  180
DEFDDEVPSD CLPQDFFKVF GPAFKRNARW SVNQRIPDLG DENTPLKDVD KFYNFWYGFK  240
SWREFPDEEE HDLEQADSRE ERRWMEKENA KKTVKARKEE HARIRTLVDN AYRKDPRIVK  300
RKEEEKAEKQ QKKEAKLLAK KKQAEDAAIA AEEEKRRKEE EEKRAAESAQ QQKKNKEKEK  360
KLLRKERNRL RTLSAPLVAQ HLLDISEEDI ENLCMSLNTE QLQNLCDKMG NKEGLELAKV  420
IKDGCESSRN DEAETKEKES KKTNGGPEPK TRVSQLDSST QKKQPWSKEE IDMLRKGMIK  480
YPKGTSRRWE VVSEYIGTGR SVEEILKATK TVLLQKPDSA KAFDSFLEKR KPSASISSPL  540
STREELGESL PTMTTTKASP SKETVVGKSS SSQVSDTNGE AGGSSDADGW STVQERALVQ  600
ALKTFPKETS QRWERVAAAV PGKTMIQCKK KFAELKEIIR NKKTGV*
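
The sequence block above is formatted in groups of ten residues with a running position counter at the end of each full row, and it ends with a `*` stop symbol. A minimal Python sketch of turning such rows back into a plain residue string follows. The function name `clean_sequence_block` is hypothetical and not part of any PlantTFDB tooling, and the figure of 540 residues before the sample rows is an assumption taken from the position counters. Under that assumption the rows add up to 646 residues plus the stop symbol, which suggests the reported length of 647 counts the `*`.

```python
import re

def clean_sequence_block(lines):
    """Join formatted sequence rows into one residue string.

    Returns (sequence_without_stop, had_stop_symbol).
    """
    parts = []
    for line in lines:
        # Drop the trailing position counter (e.g. "  600"), if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Drop the spaces between the ten-residue groups.
        parts.append(re.sub(r"\s+", "", line))
    seq = "".join(parts)
    had_stop = seq.endswith("*")
    return seq.rstrip("*"), had_stop

# Last two rows of the record's sequence block (residues 541 onward).
tail_lines = [
    "STREELGESL PTMTTTKASP SKETVVGKSS SSQVSDTNGE AGGSSDADGW STVQERALVQ  600",
    "ALKTFPKETS QRWERVAAAV PGKTMIQCKK KFAELKEIIR NKKTGV*",
]
tail, had_stop = clean_sequence_block(tail_lines)

# Assumption: the nine rows before these hold 60 residues each (540 total),
# as their position counters indicate.
residues_before_tail = 540
total_residues = residues_before_tail + len(tail)

print(len(tail), had_stop, total_residues)
```

Passing all eleven rows of the block to the same function would return the full protein sequence without the stop symbol.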
3D Structure
Structure
PDB ID    Evalue    Query Start    Query End    Hit Start    Hit End    Description
5dje_A    2e-14     195            298          20           123        Zuotin
5dje_B    2e-14     195            298          20           123        Zuotin
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      319      337    KKKQAEDAAIAAEEEKRRK
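
When reusing the NLS coordinates, it may help to check them against the protein sequence. The sketch below searches for the NLS motif in the sequence row that ends at counter 360, assuming that row starts at residue 301. The helper `find_motif_1based` is a hypothetical name introduced for illustration. Counted from 1, the motif falls at 320 to 338, one position later than the tabulated 319 to 337, so the NLS table may be using 0-based offsets. Treat this as an observation to verify rather than a documented convention.

```python
def find_motif_1based(segment, segment_start, motif):
    """Return the 1-based (start, end) of motif, or None if it is absent.

    segment_start is the 1-based position of the first residue in segment.
    """
    idx = segment.find(motif)
    if idx < 0:
        return None
    start = segment_start + idx
    return start, start + len(motif) - 1

# Sequence row covering residues 301-360 (assumed from the position counters).
row = "RKEEEKAEKQ QKKEAKLLAK KKQAEDAAIA AEEEKRRKEE EEKRAAESAQ QQKKNKEKEK"
segment = row.replace(" ", "")

nls = "KKKQAEDAAIAAEEEKRRK"
span = find_motif_1based(segment, 301, nls)
print(span)  # (320, 338)
```

By the same counting, the signature domain start at 590 matches the tryptophan at 1-based position 590 in the sequence, so the offset appears to affect only the NLS table.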
Cis-element
Source         Link
PlantRegMap    Cagra.1655s0025.1.p
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    AC008153    0.0        AC008153.5 Arabidopsis thaliana chromosome 3 BAC F24K9 genomic sequence, complete sequence.
GenBank    CP002686    0.0        CP002686.1 Arabidopsis thaliana chromosome 3, complete sequence.
Annotation -- Protein
Source    Hit ID                 E-value    Description
Refseq    XP_006297179.1         0.0        dnaJ homolog subfamily C member 2
Refseq    XP_023641194.1         0.0        dnaJ homolog subfamily C member 2
TrEMBL    R0G3N0                 0.0        R0G3N0_9BRAS; Uncharacterized protein
STRING    Cagra.1655s0025.1.p    0.0        (Capsella grandiflora)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM2361                27             55
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT5G06110.2    0.0        DnaJ domain; Myb-like DNA-binding domain