PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cagra.1889s0015.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family MYB
Protein Properties Length: 1656aa    MW: 181903 Da    PI: 6.7689
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cagra.1889s0015.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding33.41e-10870911346
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
      Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                          +WT eE e+++   + +G++ +k+Ia+++   +t  +c+++++k
  Cagra.1889s0015.1.p 870 PWTSEEKEIFLSMLAIHGKD-FKKIASYLT-EKTTADCIDYYYK 911
                          8*****************99.********9.9**********98 PP

2Myb_DNA-binding284.9e-0910851126447
                           S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
      Myb_DNA-binding    4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47  
                           WT +E   +++++ ++G++ +++I+r++g +R++ qc+ ++ k+
  Cagra.1889s0015.1.p 1085 WTDDERSAFLQGFSLFGKN-FASISRYVG-TRSPDQCRVFFSKV 1126
                           *****************99.*********.********998776 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.17E-14853914IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.609.0E-7862913IPR009057Homeodomain-like
PROSITE profilePS5129314.857866917IPR017884SANT domain
SMARTSM007174.2E-10867915IPR001005SANT/Myb domain
PfamPF002491.5E-8869911IPR001005SANT/Myb domain
CDDcd001671.82E-8870912No hitNo description
PROSITE profilePS5129312.8910801131IPR017884SANT domain
SMARTSM007175.1E-810811129IPR001005SANT/Myb domain
SuperFamilySSF466891.25E-910841131IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.5E-410841125IPR009057Homeodomain-like
PfamPF002491.4E-710851125IPR001005SANT/Myb domain
CDDcd001676.35E-710851126No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1656 aa     Download sequence    Send to blast
MPQDHASWDR KELLRQRKHD RPEQSFDSPF RWRDSPTTPS SHHVPREFSR WGGSGDFRRP  60
SCHGKQGGRH QFVEEGSHGY TSSRSSARIF ENDYYRPSAS RGDWRYTRNC RDDRASVSQK  120
EWKCNTWDMS NGSSRSFERP FGIRNGRRSV DERPLHASDT HTTMVNSLDP TNSAHQPDTE  180
ICTPVRSLRF KNEQKFSDQR LSLPSDPHSD CVRLFEQASS ENNYGNKICS PAKQCNDLMY  240
GRRIANDNSL DPPILNAELE GTWEQLHMKD PQEDNKLHGI TDLDDARKCA KESSLGAIGK  300
LPLWNSSGSF ASQSSGFSHS SSLKSVGAVD STDRKTEVLP KIATVTQSSS GDATPCATTT  360
HLFEEMSSRK KQRLGWGEGL AKYEKKKVDV NTNEDGTTLL ENGLDEQHSL NKNIADKSPT  420
AAILPDYGSP TTPSSVACSS SPGFADKSSA KAAIAASDVS NMCRSPSPVS SIHLERFPVN  480
IDELDNISME RFGCLLNELL CTDEPGTGDS SSVQLTSMNR LLAWKSEILK AVEMTESEID  540
LLENKHRTLK LEGGRHCHVG SSSYFCEGDA DVPKEQEASC ILGPKAAATP VAEALVRSPV  600
HQSSLAKVSV DVCEDNNEEV KFLSQSFATV DSNEDILPKL SMKAVTSSKE ISTPAFVNQE  660
TVELSSADDS MASNEDILCA KLLSSNKKYA CESSGVFNEL LPRDCSFDES RYFGICQMQF  720
DSHVKEKLAD RVELLRAREK ILLLQFKAFQ LSWKKDLDQL ALTKYQSKSS RKSDLYPNAK  780
NGGYLKLPQP VRLRFSSSAP RRDSVVPTTE LVSYMEKLLP GTNLKPYRDI LRMPAMILDE  840
RERVMSRFIS SNGLVEDPCD VEKERTMINP WTSEEKEIFL SMLAIHGKDF KKIASYLTEK  900
TTADCIDYYY KNHKSDCFGK IKKQRAYGKE GKHTYMLAPR KKWKREMGAA SLDILGAVSI  960
IAANAGKVAS TRQISSKRIT LRGCSSSNSL QHDGNNSEGC SYSFDFPRKR TVGADVLAVG  1020
PLSSEQINSC LRTSVSSRER CMDHLKFNPV VKKPRISHTL HNENSNEEDD SCSEESCGET  1080
GPIHWTDDER SAFLQGFSLF GKNFASISRY VGTRSPDQCR VFFSKVRKCL GLEFIQSGSG  1140
NLSTSVSVDN GNEGGGSDLE DPCPMESNSG ICNNGVCAKM DINSPTSPFN MNQNGANHSG  1200
SANVKADLSR SEQENGLTYI HLKDGRNLVS NAYIKGDLPG LVSESCRDLV DINTVENQSQ  1260
AAGISKSSDL LSMEIDEGVL TSVAVSSEPL YCGLSVLSNV IVETPTESSQ MGSGDQGAAT  1320
MLKLNSKNQD GVMQAANRTK NPGLDPESAP SGFKYPECLH HVPIEVCTEN PIGVSVPRGN  1380
PNCHTEAKSG NSLVGQAVET HDLGWQFSKE NLELNGRLQV IGHVNPEQNG QLNSINAESC  1440
QIPQRSVTQD PSRISRSKSD LIVKTQRTGE GFSLNKCTSS APNSLTVSHK EGKSGHSRSH  1500
SFSLSDTERL DKNGDVKLFG TVLTADENGI KQKHNPGGSV RSSSTLSRDH DTRHHYINQQ  1560
HLQNVPITSY GFWDGNRIQT GLTSLPESAK LLASCPEAFS THLKQQVGSN KEIRRDVNGG  1620
GILSFGKHNE DRAEASSAKD GGNIGGVNGV AEAAT*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C3e-15828918493NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D3e-15828918493NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Cis-element ? help Back to Top
SourceLink
PlantRegMapCagra.1889s0015.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006290487.10.0uncharacterized protein LOC17885771
TrEMBLR0HEB90.0R0HEB9_9BRAS; Uncharacterized protein
STRINGCagra.1889s0015.1.p0.0(Capsella grandiflora)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM52602744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.10.0MYB family protein