PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Csa04g038070.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family MYB
Protein Properties Length: 1860 aa    MW: 203014 Da    pI: 6.9945
Description MYB family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
Csa04g038070.1  genome  CSGP    View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  37.3   6.5e-12  879    920   3          46
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT eE e++++  +++G++ +k+Ia+++   +t  +c+++++k
   Csa04g038070.1 879 PWTSEEKEIFLKMLALHGKD-FKKIASYLK-QKTTADCIDYYYK 920
                      8*****************99.********9.9**********98 PP

2    Myb_DNA-binding  26.9   1.1e-08  1095   1136  4          47
                       S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding    4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47  
                       WT +E   +++++ ++G++ +++I+r++g  R++ qc+ ++ k+
   Csa04g038070.1 1095 WTDDERSAFIQGFSLFGKN-FASISRYVG-SRSPDQCRVFFSKV 1136
                       *****************99.*********.********998776 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          7.4E-15   862    923   IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  2.2E-7    871    922   IPR009057    Homeodomain-like
PROSITE profile  PS51293           15.717    875    926   IPR017884    SANT domain
SMART            SM00717           1.3E-10   876    924   IPR001005    SANT/Myb domain
Pfam             PF00249           8.6E-10   878    920   IPR001005    SANT/Myb domain
CDD              cd00167           7.47E-9   879    921   No hit       No description
PROSITE profile  PS51293           12.5      1090   1141  IPR017884    SANT domain
SMART            SM00717           6.0E-8    1091   1139  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          2.62E-9   1094   1141  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  1.2E-4    1094   1135  IPR009057    Homeodomain-like
Pfam             PF00249           4.6E-7    1095   1135  IPR001005    SANT/Myb domain
CDD              cd00167           1.03E-6   1095   1136  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1860 aa
MPQDHASWDR KELLRQRKHD RPEPSSFDSP FRWRDSPSST PSSHHVPREF SRWGSGDFRR  60
PSSSSSSSCH GKQGGRHQFA EESSHGYTSS RSSARIFDND YYRPSASRGD WRYTRNCRDD  120
RVSVSQKEWK CNTWEMSNGS SRSFERPFGI RNGRRSVDER PLHASDTHTT AVNSFDPTNS  180
AHHPDTEVYT PVRSLKFKSE QKFSDQRLSL PSDPHSDCVS LFERPSSENN YGNKVCSPAK  240
QCNDLIYGRR IANDNSLDPP ILNAELEGTW EQLHLKDPQE DNRLHGISDL DGARKCAKES  300
ALGAIGKLPL WNSSGSFASQ SSGFSHSSSL KSLGAVDSSD RKTEALPKTV TVTQSSSGDA  360
TACATTTHLS EEMSSRKKQR LGWGEGLAKY EKKRVDVNTN EDGTTLLENG SEELHSLNKN  420
IADKSPTAAT VPDYGSPTTP SSVACSSSPG FADKSSAKAA IAACDVSNIC RSPSPVSSFH  480
LEQFPINIEE LDNISMERFG CLLNELLCTD DPGTGDSSSV QLTSMNRLLS WKSEILKAVE  540
MTESEIDLLE NKHRALKLER GRHCHVVGPS SHFCEGDANV PKEQEASCIL GPKVAAPSVA  600
ETLVRSLVHQ SGLAKVPVDV FEDNPGEVKS LSQSFATVES NEDLLPMPSM KAAASSKEIN  660
TPAFVNQEII ELSAADDSMA SKEDLLCATL YSSNKKYACE SSGVFNDLLP RNFCSFDGSR  720
FPSICHTQFD SHVKEKIADR VELLRAREKI LLLQFKAFQL SWKKDLHQLA STKYQSKSSK  780
KTELYPNAKN GGYLKLPQPV RLRFSSSAPR KDSVIPTTEL VSYMEKLLPG THLKPFRDIL  840
RMPAMILDEK ERVMSRFISS NGLVEDPCDV EKERTMINPW TSEEKEIFLK MLALHGKDFK  900
KIASYLKQKT TADCIDYYYK NHKSDCFGKI KKQRAYGKEG KHTYMLAPRK KWKRDMGTAS  960
LDLLGSVSVI AAANAGKVAP TRPIASKRIT LRGCSSSNSL QHDGNNSEGC SYSFDFPRKR  1020
TVGADVLAVG PLSSEQINSC LRTSVSSRER FMDHLKFNPV VKKPRISHTL HNENSNEEDD  1080
SCSEESCGET GPIHWTDDER SAFIQGFSLF GKNFASISRY VGSRSPDQCR VFFSKVRKCL  1140
GLEFIQSGSG NVSTSVSVDN GNEGGGSDLE DPCPMESNSG ICNNGVCAKM DMNSPTSPFN  1200
MNQDGANHSG SANVKADLSR SEQENGLTYM QDGSNPVNNA YINGGLPGLI SESCRDLVDI  1260
NTVESQSQAA GKSKSNDLLS MEIDEGVLTS VAVSSEPLYC GLSVLSNVIV ETPAESSRKG  1320
SGDQGSAIPK LSSKNQDGVM QAANRTRNSG LEPESAPSGF KYPDCLHHVP IEVCTENPIG  1380
VSVPRGNPNC HTEGKSGHSL VGQAVETHDL GWQSSKDNVE LDGQLQVIGH VNPEQNGHLN  1440
ANYAESCQIP KRSAIQDPSR ISRSKSDLIV KTQRTGEGFS LNKCTSSAPN LVSSEPLYCG  1500
LSVLSNVIVE TPAESSRKGS GDQGSAIPKL SSKNQDGVMQ AANRTRNSGL EPESAPSGFK  1560
YPDCLHHVPI EVCTENPIGV SVPRGNPNCH TEGKSGHSLV GQAVETHDLG WQSSKDNVEL  1620
DGQLQVIGHV NPEQNGHLNA NYAESCQIPK RSAIQDPSRI SRSKSDLIVK TQRTGEGFSL  1680
NKCTSSAPNL LAVSHKEGRS GHSRSHSFSL SDTERLDKNG DVKLFGTVLT ADENGIKQKH  1740
NPCGSVRSSS TLSRDHDIRL HYINQQHLQN VPITSYGFWD GNRIQTGLTS LPESAKLLAS  1800
CPEAFSLHLK QQVGKEIRLD VNGGGILSFG KHNEDRAEAS SAKDGCNIGG LNGVAEAAT*
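The listing above is formatted for reading: residues are grouped in blocks of ten, each line ends with a running position counter, and the final `*` marks the stop codon. The following is a minimal Python sketch, not part of the database record, of how such a listing could be turned into a plain residue string and how a domain region could be cut out of it. The function names `parse_sequence_block` and `subsequence` are hypothetical, and the sketch assumes that the Start/End columns in the tables of this entry are 1-based and inclusive.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a position-numbered sequence listing into one residue string."""
    # Remove whitespace and the running position counters (digits).
    residues = re.sub(r"[\s\d]", "", block)
    # A trailing '*' marks the stop codon, not an amino acid.
    return residues.rstrip("*")


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


# First two lines of the listing above, copied verbatim.
example = """MPQDHASWDR KELLRQRKHD RPEPSSFDSP FRWRDSPSST PSSHHVPREF SRWGSGDFRR  60
PSSSSSSSCH GKQGGRHQFA EESSHGYTSS RSSARIFDND YYRPSASRGD WRYTRNCRDD  120"""

seq = parse_sequence_block(example)
print(len(seq))
print(subsequence(seq, 1, 10))
```

Applied to the full listing, `subsequence(seq, 879, 920)` would be expected to return the region reported for the first Myb_DNA-binding hit, provided the coordinate assumption holds.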
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4a69_C  7e-16    837          927        4          93       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D  7e-16    837          927        4          93       NUCLEAR RECEPTOR COREPRESSOR 2
Cis-element
Source       Link
PlantRegMap  Csa04g038070.1
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_010503962.1  0.0      PREDICTED: uncharacterized protein LOC104781070 isoform X1
Refseq  XP_019100288.1  0.0      PREDICTED: uncharacterized protein LOC104781070 isoform X2
Refseq  XP_019100289.1  0.0      PREDICTED: uncharacterized protein LOC104781070 isoform X3
TrEMBL  D7LU40          0.0      D7LU40_ARALL; Myb family transcription factor
STRING  XP_010503962.1  0.0      (Camelina sativa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM52602744
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  0.0      MYB family protein