PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Pp3c12_21290V3.4.p
Organism Physcomitrella patens
Taxonomic ID
Taxonomic Lineage cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Bryophyta; Bryophytina; Bryopsida; Funariidae; Funariales; Funariaceae; Physcomitrella
Family MYB
Protein Properties Length: 2461 aa    MW: 263656 Da    pI: 6.7153
Description MYB family protein
Gene Model
Gene Model ID        Type    Source   Coding Sequence
Pp3c12_21290V3.4.p   genome  COSMOSS  View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  27.3   8.7e-09  1103   1146  3          47
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
     Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47  
                           WT  E el+ ++v+++G++ +++Ia ++g++++  qck ++ k 
  Pp3c12_21290V3.4.p 1103 QWTDRERELFTEGVRLFGKD-FERIAVHVGTTKSVGQCKAFFCKT 1146
                          5*****************99.*******************99886 PP

2    Myb_DNA-binding  34     6.9e-11  1525   1566  3          46
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46  
                          +WT+eE e+++d ++ +G++ W++  ++++  ++l q+k ++q+
  Pp3c12_21290V3.4.p 1525 SWTQEEKEKFADIIRNHGKD-WTRLHECLP-SKSLTQIKTYFQN 1566
                          7*****************99.*********.************9 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          7.18E-13  840    903   IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  1.9E-4    849    896   IPR009057    Homeodomain-like
PROSITE profile  PS51293           17.276    852    903   IPR017884    SANT domain
SMART            SM00717           2.1E-7    853    901   IPR001005    SANT/Myb domain
PROSITE profile  PS51293           17.878    1099   1151  IPR017884    SANT domain
SMART            SM00717           1.3E-8    1100   1149  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          2.39E-9   1103   1153  IPR009057    Homeodomain-like
Pfam             PF00249           8.8E-7    1103   1145  IPR001005    SANT/Myb domain
CDD              cd00167           1.43E-5   1104   1145  No hit       No description
SuperFamily      SSF46689          2.69E-10  1520   1570  IPR009057    Homeodomain-like
PROSITE profile  PS51293           8.542     1521   1572  IPR017884    SANT domain
SMART            SM00717           7.1E-9    1522   1570  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  3.4E-7    1523   1567  IPR009057    Homeodomain-like
CDD              cd00167           2.21E-6   1525   1568  No hit       No description
Pfam             PF00249           3.2E-8    1525   1566  IPR001005    SANT/Myb domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 2461 aa
MYSPGRGDAR ATALPNARDS STSMPIDHAS AWHREPYNRD ARLDRGDRER IIGGKREWNS  60
DRASFGAPYP RAFGGLHPVN GSLGPPNKRR LNGRHTGFHN EGGRYVSPGH RGEPPFGSVG  120
PVENGLGRKF DRESFYPGSG CPVGGSGFGN GVGVDQRAKE LEERHKKEPF LLLRDGRDGG  180
SCGGIEHGVR SFPSQAVCEA EKSCWERERS KGAVGNGWES SQKLQRDLRE RVGGKAEGYS  240
RREGKLSQHD GGREWSQADK ELEGRRVDGY GSSSRDGLTG DGVGGIKEAV GGCEWRRKER  300
SSSVSASRSG SYVASSGTAK GPVGRPQALD VSGSASSPQV SPHSTASPGV QEASRDDGSQ  360
PSPPKRPRLG WGQGLAKYEK KKVVETDEVG VSASGGSVSG RERSSVSGAT TEVVQTQESQ  420
AAEESPVSVV SNLSTLPPLT PSASPSQQAA EVVSKAMVEC DGVEQEVVGK VCERVAAEAE  480
AGGWISKPEG VVADDVEKRV EDGIDDGEAM DVDAAEVHPG RVDSRESGEA MQVACSGRNW  540
SGGEECGNGR WGSDGAGAAG NVGERVGGGV SEGGHGGSVD DGGVLEEGEL GEEKQEEWTS  600
KQMSVGLESE GNPAYSPRAE TEPAQREEMD VTAGVSKCGV AEKEEAERKV ISWDVEVVAR  660
SLMEENKKRA EQARETFVHL LREGVGVEGT LYRCPAEAGV WKENVERHYR NQERMLEKMG  720
ERRQSLRFAE QVLAMRFRAL KEAWKQEQVG MRQQQRGTKP VRRWEVEKRN GTALHCHRSS  780
LRLRPVQAGM EKVEAVSEEC MKKVMAKAVV GPVRGVLKMP SMIVGQENRL ARRFESKNAL  840
VEDPVGMERE RKSMNPWSWE EKRVFLEKFA VYNKNFSKIA SHLELKTTAD CVEFYYRNQK  900
SEDFERIRRR QQLKKRRDYS RVGGSFLSTG LQTSSQRREA NGHGRTEGAN VQTVGAVVGV  960
SHISVGTKAA RSSMQQKPVE RQRVSSALEP GSLPGAVEIG KGVSGKENKW CGTGGVSGSA  1020
AGRGGIFGMV LSGATVSCGL SSAVAGAVKI GRERSSVKTM VDAGLVVASC GQIEQHVSAP  1080
MVTGNGYLQI YLEKSAGKEG DAQWTDRERE LFTEGVRLFG KDFERIAVHV GTTKSVGQCK  1140
AFFCKTRKRL GLDKLVEKYE DSLKVRVGAV MAESPECADV GRGEAAGVLA EDVRVSGLAV  1200
SAGPGMEVES DQDKGDESEK SVEVVLPVDV QEMEAVEAEA GEGVRVEEIV VEEVAVESEV  1260
VEVVPVYNGV LEDVPVHNGV LKEAPVDNGV LEDELVETPS VVSESVSEAV GASCIVSISK  1320
HAMMESGVCL TQSSVILDSV MKAVCEVPAT VAHSMGECGA EVSEDDRVEF VAVEKETFAE  1380
AVEVVVAKDG NTGNNSTDLA IGYVVEEDAA VIRKDAESDM ESCKCEPDTG LVKAVGMLAK  1440
ISTLAADEQE AKAVPTVDVK ADPESPQVAT SLSTDSASFP SLPAAVSHSG GSVFSNSSTQ  1500
GSQQGRERVG RVGGGETKSR REPTSWTQEE KEKFADIIRN HGKDWTRLHE CLPSKSLTQI  1560
KTYFQNSKAK LGLLNAEGVN ILGGRVAGSR KRKVDEAEDG SNNVACLSSG NELKGGGVSA  1620
RDMDGVCQNV KAAVGGVPSM GMGSGPSGLE SMLYPIFGQR VEDQIALEKF MRMYSANGLA  1680
QNIAGGVNPF VQQYGFPMFP NAGHQRASQL SLQQQLAASL AQKSGQAKGL QQQELQGSGS  1740
VQQSGQASVQ QKAAHQLQSA QMVRNQQLLA SMMQHQAAAH HQQNQTSKSV PHPLQPQVVV  1800
LQPGHAGQLQ QQPLVLNQQV IHHPQAQQGG SQQEQGSPGH LVGNPSLGAC SSAQVPNQMK  1860
QLLQHQALLQ QQQFQLALQH LQQQQQQQQL QQHQHHQDQT QAQVQVQAQA QIQALEHSHS  1920
HSQSHIHQPQ YSLASLLQPL VVQKPKVVVP LPPQVTRTPS PPVLQQGGLP HRHDQHQHGP  1980
SHTFLRGNVA SAVSSNPGEA STQQGEIGEQ WGSISHGGGS KERNFSRADL HQLSGAGQMV  2040
KPSNAEPSRP ATESPQARIG DVKLFGQSLL SQPTSCAVSQ NVARGSFSAA EKVSLQQSPV  2100
STAVSSLATS MPVPAASRVK PYGKESAFRG VPFSSEGRQG GWPPLGNRGS MEMWNIMNNA  2160
HSQVGQVLKS KVEELELHNM QECRMSREAQ DAESDQSTPK STHKPQHLEQ ARGAQDKSGS  2220
GGFSSLAYGV GKSRACVNEH GVSSGEVSRT DPERRGESIG LDSGCSERSS MMASRSSRGQ  2280
AESFAMQATG PFANGQQVPR TVIDALMAIA ELHRIRSSQF NSSVGQPRLE DHQWESLVQH  2340
TTKGMAVDAV KANPSLSLNR PIATPQTHLR GNGVSQHCVR DSGLYFTQHY PNLPGQSGVV  2400
SSAWNGGTGL VHPCDVQRML VNPALSSFLP SALSRASTAT EKYLGSSEVR DSDGGRDGTC  2460
*
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
4a69_C  2e-15   811          905        1          94       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D  2e-15   811          905        1          94       NUCLEAR RECEPTOR COREPRESSOR 2
Expression -- UniGene
UniGene ID  E-value  Expressed in
Ppa.10916   0.0      protonema
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_024390845.1  0.0      uncharacterized protein LOC112289670 isoform X2
TrEMBL  A9SRL3          0.0      A9SRL3_PHYPA; Predicted protein
STRING  PP1S143_55V6.1  0.0      (Physcomitrella patens)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  1e-34    MYB family protein