![]() |
PlantRegMap/PlantTFDB v5.0
Plant Transcription
Factor Database
|
Home TFext BLAST Prediction Download Help About Links PlantRegMap |
Transcription Factor Information
Basic Information? help Back to Top | |||||||||
---|---|---|---|---|---|---|---|---|---|
TF ID | 55883 | ||||||||
Common Name | MICPUCDRAFT_55883 | ||||||||
Organism | |||||||||
Taxonomic ID | |||||||||
Taxonomic Lineage |
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; Mamiellales; Mamiellaceae; Micromonas; Micromonas pusilla
|
||||||||
Family | MYB_related | ||||||||
Protein Properties | Length: 3029aa MW: 316228 Da PI: 5.6045 | ||||||||
Description | MYB_related family protein | ||||||||
Gene Model |
|
Signature Domain? help Back to Top | |||||||
---|---|---|---|---|---|---|---|
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
1 | Myb_DNA-binding | 42.8 | 1.2e-13 | 384 | 429 | 2 | 47 |
SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS Myb_DNA-binding 2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 ++WT+eE+ ++ + +++G+g+W++I++ + + Rt+ q+ s+ qky 55883 384 QPWTEEEHRMFLVGLAKYGKGNWSAISQNVVLSRTPTQIMSHAQKY 429 79*******************************************9 PP |
Protein Features ? help Back to Top | ||||||
---|---|---|---|---|---|---|
Database | Entry ID | E-value | Start | End | InterPro ID | Description |
Gene3D | G3DSA:1.25.40.10 | 2.5E-11 | 138 | 232 | IPR011990 | Tetratricopeptide-like helical domain |
PROSITE profile | PS50293 | 11.619 | 139 | 225 | IPR013026 | Tetratricopeptide repeat-containing domain |
SuperFamily | SSF48452 | 2.71E-9 | 139 | 229 | IPR011990 | Tetratricopeptide-like helical domain |
PROSITE profile | PS51294 | 15.32 | 378 | 434 | IPR017930 | Myb domain |
SuperFamily | SSF46689 | 1.12E-12 | 380 | 435 | IPR009057 | Homeodomain-like |
SMART | SM00717 | 3.3E-10 | 382 | 432 | IPR001005 | SANT/Myb domain |
Gene3D | G3DSA:1.10.10.60 | 1.3E-11 | 384 | 434 | IPR009057 | Homeodomain-like |
Pfam | PF00249 | 1.3E-11 | 384 | 429 | IPR001005 | SANT/Myb domain |
TIGRFAMs | TIGR01557 | 2.2E-12 | 385 | 432 | IPR006447 | Myb domain, plants |
CDD | cd00167 | 1.57E-9 | 385 | 429 | No hit | No description |
Gene3D | G3DSA:1.10.10.60 | 1.7E-4 | 495 | 563 | IPR009057 | Homeodomain-like |
PROSITE profile | PS50090 | 5.912 | 496 | 559 | IPR017877 | Myb-like domain |
SuperFamily | SSF46689 | 3.89E-6 | 499 | 561 | IPR009057 | Homeodomain-like |
SMART | SM00717 | 0.027 | 500 | 561 | IPR001005 | SANT/Myb domain |
CDD | cd11660 | 4.11E-7 | 503 | 559 | No hit | No description |
Gene Ontology ? help Back to Top | ||||||
---|---|---|---|---|---|---|
GO Term | GO Category | GO Description | ||||
GO:0006355 | Biological Process | regulation of transcription, DNA-templated | ||||
GO:0005634 | Cellular Component | nucleus | ||||
GO:0003677 | Molecular Function | DNA binding | ||||
GO:0005515 | Molecular Function | protein binding |
Sequence ? help Back to Top |
---|
Protein Sequence Length: 3029 aa Download sequence Send to blast |
MALQLAALND PLSGARGDGG VEEARRTTEA EESLLTSAYE TALLFIRDGK REDAVRELRG 60 ILRNALMTDR DDDDDDAPSL TPTMTQVKFL ALKNLGRLVA DLAIEREDAG SSGSDDDDDD 120 DDADEDADLT ATERDLAADY AEALRAYAAA VEIDATDSSL WRRLGALASR RNLLHLARHA 180 LERGLVARPN HPLLLEDLAE VLLAVGDYPA CAHVAGLLLR LDPRHSRARE MRAGGEGAER 240 LRLTAAHEAA RRRAARRGRA AARHRKRGRE EETAARCARD DEATRDECER GEDVAVGGAL 300 IVKLERRTWE AVACALAATM DPTTSGGAAP ATRKKRAGGV AGDAVDADAA RPSTPPATEA 360 EPAGTRGEGG AGRTWRCPNR APPQPWTEEE HRMFLVGLAK YGKGNWSAIS QNVVLSRTPT 420 QIMSHAQKYY NHLSKHGAWE RAWAPTASEA AAEAAAEAAL LSATAAAEAD EIDLTKDDSP 480 VPVRVPVPVR VPASEPETRV RTPWAEQDVD FLLEGYEQFG PDGEDSKTLW VDILAAGVKQ 540 NLFKHRTPVD LKDKYRNVLF KRAREAANAS SPPPPPPPPA HPKTPEPRHW AVGVAIPGST 600 LGRRVRFVVP ASGGGGGFEE AKPVSGIDRP SRAGAGAGAG GGDEDDATTT TTTAAGIDEA 660 PRAMDVDAAA GTTTTTTTTT GKAANAKAKP GRKPKPKREG TRRGRSASAS EISTEPTSAP 720 AVPPPPAAAA AAEAMDVDVA APSGAESALE EAAGASGATS DAPRAGAGAG AGAVEKPPAA 780 GGKKKDSAPA RKSKRQIEKE DQEERERLHK EAWERRRAEQ QVALALAAEE NKWRPMKNLA 840 NDLASTALGG DSAAAMRADA DAAAIVAAAP VIQDDAAEKE KERESESAAP DSAPDSSAPP 900 EGAEEEPEPE EDKEEAADVA AFLGSVTSRN SGAADVAWRL LLHVTSRWCP TSEGASTDDV 960 APDAAAAEKL KAKRSNQWTK NRARDVSGGF CLKNAPRPAT LLRLAAAFGV GDGAGAPRVH 1020 LCLADAAVRA SRAALGAAYA ATHHRRRKAD GYEGPVLSDA AAKAAAEVTE GLPASHFVDA 1080 ANAHLDAAVA AADDDDFALM AEYHFVRGNL AAAEGSIALA EEHLRASVDA ATLAESGSGS 1140 EEGAKQRLLR DASCDGAADR LSAAAARAAL ADVRLHAVVA GADASLQAGN TAELLAVLAP 1200 LLLPLDASGA SAGAPQRGGS HAPVVLTPTQ REKALRTLAK AAKAEGDAHY PLAAGGNAAA 1260 AKVAEYALAT EVRARGALFA AVTAGGAGDA DVVVTRDALT DVFDVLQRAR STRTAGAFSP 1320 AAVAAAAARD AAERKKSAAD AMEDYDDDDD DDDAREMATQ HAARAIASLA PLFSKQIEAQ 1380 YASRAIAPGE TPTRRKNKVE QAKEDAAATI MELSAEALSV IQQLTSPPDD VVFTGDDGAP 1440 EEDFPDDNVT VADAPLGDAD APSPLSVLEL HERLHAALAD CRCCCGAAAR GGGSFLRDAI 1500 PTLTTARAPF ARESRRLKEL ARKAEESAAA RAKEERELAK KAAAERRAAL RAERAAKGEN 1560 AAEEEDEEED EEEEEEDDAD ARDKAAAKKT KAAGRGRPRG RAGAGGDDKE KNKAKAAFVR 1620 QNQKNDVRGG GGDGTVMGDG FVHYAGDVYV QSTQRPGGTH HDLAFALFRK DKTSYAFKDC 1680 DATGKYSKGK DQAILRSKIA VGRYLEREGL SEEEVEKREN ALKEAKAKAE AEAKENALKH 1740 PHVARECVDF LGSPPPKPST SSLPPLPPLA APKPKPSTRR EKIDAALKRF DDMIAQCTYC 1800 LYGAELGDVP RRCRDEGGAS GKELVLTSRR ACAELWRHVL PYAENLRSLG NGHGFKTVLD 1860 AVRRVFPPES ALPRDSPDVV DEYLKALPAF ADAEVLNPWD VASEASRGAW GRDEEFKLAR 1920 EAVSRAPPKR QREAKARASR KLAAVAEDEE GGGGGDGGGG DGDGDVAMID LADGPTPAPT 1980 PGPSPAPTPA SAPAPSPGGN MWRGLADMLG GPGIETRGVD RGASSPNADA AGGGGGATGD 2040 AEEEKEPLID PAVEFASAHE TLHRFCARVA DVEENDAAFE RVSETLPVIL DLMYHCVPVT 2100 REEAASRAAR WATQAAERAK TAKKGAKSRD APVAAATAAA RHAALLKADL THNPSSFDAW 2160 LALADHLDSC KDLALNDAAK LVTTYQWRRS PEAEARARRC QLALRRACCA AIVAATCDEE 2220 RSAAYERAGL AAYEHVSRAP PFHDGRRFVM SRDAGWRRSL GLCRDAFDGA ASAAPEEWTH 2280 RLMGVKIGRK LGEPLDVLFR RVDETLKLAP GNLEVFYQTH TTRVKALLKI AASSAAGWTM 2340 PAGAAGGRGK AAAKAALAKD LRLVASHAFD PEAVKSKDAG WDALWDDAVA GVRACAKMLP 2400 TYHKAHYRIA WARLRKPGDA ADAVARVAEA KDALLPLFKT AENASGFAKA DVAQYRPHGG 2460 GPPRFAVNMW EIEDGNSGVT RGACRRGVVA GAFKLETVGL NESARKYVSA VRRATRVYVC 2520 MSFALGDLSP LVAAPGFIGD EKNKFAKSMR DIKSLAFGLA VRAVAAACAA VVPDPRSLEL 2580 AYYAWAEHGC KAAATAWDEA VAASVHEIKI ERDAAAAARG DDDDDDDDAP AKLLGEPFSA 2640 ALKNCAGPEL DARVDASAFE TLARAHVASL KDERDVATLR ALLTDAAKKL ADASKRAARA 2700 TKGAEAAAAA AATAAATAPA TRLRAFVRDA LVLTIERAIA AGDVRVVAVA ENDGDAENAE 2760 DAAAIAVVEA AATRRRVNAN VVAGALGLHK EAELAASPAA TEAHEASRYA VHADALAEAA 2820 SASAKAAADG LRHVEAAAAA AARDAGVVAM DVVNESAQVF AATVQSPTKQ LERAVRARDE 2880 LAVAHAARLE AETAATARAA AARGRADALL AAAEAARLAT ASATALLRRA LVAALGADKT 2940 NVDDAALASI AEEYVKEAGL WDPIAEARKT GGGRWGGGGG AGKTATATVA ATEGSGGGGG 3000 GSAKKRVSVG DVGQGSGERA TRLSTGGE* |
Nucleic Localization Signal ? help Back to Top | |||
---|---|---|---|
No. | Start | End | Sequence |
1 | 691 | 698 | RKPKPKRE |
2 | 2971 | 2979 | GGRWGGGGG |
Regulation -- PlantRegMap ? help Back to Top | ||||||
---|---|---|---|---|---|---|
Source | Upstream Regulator | Target Gene | ||||
PlantRegMap | Retrieve | - |
Annotation -- Protein ? help Back to Top | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-value | Description | ||||
Refseq | XP_003057099.1 | 0.0 | predicted protein | ||||
TrEMBL | C1MNG1 | 0.0 | C1MNG1_MICPC; Predicted protein | ||||
STRING | XP_003057099.1 | 0.0 | (Micromonas pusilla) |
Orthologous Group ? help Back to Top | |||
---|---|---|---|
Lineage | Orthologous Group ID | Taxa Number | Gene Number |
Chlorophytae | OGCP322 | 15 | 29 |
Best hit in Arabidopsis thaliana ? help Back to Top | ||||||
---|---|---|---|---|---|---|
Hit ID | E-value | Description | ||||
AT3G11280.2 | 9e-18 | MYB family protein |
Link Out ? help Back to Top | |
---|---|
Phytozome | 55883 |
Entrez Gene | 9682103 |
Publications ? help Back to Top | |||
---|---|---|---|
|