PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.008G288900.1
Common NameB456_008G288900, LOC105765654
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1081aa    MW: 122752 Da    PI: 8.1979
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.008G288900.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding29.12.3e-09434479146
                         TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         r++WT+eEd+ l+  v+++G ++W  Ia  +  +Rt+ qc  r+q 
  Gorai.008G288900.1 434 RNPWTAEEDKNLLLIVQEMGIDNWFDIAVSLASNRTPFQCLARYQR 479
                         79******************************9***********96 PP

2Myb_DNA-binding42.21.9e-13488531246
                         SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                           WT+eEd++l  av+ +G  +W ++a+t+  gRt+ qc +rw k
  Gorai.008G288900.1 488 REWTEEEDDQLRIAVEVFGESDWQSVASTLK-GRTGTQCSNRWKK 531
                         68****************************9.***********98 PP

3Myb_DNA-binding51.13e-16541584246
                         SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         grWT +Ed++l+ av ++G+++W++Ia+ ++ gRt  qc++rw +
  Gorai.008G288900.1 541 GRWTRDEDKRLKVAVLLFGPKNWRKIAQVVP-GRTQVQCRERWVN 584
                         8******************************.***********87 PP

4Myb_DNA-binding41.14.2e-13594636347
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                          WT+eEd +l  a++++G   W+++a ++   Rt++qc  rw ++
  Gorai.008G288900.1 594 IWTEEEDSRLEAAIEEHGYC-WSKVATCVA-SRTDNQCWRRWKTL 636
                         5*****************99.*********.***********976 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500906.017329428IPR017877Myb-like domain
SMARTSM007175.1E-4333430IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.604.0E-12336352IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.604.0E-12401441IPR009057Homeodomain-like
SuperFamilySSF466893.0E-16410477IPR009057Homeodomain-like
PROSITE profilePS5009010.011429481IPR017877Myb-like domain
SMARTSM007171.4E-10433483IPR001005SANT/Myb domain
PfamPF139211.2E-8437483No hitNo description
Gene3DG3DSA:1.10.10.601.4E-14442489IPR009057Homeodomain-like
SuperFamilySSF466899.66E-20465536IPR009057Homeodomain-like
PROSITE profilePS5129416.84482533IPR017930Myb domain
SMARTSM007171.2E-11486535IPR001005SANT/Myb domain
PfamPF002492.1E-11488531IPR001005SANT/Myb domain
CDDcd001679.58E-10489533No hitNo description
Gene3DG3DSA:1.10.10.602.9E-16490536IPR009057Homeodomain-like
PROSITE profilePS5129421.891534590IPR017930Myb domain
SuperFamilySSF466897.61E-25538633IPR009057Homeodomain-like
SMARTSM007171.4E-14539588IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.608.2E-18541589IPR009057Homeodomain-like
PfamPF002497.8E-15541584IPR001005SANT/Myb domain
CDDcd001676.26E-12542586No hitNo description
Gene3DG3DSA:1.10.10.601.0E-15590636IPR009057Homeodomain-like
SMARTSM007173.3E-12591639IPR001005SANT/Myb domain
PROSITE profilePS5129415.255593641IPR017930Myb domain
CDDcd001673.07E-10595636No hitNo description
PfamPF002495.7E-11595636IPR001005SANT/Myb domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1081 aa     Download sequence    Send to blast
MSLKDEYDTV NDEEEEVDDV ELSGNENGVG FDEDMEALKK ACLRTGTDLN GLDIVSVDNE  60
RPSTSTAASP ASADSGSEDD LEIFRSIRNR LALSEDVYEP LSLKPLCTFP PISSDEDAED  120
DFETLRIIQK RFLAYSTDYT WKNSREDNME KTDPIYMTRT PLKDATGNDI CEKFQDYEQN  180
GNISHVSSDN AEMQPLSLVQ CDQSDVNVLS TYKSSRFPKS AQLLFDAIKK NRSYQMFLRS  240
KLAHIEVKIE ENKKLKERVK ILRDFQVSCK KLTGRALSAK KDPRIQLISA RKLRTFEDPE  300
INDKRVTADY GPLENSSVAS YRMALIDFPL KLERKNWSRE ERENLEKGIR QQFQERALQV  360
SVGWLSTSDG SPQDGNNLDG IIATVKDLEI TPERIREFLP KVDWNQLASF YVKGRCGAEC  420
ETRWLNHEDP LINRNPWTAE EDKNLLLIVQ EMGIDNWFDI AVSLASNRTP FQCLARYQRS  480
LNPCILKREW TEEEDDQLRI AVEVFGESDW QSVASTLKGR TGTQCSNRWK KSLHPTRQRV  540
GRWTRDEDKR LKVAVLLFGP KNWRKIAQVV PGRTQVQCRE RWVNSLDPAL NVGIWTEEED  600
SRLEAAIEEH GYCWSKVATC VASRTDNQCW RRWKTLHPEE VPLLQEARKI RKAALISNFV  660
DRESERPALG PNDFNIQLPM ITATSEPSKE KGKRRRRRPE YEKENAAALR LSPEKRSHKS  720
CRKGAQTTTG RNPPLENNNC TEPAEDVTFQ KKRKREPPSG NNNHIKPAQH VAIQKKRKQP  780
LSGHVNCSDR KQDGAVQTYK RKQQSGSSKF VKSVQDNCSS HLLSALCITG NHEAESFGSS  840
LTVKRRKNHK ASPKQFSKRS MCTESHEEQY SICSEISVFS GGDDGAEVMQ NSGVESEILG  900
ADDTSRKAKP RSKRKTCMNS LTSQSSRTIV AEHFKNLSAA KNTKKNRTKQ QQSKSRKSNK  960
PSGDENGQTD GDHQTLACFL RNKLKKGGCE IVDNACLSEG MDERSKIDQT QFSLQHCDGE  1020
NGTNIEIVDV VNKTVASRDI VREPTKINKE DITLACLCKR LKKKRCVTIA QSSNHGDMSE  1080
*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1h88_C1e-284036274155MYB PROTO-ONCOGENE PROTEIN
1h89_C1e-284036274155MYB PROTO-ONCOGENE PROTEIN
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1692697KRRRRR
2943949KKNRTKQ
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHQ5266206e-41HQ526620.1 Gossypium herbaceum clone NBRI_A_EYI1BW401A8HI1 simple sequence repeat marker, mRNA sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012440312.10.0PREDICTED: uncharacterized protein LOC105765654 isoform X2
TrEMBLA0A0D2RPF50.0A0A0D2RPF5_GOSRA; Uncharacterized protein
STRINGGorai.008G288900.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM65042744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18100.10.0myb domain protein 4r1
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]