PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_01428_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1079 aa    MW: 122737 Da    pI: 8.2197
Description MYB family protein
Gene Model
Gene Model ID                Type    Source
Cotton_A_01428_BGI-A2_v1.0   genome  BGI
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  31.7   3.5e-10  434    479  1          46
                                 TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
             Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                                 r++WT+eEd+ ++  v+++G ++W  Ia  +g +Rt+ qc  r+q 
  Cotton_A_01428_BGI-A2_v1.0 434 RNPWTAEEDKNILLIVQEMGIDNWFDIAVSLGSNRTPFQCLARYQR 479
                                 79******************************************96 PP

2    Myb_DNA-binding  42.2   1.9e-13  488    531  2          46
                                 SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
             Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                                   WT+eEd++l  av+ +G  +W ++a+t+  gRt+ qc +rw k
  Cotton_A_01428_BGI-A2_v1.0 488 REWTEEEDDQLRIAVEVFGESDWQSVASTLK-GRTGTQCSNRWKK 531
                                 68****************************9.***********98 PP

3    Myb_DNA-binding  50.7   4.1e-16  541    584  2          46
                                 SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
             Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                                 grWT +Ed++l+ av ++G+++W++Ia+ ++ gRt  qc++rw +
  Cotton_A_01428_BGI-A2_v1.0 541 GRWTRDEDKRLKVAVLLFGPKNWRKIAEVVP-GRTQVQCRERWVN 584
                                 8******************************.***********87 PP

4    Myb_DNA-binding  41.1   4.2e-13  594    636  3          47
                                 SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
             Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                                  WT+eEd +l  a++++G   W+++a ++   Rt++qc  rw ++
  Cotton_A_01428_BGI-A2_v1.0 594 IWTEEEDSRLEAAIEEHGYC-WSKVATCVA-SRTDNQCWRRWKTL 636
                                 5*****************99.*********.***********976 PP

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50090           6.202     329    428  IPR017877    Myb-like domain
SMART            SM00717           2.5E-4    333    430  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  5.2E-12   336    352  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  5.2E-12   401    441  IPR009057    Homeodomain-like
SuperFamily      SSF46689          5.35E-17  410    477  IPR009057    Homeodomain-like
PROSITE profile  PS50090           9.86      429    481  IPR017877    Myb-like domain
SMART            SM00717           1.3E-10   433    483  IPR001005    SANT/Myb domain
Pfam             PF00249           2.8E-9    434    479  IPR001005    SANT/Myb domain
CDD              cd00167           9.93E-7   436    481  No hit       No description
Gene3D           G3DSA:1.10.10.60  8.7E-13   442    491  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.17E-19  466    536  IPR009057    Homeodomain-like
PROSITE profile  PS51294           16.84     482    533  IPR017930    Myb domain
SMART            SM00717           1.2E-11   486    535  IPR001005    SANT/Myb domain
Pfam             PF00249           2.1E-11   488    531  IPR001005    SANT/Myb domain
CDD              cd00167           8.29E-10  489    533  No hit       No description
Gene3D           G3DSA:1.10.10.60  3.4E-16   492    536  IPR009057    Homeodomain-like
PROSITE profile  PS51294           21.927    534    590  IPR017930    Myb domain
SuperFamily      SSF46689          1.38E-24  538    633  IPR009057    Homeodomain-like
SMART            SM00717           9.4E-15   539    588  IPR001005    SANT/Myb domain
Pfam             PF00249           8.5E-15   541    584  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  2.4E-20   541    591  IPR009057    Homeodomain-like
CDD              cd00167           4.26E-12  542    586  No hit       No description
SMART            SM00717           3.3E-12   591    639  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  9.6E-16   592    636  IPR009057    Homeodomain-like
PROSITE profile  PS51294           15.255    593    641  IPR017930    Myb domain
Pfam             PF00249           5.7E-11   595    636  IPR001005    SANT/Myb domain
CDD              cd00167           2.58E-10  595    636  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1079 aa
MSLKDQYDTV NDEEEDIDDV ELSGNENGVG FDEDMEALKK ACLRTGTDLN GLDIVSVDNE  60
RPSTSTAASP ASADSGSEDD LEIFRSIRNR LALSEDVYEP LSLKPLCTLP PISSDEDAED  120
DFETLRVIQK RFLAYSTDCT RKNSREDNVE KTEPIYMTRT PLKDATGNDI CENFLDYVQT  180
GNISHVSSDN AEMQPLSLVQ CDQSDVNVLS TYKSSRFPKS AQLLFDAIKK NRSYQTFLRS  240
KLAHIEVKIE ENKKLKERVK ILRDFQVSCK KLTGRALSAK KDPRIQLISA RKLRTFEEPE  300
VNDKRVTADY GPLENSSVAS YRMALIDFPL KLERKNWSRE ERENLEKGIR QQFQERALQV  360
SVGWLSTSDG SPQDGNNLDG IIATVKDLEI TPERIREFLP KVDWNQLASF YVKGRSGAEC  420
ETRWLNHEDP LINRNPWTAE EDKNILLIVQ EMGIDNWFDI AVSLGSNRTP FQCLARYQRS  480
LNPCILKREW TEEEDDQLRI AVEVFGESDW QSVASTLKGR TGTQCSNRWK KSLHPTRQRV  540
GRWTRDEDKR LKVAVLLFGP KNWRKIAEVV PGRTQVQCRE RWVNSLDPTL NVGIWTEEED  600
SRLEAAIEEH GYCWSKVATC VASRTDNQCW RRWKTLHPEE VPLLQEARKI RKAALISNFV  660
DRESERPALG PNDFNIQLPM ITATSEPSKE KGKRRRRRPE SEKENAAALR LSPEKRSDKS  720
CRKGAQTTTG RNPPLENNNC TEPAEDVTFQ KKRKREPPSG NNNHTKPAHV AIQKKRKQPH  780
SGHVNCSDRM QDGAVQTYKR KQQSESSKFV ESVQDNCSSH LLSTLCMTGN HEAESFGSSL  840
TVKRRKNHKA SPKQFPKRSI CTESHEEQYS ICSEIPMFSG GDDGAEVMQN SGVESEILCA  900
DDASRKAKPR SKRKTCINSL TSKSSRTIVA EHFKNLSATK NTKKNRTKQQ QSKSRKSNKP  960
SGDEDGQTDG DHQTLACFLR NKLKKRRCEI VDNACLSEGM DERSKIDQTQ FGLQHCDGEN  1020
GTNIEIVDVV NKTVAPRDIV REPSKINKED ITLACLRKRL KKKRRVTIAQ SSNHGDMSE
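The sequence above is listed in blocks of ten residues with a running position counter at the end of each full line. Before it can be passed to other tools, the spaces and counters have to be stripped. A minimal sketch of that clean-up follows, using only the first two lines of the listing as input; the function name `parse_blocked_sequence` is hypothetical and not part of PlantTFDB.

```python
import re

def parse_blocked_sequence(text: str) -> str:
    """Join a blocked protein listing into one uppercase string.

    Everything that is not a letter (spaces, newlines, and the running
    position counters at the line ends) is dropped.
    """
    residues = re.sub(r"[^A-Za-z]", "", text)
    return residues.upper()

# First two lines of the sequence listing, copied verbatim.
EXCERPT = """
MSLKDQYDTV NDEEEDIDDV ELSGNENGVG FDEDMEALKK ACLRTGTDLN GLDIVSVDNE  60
RPSTSTAASP ASADSGSEDD LEIFRSIRNR LALSEDVYEP LSLKPLCTLP PISSDEDAED  120
"""

seq = parse_blocked_sequence(EXCERPT)
print(len(seq))   # 120
print(seq[:10])   # MSLKDQYDTV
```

Applied to the complete listing, the same function should return a string whose length matches the stated 1079 aa, which is a quick check that no line was lost in copying.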
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
1h88_C  2e-28    403          627        4          155      MYB PROTO-ONCOGENE PROTEIN
1h89_C  2e-28    403          627        4          155      MYB PROTO-ONCOGENE PROTEIN
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    692    697   KRRRRR
2    942    948   KKNRTKQ
3    1060   1064  KKKRR
4    1060   1065  KKKRRV
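When the Start and End values in the NLS table are compared with the protein sequence listed on this page, the motifs line up only if the coordinates are read as 0-based and inclusive. That reading is an inference from this record, not a documented PlantTFDB convention. The sketch below checks it for signals 3 and 4 using the last line of the sequence listing (residues 1021-1079 in 1-based numbering); `TAIL`, `TAIL_OFFSET` and `fetch` are illustrative names.

```python
# Residues 1021-1079 (1-based) of the protein, copied from the last line
# of the sequence listing with the block spacing removed.
TAIL = "GTNIEIVDVVNKTVAPRDIVREPSKINKEDITLACLRKRLKKKRRVTIAQSSNHGDMSE"
TAIL_OFFSET = 1020  # number of residues that precede TAIL in the full protein

def fetch(start0: int, end0: int) -> str:
    """Return the residues at 0-based, inclusive coordinates start0..end0.

    Assumption: the NLS table uses 0-based inclusive positions.
    Only positions that fall inside TAIL are supported.
    """
    lo = start0 - TAIL_OFFSET
    hi = end0 - TAIL_OFFSET + 1
    if lo < 0 or hi > len(TAIL):
        raise ValueError("coordinates fall outside the excerpt")
    return TAIL[lo:hi]

print(fetch(1060, 1064))  # KKKRR
print(fetch(1060, 1065))  # KKKRRV
```

In 1-based numbering the same motifs sit at 1061-1065 and 1061-1066, so a tool that expects 1-based positions would need each value shifted by one.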
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HQ526620  4e-48    HQ526620.1 Gossypium herbaceum clone NBRI_A_EYI1BW401A8HI1 simple sequence repeat marker, mRNA sequence.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_017636111.1      0.0      PREDICTED: uncharacterized protein LOC108478190
TrEMBL  A0A1U8LUX3          0.0      A0A1U8LUX3_GOSHI; uncharacterized protein LOC107931053 isoform X1
STRING  Gorai.008G288900.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM65042744
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G18100.1  0.0      myb domain protein 4r1