PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_A12G2728
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1034 aa    MW: 117455 Da    pI: 8.0442
Description MYB family protein
Gene Model
Gene Model ID  Type    Source   Coding Sequence
Gh_A12G2728    genome  NAU-NBI  View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  31.8   3.3e-10  399    444  1          46
                      TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      r++WT+eEd+ ++  v+++G ++W  Ia  +g +Rt+ qc  r+q 
      Gh_A12G2728 399 RNPWTAEEDKNILLIVQEMGIDNWFDIAVSLGSNRTPFQCLARYQR 444
                      79******************************************96 PP

2    Myb_DNA-binding  42.3   1.8e-13  453    496  2          46
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                        WT+eEd++l  av+ +G  +W ++a+t+  gRt+ qc +rw k
      Gh_A12G2728 453 REWTEEEDDQLRIAVEVFGESDWQSVASTLK-GRTGTQCSNRWKK 496
                      68****************************9.***********98 PP

3    Myb_DNA-binding  50.8   3.9e-16  506    549  2          46
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      grWT +Ed++l+ av ++G+++W++Ia+ ++ gRt  qc++rw +
      Gh_A12G2728 506 GRWTRDEDKRLKVAVLLFGPKNWRKIAEVVP-GRTQVQCRERWVN 549
                      8******************************.***********87 PP

4    Myb_DNA-binding  41.1   4e-13    559    601  3          47
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                       WT+eEd +l  a++++G   W+++a ++   Rt++qc  rw ++
      Gh_A12G2728 559 IWTEEEDSRLEAAIEEHGYC-WSKVATCVA-SRTDNQCWRRWKTL 601
                      5*****************99.*********.***********976 PP

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50090           5.517     294    393  IPR017877    Myb-like domain
SMART            SM00717           0.0037    298    395  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  1.7E-10   301    317  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  1.7E-10   366    406  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.41E-16  376    443  IPR009057    Homeodomain-like
PROSITE profile  PS50090           9.86      394    446  IPR017877    Myb-like domain
SMART            SM00717           1.3E-10   398    448  IPR001005    SANT/Myb domain
Pfam             PF00249           2.7E-9    399    444  IPR001005    SANT/Myb domain
CDD              cd00167           8.18E-7   401    446  No hit       No description
Gene3D           G3DSA:1.10.10.60  8.2E-13   407    456  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.1E-19   431    501  IPR009057    Homeodomain-like
PROSITE profile  PS51294           16.84     447    498  IPR017930    Myb domain
SMART            SM00717           1.2E-11   451    500  IPR001005    SANT/Myb domain
Pfam             PF00249           2.0E-11   453    496  IPR001005    SANT/Myb domain
CDD              cd00167           6.94E-10  454    498  No hit       No description
Gene3D           G3DSA:1.10.10.60  3.2E-16   457    501  IPR009057    Homeodomain-like
PROSITE profile  PS51294           21.975    499    555  IPR017930    Myb domain
SuperFamily      SSF46689          1.04E-24  503    598  IPR009057    Homeodomain-like
SMART            SM00717           9.4E-15   504    553  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  6.9E-18   506    554  IPR009057    Homeodomain-like
Pfam             PF00249           8.1E-15   506    549  IPR001005    SANT/Myb domain
CDD              cd00167           3.45E-12  507    551  No hit       No description
Gene3D           G3DSA:1.10.10.60  9.8E-16   555    601  IPR009057    Homeodomain-like
SMART            SM00717           3.3E-12   556    604  IPR001005    SANT/Myb domain
PROSITE profile  PS51294           15.255    558    606  IPR017930    Myb domain
CDD              cd00167           2.08E-10  560    601  No hit       No description
Pfam             PF00249           5.4E-11   560    601  IPR001005    SANT/Myb domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1034 aa
MSLKDQYDTV NDEEDIDDVE LSGNENGVGF DEDMEALKKA CLRTGTDLNG LDIVSVDNER  60
PSTSTAASPA SADSGSEDDL EIFRSIRNRL ALSEDVYEPL SLKPLCTLPP ISSDEDAEDD  120
FETLRVIQKR FLAYSTDCTR KNSREDNVEK TEPIYMTRTP LKDATGNDIC ENFLDYVQTG  180
NISHVSSDNA EMQPLSLVQC DQSDVNVLST YKSSRFPKSA QLLFDAIKKN RSYQTFLRSR  240
ALSAKKDPRI QLISARKLRT FEEPEVNDKR VTADYGPLEN SSVASYRMAL IDFPLKLERK  300
NWSREERENL EKGIRQQFQE RALQVSVGWL STSDGSPQDG NNLDGIIATV KDLEITPERI  360
REFLPKVDWN QLASFYVKGH SGAECETRWL NHEDPLINRN PWTAEEDKNI LLIVQEMGID  420
NWFDIAVSLG SNRTPFQCLA RYQRSLNPCI LKREWTEEED DQLRIAVEVF GESDWQSVAS  480
TLKGRTGTQC SNRWKKSLHP TRQRVGRWTR DEDKRLKVAV LLFGPKNWRK IAEVVPGRTQ  540
VQCRERWVNS LDPALNVGIW TEEEDSRLEA AIEEHGYCWS KVATCVASRT DNQCWRRWKT  600
LHPEEVPLLQ EARKIRKAAL ISNFVDRESE RPALGPNDFN IQLPMITATS EPSKEKGKRR  660
RRRPESEKEN AAALRLSPEK RSDKSCRKGA QTTTGRNPPL ENNNCTEPAE DVTFQKKRKR  720
EPPSGNNNHT KPAHVAIQKK RKQPHSGHVN CSDRMQDGAV QTYKRKQQSE SSKFVESVQD  780
NCSSHLLSTL CMTGNHEAES FGSSLTVKRR KNHKASPKQF PKRSICTESH EEQYSICSEI  840
PMFSGGDDGA EVMQNSGVES EILCADDASR KAKARSKRKT CINSLTSKSS RTIVAEHFKN  900
LSATKNTKKN RTKQQQSKSR KSNKPSGDED GQTDGDHQTL ACFLRNKLKK RRCEIVDNAC  960
LSEGMDERSK IDQTQFGLQH CDGENGTNIE IVDVVNKTVA PRDIVREPSK INKEDITLAC  1020
LRKRLKKKRR VTIA
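
The listing above groups residues in blocks of ten and ends each full line with a running position count. A minimal Python sketch for collapsing that layout into one contiguous string follows. The function name `parse_blocked_sequence` is hypothetical, and the sample holds only the first and last lines of the listing, so it yields 74 residues; the full listing should give the 1034 residues stated in this record.

```python
import re


def parse_blocked_sequence(text: str) -> str:
    """Collapse a space-grouped, position-annotated protein listing.

    Hypothetical helper: assumes residues are letters and that anything
    else on a line (spaces, running position counts) is layout only.
    """
    # Removing every non-letter drops both the gaps between 10-residue
    # groups and the trailing position counters such as "60".
    return re.sub(r"[^A-Za-z]", "", text).upper()


# First and last lines of the listing above, used as a small sample.
sample = """MSLKDQYDTV NDEEDIDDVE LSGNENGVGF DEDMEALKKA CLRTGTDLNG LDIVSVDNER  60
LRKRLKKKRR VTIA"""

seq = parse_blocked_sequence(sample)
print(len(seq), seq[:10], seq[-4:])
```

Comparing `len(seq)` for the complete listing against the stated length is a quick check that no line was lost when copying the record.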
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
1h88_C  2e-28    368          592        4          155      MYB PROTO-ONCOGENE PROTEIN
1h89_C  2e-28    368          592        4          155      MYB PROTO-ONCOGENE PROTEIN
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    657    662   KRRRRR
2    907    913   KKNRTKQ
3    1025   1029  KKKRR
4    1025   1030  KKKRRV
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HQ526620  4e-48    HQ526620.1 Gossypium herbaceum clone NBRI_A_EYI1BW401A8HI1 simple sequence repeat marker, mRNA sequence.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_016718327.1      0.0      PREDICTED: uncharacterized protein LOC107931053 isoform X1
TrEMBL  A0A1U8LUX3          0.0      A0A1U8LUX3_GOSHI; uncharacterized protein LOC107931053 isoform X1
STRING  Gorai.008G288900.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM6504              27           44
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G18100.1  1e-180   myb domain protein 4r1