PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.013G026300.2
Common NameB456_013G026300
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1389aa    MW: 150637 Da    PI: 6.4495
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.013G026300.2genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding27.38.5e-09610651346
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         +WT eE e++ d  + +G++ +++Ia  +  ++t  +c+++++k
  Gorai.013G026300.2 610 PWTSEEKEIFMDKLAAFGKD-FRKIATFLD-HKTTADCVEFYYK 651
                         8*****************99.*********.***********98 PP

2Myb_DNA-binding30.58.2e-10829868445
                         S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
     Myb_DNA-binding   4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                         WT eE   +++av  +G++ ++ I+r++  +R++ qck ++ 
  Gorai.013G026300.2 829 WTDEEKSVFIQAVSSYGKD-FAMISRCVR-TRSRDQCKVFFS 868
                         *****************99.*********.********8776 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.97E-14594655IPR009057Homeodomain-like
PROSITE profilePS5129314.53606657IPR017884SANT domain
SMARTSM007171.6E-7607655IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.602.6E-5609655IPR009057Homeodomain-like
PfamPF002497.1E-6609651IPR001005SANT/Myb domain
PROSITE profilePS5129311.174824875IPR017884SANT domain
SMARTSM007171.0E-7825873IPR001005SANT/Myb domain
SuperFamilySSF466892.65E-10827875IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.604.7E-6828869IPR009057Homeodomain-like
PfamPF002493.1E-7829868IPR001005SANT/Myb domain
CDDcd001676.51E-7829867No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1389 aa     Download sequence    Send to blast
MFTYPSRTHS DFVNTWNQLQ KDHHDNRTCG VNGLGTGQRC ERENSLGSVD WKPLKWSRSG  60
SLSSWGSGFS HSSSSKSLGG VDSGEAKLEL HQKNLAPVQS PSGDAAACVT SAPPSDETTS  120
RKKPRLGWDK SPRVLGFSDC SSPATPSSVA CSSSPGVEEK SFGKAANIDN DVNNLCGSPS  180
FGSQNQLEGS SFSLEKLDIN SIINMGSSLI DLLQSDEPST MDSSFVQSTA INKLLLWKGD  240
ILKALEMTES EIESLETELK SSKDDPGRRC QCPATSSSLP VRENGKSCEE QEAASSMIPQ  300
PAPLKIDPSN DVLEVLQEAN ADIKDGVIDS PGTATSDFML SSSLEKAESL CDVVKAQDCS  360
GNSSSAQLKT MEEVILATDS CNEEAAAVIS GEGSVLVKID NEAHVPESSN SDAGGENMTC  420
DVILTTNKEL ANRSSLVFKK LFPKDQYSIE ISEISNAVRG QISSLIREKI AMRKRHLRFK  480
ERVLTLKFKA FQYAWKEDML SPAMRKYWAK SQKKYELSLR STYGGYQKHR SSSRSRVASS  540
AGNLVLEPTA EMINFTSKLL LDSHVKLYRN ALKMPALILD EQEQLSRFIS SNGLVEDPCA  600
IEKERALINP WTSEEKEIFM DKLAAFGKDF RKIATFLDHK TTADCVEFYY KNHKSECFKK  660
TKKKLDLTKQ GKSSANTYLL TSGKKWSKEF NAASIDVLGS ASVIATHAES GMQKHQTSSS  720
RIFFGGRYSK ISRADDRIAD RLSSFDIIGN DRETAAADVL AGICGSLSSE AMSSCITSSL  780
DPGESFHRDW KCHKVDSLLK RRSTSNVAQN VDDGTCSDES CGEMDPADWT DEEKSVFIQA  840
VSSYGKDFAM ISRCVRTRSR DQCKVFFSKA RKCLGLDLID PRTRNLGTPM SDDANGGGSD  900
AEDACVLERL VVSSDKLGSK PEDLPSNIVC TNMDERNPTS KPILPTDLNV PDENNRKLVD  960
HRDSEAVQTV DSDAGLAELI SECSVDMNID SKAGSLQVQK SFVALGNLNA GRDVTEQGVS  1020
VAVSASLGAA AHPCTPSLDS VAVSKPATSL YENDTKCSAE TSSQSICRID SNKASDGSVG  1080
KNSCSGFSLS AKGLHQIPPD LDSAKKPSVS NNSSANGSAL HDSDGLRCEK ICNLGRLSST  1140
LDYKENEAKQ AQKSVREDES GRLSGKTSVN VTEPHRILRG YPLQVSTLKE MNGDVKCLAT  1200
SKRGSAGPCL AQECYLQKCN SSKSAAELPL LVENLEQAKD RPKSHCRISD TENPGRNGNV  1260
KLFGQILNSS SRDDKVSHFS KQNTEPSNSK PIGNNVDGNS KFDANNHVVE NVPKRSYGFS  1320
DGKRIQTGLS SLPDSSILMA KYPSAFANYP PTSSSQMEQQ ALQTVVHGTD RTLNGVSFPL  1380
KGNKQQQR*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C2e-17569659494NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D2e-17569659494NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX5788050.0JX578805.1 Gossypium hirsutum clone NBRI_GE10901 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017619123.10.0PREDICTED: uncharacterized protein LOC108463731 isoform X2
TrEMBLA0A0D2V8D60.0A0A0D2V8D6_GOSRA; Uncharacterized protein
STRINGGorai.013G026300.10.0(Gossypium raimondii)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-156MYB family protein
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]