PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.013G026300.4
Common NameB456_013G026300
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1431aa    MW: 154855 Da    PI: 6.388
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.013G026300.4genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding27.28.8e-09652693346
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         +WT eE e++ d  + +G++ +++Ia  +  ++t  +c+++++k
  Gorai.013G026300.4 652 PWTSEEKEIFMDKLAAFGKD-FRKIATFLD-HKTTADCVEFYYK 693
                         8*****************99.*********.***********98 PP

2Myb_DNA-binding30.58.5e-10871910445
                         S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
     Myb_DNA-binding   4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                         WT eE   +++av  +G++ ++ I+r++  +R++ qck ++ 
  Gorai.013G026300.4 871 WTDEEKSVFIQAVSSYGKD-FAMISRCVR-TRSRDQCKVFFS 910
                         *****************99.*********.********8776 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466899.72E-14636697IPR009057Homeodomain-like
PROSITE profilePS5129314.53648699IPR017884SANT domain
SMARTSM007171.6E-7649697IPR001005SANT/Myb domain
PfamPF002497.3E-6651693IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.602.7E-5651697IPR009057Homeodomain-like
PROSITE profilePS5129311.174866917IPR017884SANT domain
SMARTSM007171.0E-7867915IPR001005SANT/Myb domain
SuperFamilySSF466892.73E-10869917IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.604.8E-6870911IPR009057Homeodomain-like
PfamPF002493.2E-7871910IPR001005SANT/Myb domain
CDDcd001676.74E-7871909No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1431 aa     Download sequence    Send to blast
MFTYPSRTHS DFVNTWNQLQ KDHHDNRTCG VNGLGTGQRC ERENSLGSVD WKPLKWSRSG  60
SLSSWGSGFS HSSSSKSLGG VDSGEAKLEL HQKNLAPVQS PSGDAAACVT SAPPSDETTS  120
RKKPRLGWGE GLAKFEKKKD GGPDTSINSG GAAISLCNTE PNTSLNSNLV DKSPRVLGFS  180
DCSSPATPSS VACSSSPGVE EKSFGKAANI DNDVNNLCGS PSFGSQNQLE GSSFSLEKLD  240
INSIINMGSS LIDLLQSDEP STMDSSFVQS TAINKLLLWK GDILKALEMT ESEIESLETE  300
LKSSKDDPGR RCQCPATSSS LPVRENGKSC EEQEAASSMI PQPAPLKIDP SNDVLEVLQE  360
ANADIKDGVI DSPGTATSDF MLSSSLEKAE SLCDVVKAQD CSGNSSSAQL KTMEEVILAT  420
DSCNEEAAAV ISGEGSVLVK IDNEAHVPES SNSDAGGENM TCDVILTTNK ELANRSSLVF  480
KKLFPKDQYS IEISEISNAV RGQISSLIRE KIAMRKRHLR FKERVLTLKF KAFQYAWKED  540
MLSPAMRKYW AKSQKKYELS LRSTYGGYQK HRSSSRSRVA SSAGNLVLEP TAEMINFTSK  600
LLLDSHVKLY RNALKMPALI LDEQEQLSRF ISSNGLVEDP CAIEKERALI NPWTSEEKEI  660
FMDKLAAFGK DFRKIATFLD HKTTADCVEF YYKNHKSECF KKTKKKLDLT KQGKSSANTY  720
LLTSGKKWSK EFNAASIDVL GSASVIATHA ESGMQKHQTS SSRIFFGGRY SKISRADDRI  780
ADRLSSFDII GNDRETAAAD VLAGICGSLS SEAMSSCITS SLDPGESFHR DWKCHKVDSL  840
LKRRSTSNVA QNVDDGTCSD ESCGEMDPAD WTDEEKSVFI QAVSSYGKDF AMISRCVRTR  900
SRDQCKVFFS KARKCLGLDL IDPRTRNLGT PMSDDANGGG SDAEDACVLE RLVVSSDKLG  960
SKPEDLPSNI VCTNMDERNP TSKPILPTDL NVPDENNRKL VDHRDSEAVQ TVDSDAGLAE  1020
LISECSVDMN IDSKAGSLQV QKSFVALGNL NAGRDVTEQG VSVAVSASLG AAAHPCTPSL  1080
DSVAVSKPAT SLYENDTKCS AETSSQSICR IDSNKASDGS VGKNSCSGFS LSAKGLHQIP  1140
PDLDSAKKPS VSNNSSANGS ALHDSDGLRC EKICNLGRLS STLDYKENEA KQAQKSVRED  1200
ESGRLSGKTS VNVTEPHRIL RGYPLQVSTL KEMNGDVKCL ATSKRGSAGP CLAQECYLQK  1260
CNSSKSAAEL PLLVENLEQA KDRPKSHCRI SDTENPGRNG NVKLFGQILN SSSRDDKVSH  1320
FSKQNTEPSN SKPIGNNVDG NSKFDANNHV VENVPKRSYG FSDGKRIQTG LSSLPDSSIL  1380
MAKYPSAFAN YPPTSSSQME QQALQTVVHG TDRTLNGVSF PLKGNKQQQR *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C2e-17611701494NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D2e-17611701494NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX5788050.0JX578805.1 Gossypium hirsutum clone NBRI_GE10901 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016704621.10.0PREDICTED: uncharacterized protein LOC107919754
TrEMBLA0A0D2V9N80.0A0A0D2V9N8_GOSRA; Uncharacterized protein
STRINGGorai.013G026300.10.0(Gossypium raimondii)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-171MYB family protein
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]