PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.013G026300.1
Common Name B456_013G026300
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1588 aa    MW: 173056 Da    pI: 6.708
Description MYB family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Gorai.013G026300.1  genome  JGI     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  27.1   9.9e-09  809    850   3          46
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         +WT eE e++ d  + +G++ +++Ia  +  ++t  +c+++++k
  Gorai.013G026300.1 809 PWTSEEKEIFMDKLAAFGKD-FRKIATFLD-HKTTADCVEFYYK 850
                         8*****************99.*********.***********98 PP

2    Myb_DNA-binding  30.3   9.5e-10  1028   1067  4          45
                          S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
     Myb_DNA-binding    4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                          WT eE   +++av  +G++ ++ I+r++  +R++ qck ++ 
  Gorai.013G026300.1 1028 WTDEEKSVFIQAVSSYGKD-FAMISRCVR-TRSRDQCKVFFS 1067
                          *****************99.*********.********8776 PP
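As a quick consistency check on the two hits above, the ungapped length of each aligned query segment should equal End - Start + 1. The following is a minimal Python sketch under that assumption; the query strings and coordinates are copied from the alignments above, and `ungapped_length` is a hypothetical helper name, not part of any PlantTFDB or HMMER tooling.

```python
# Query segments copied from the two alignment blocks above,
# paired with the Start/End coordinates reported for each hit.
hits = [
    ("PWTSEEKEIFMDKLAAFGKD-FRKIATFLD-HKTTADCVEFYYK", 809, 850),
    ("WTDEEKSVFIQAVSSYGKD-FAMISRCVR-TRSRDQCKVFFS", 1028, 1067),
]


def ungapped_length(aligned: str) -> int:
    """Count residues in an aligned segment, ignoring '-' gap characters."""
    return sum(1 for ch in aligned if ch != "-")


for aligned, start, end in hits:
    span = end - start + 1  # inclusive coordinate span
    n = ungapped_length(aligned)
    print(f"{start}-{end}: residues={n}, span={span}, consistent={n == span}")
```

Both segments pass this check, which suggests the reported coordinates are 1-based and inclusive.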

Protein Features
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          1.05E-13  793    854   IPR009057    Homeodomain-like
PROSITE profile  PS51293           14.53     805    856   IPR017884    SANT domain
SMART            SM00717           1.6E-7    806    854   IPR001005    SANT/Myb domain
Pfam             PF00249           8.2E-6    808    850   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  3.1E-5    808    854   IPR009057    Homeodomain-like
PROSITE profile  PS51293           11.174    1023   1074  IPR017884    SANT domain
SMART            SM00717           1.0E-7    1024   1072  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          3.04E-10  1026   1074  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  5.4E-6    1027   1068  IPR009057    Homeodomain-like
Pfam             PF00249           3.6E-7    1028   1067  IPR001005    SANT/Myb domain
CDD              cd00167           7.59E-7   1028   1066  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1588 aa
MPQEPLPWDR KDIYKDRKHE RAELQPPPLL AARWREASSM SSYQHGSFRE FARWGSADFH  60
RPPGHGKQGN WHLFPEDIGG HGYVPWRSSD KILDGETYRQ SVSRGDGKYG RSYSRDNNRG  120
SYNQRDWRGH SLETSNGSPN TSVRPHDVNN EQRSVDDMFT YPSRTHSDFV NTWNQLQKDH  180
HDNRTCGVNG LGTGQRCERE NSLGSVDWKP LKWSRSGSLS SWGSGFSHSS SSKSLGGVDS  240
GEAKLELHQK NLAPVQSPSG DAAACVTSAP PSDETTSRKK PRLGWGEGLA KFEKKKDGGP  300
DTSINSGGAA ISLCNTEPNT SLNSNLVDKS PRVLGFSDCS SPATPSSVAC SSSPGVEEKS  360
FGKAANIDND VNNLCGSPSF GSQNQLEGSS FSLEKLDINS IINMGSSLID LLQSDEPSTM  420
DSSFVQSTAI NKLLLWKGDI LKALEMTESE IESLETELKS SKDDPGRRCQ CPATSSSLPV  480
RENGKSCEEQ EAASSMIPQP APLKIDPSND VLEVLQEANA DIKDGVIDSP GTATSDFMLS  540
SSLEKAESLC DVVKAQDCSG NSSSAQLKTM EEVILATDSC NEEAAAVISG EGSVLVKIDN  600
EAHVPESSNS DAGGENMTCD VILTTNKELA NRSSLVFKKL FPKDQYSIEI SEISNAVRGQ  660
ISSLIREKIA MRKRHLRFKE RVLTLKFKAF QYAWKEDMLS PAMRKYWAKS QKKYELSLRS  720
TYGGYQKHRS SSRSRVASSA GNLVLEPTAE MINFTSKLLL DSHVKLYRNA LKMPALILDE  780
QEQLSRFISS NGLVEDPCAI EKERALINPW TSEEKEIFMD KLAAFGKDFR KIATFLDHKT  840
TADCVEFYYK NHKSECFKKT KKKLDLTKQG KSSANTYLLT SGKKWSKEFN AASIDVLGSA  900
SVIATHAESG MQKHQTSSSR IFFGGRYSKI SRADDRIADR LSSFDIIGND RETAAADVLA  960
GICGSLSSEA MSSCITSSLD PGESFHRDWK CHKVDSLLKR RSTSNVAQNV DDGTCSDESC  1020
GEMDPADWTD EEKSVFIQAV SSYGKDFAMI SRCVRTRSRD QCKVFFSKAR KCLGLDLIDP  1080
RTRNLGTPMS DDANGGGSDA EDACVLERLV VSSDKLGSKP EDLPSNIVCT NMDERNPTSK  1140
PILPTDLNVP DENNRKLVDH RDSEAVQTVD SDAGLAELIS ECSVDMNIDS KAGSLQVQKS  1200
FVALGNLNAG RDVTEQGVSV AVSASLGAAA HPCTPSLDSV AVSKPATSLY ENDTKCSAET  1260
SSQSICRIDS NKASDGSVGK NSCSGFSLSA KGLHQIPPDL DSAKKPSVSN NSSANGSALH  1320
DSDGLRCEKI CNLGRLSSTL DYKENEAKQA QKSVREDESG RLSGKTSVNV TEPHRILRGY  1380
PLQVSTLKEM NGDVKCLATS KRGSAGPCLA QECYLQKCNS SKSAAELPLL VENLEQAKDR  1440
PKSHCRISDT ENPGRNGNVK LFGQILNSSS RDDKVSHFSK QNTEPSNSKP IGNNVDGNSK  1500
FDANNHVVEN VPKRSYGFSD GKRIQTGLSS LPDSSILMAK YPSAFANYPP TSSSQMEQQA  1560
LQTVVHGTDR TLNGVSFPLK GNKQQQR*
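The sequence above is laid out in blocks of 10 residues with a running position at the end of each line and a trailing `*` as the stop marker. To turn that layout into a single residue string, something like the following Python sketch may help; `parse_blocked_sequence` is a hypothetical helper, and the example input is only the first two lines of the sequence, not the full protein.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join 10-residue blocks into one string.

    Purely numeric tokens (the running position at the end of each line)
    are skipped, and a trailing '*' stop marker is stripped.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():  # running position counter, e.g. "60"
                continue
            residues.append(token.rstrip("*"))
    return "".join(residues)


# First two lines of the sequence block above, copied verbatim.
example = """\
MPQEPLPWDR KDIYKDRKHE RAELQPPPLL AARWREASSM SSYQHGSFRE FARWGSADFH  60
RPPGHGKQGN WHLFPEDIGG HGYVPWRSSD KILDGETYRQ SVSRGDGKYG RSYSRDNNRG  120
"""

seq = parse_blocked_sequence(example)
print(len(seq))
```

Applied to the full block, the resulting length can be compared with the stated 1588 aa as a sanity check on the copy.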
3D Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4a69_C  2e-17    768          858        4          94       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D  2e-17    768          858        4          94       NUCLEAR RECEPTOR COREPRESSOR 2
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX578805  0.0      JX578805.1 Gossypium hirsutum clone NBRI_GE10901 microsatellite sequence.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_016704621.1      0.0      PREDICTED: uncharacterized protein LOC107919754
TrEMBL  A0A0D2S767          0.0      A0A0D2S767_GOSRA; Uncharacterized protein
STRING  Gorai.013G026300.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM5260              27           44
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  0.0      MYB family protein
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]