PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.011G240800.1
Common Name B456_011G240800, LOC105776280
Organism Gossypium raimondii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1722 aa    MW: 188089 Da    pI: 6.2216
Description MYB family protein
Gene Model
Gene Model ID         Type      Source    Coding Sequence
Gorai.011G240800.1    genome    JGI       View CDS
Signature Domain
No.   Domain            Score   E-value   Start   End    HMM Start   HMM End
1     Myb_DNA-binding   23.1    1.7e-07   813     854    3           46
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         +WT +E e++    + +G++ +++Ia+ +  ++t  +c+++++k
  Gorai.011G240800.1 813 PWTSQEKEIFMAKLAAFGKD-FRKIASFLD-HKTTADCVEFYYK 854
                         8*****************99.*********.***********98 PP

2     Myb_DNA-binding   33.2    1.2e-10   1031    1071   3           45
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
     Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                           WT eE   +++av  +G++ ++ I+r++g +R++ qck ++ 
  Gorai.011G240800.1 1031 HWTDEEKSAFLQAVSSYGKD-FDMISRYVG-TRSRDQCKVFFS 1071
                          6*****************99.*********.********8776 PP
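
The Start and End values of each domain can be cross-checked against the query rows of the alignments. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that "-" (or ".") marks a gap in the query row. The helper name `ungapped_span` is hypothetical and is not part of PlantTFDB or HMMER.

```python
def ungapped_span(aligned_row, start):
    """Return (ungapped residues, end coordinate) for one aligned query row.

    Assumes 1-based inclusive coordinates and '-' or '.' as gap characters.
    """
    residues = aligned_row.replace("-", "").replace(".", "")
    end = start + len(residues) - 1
    return residues, end


# Query rows copied from the two Myb_DNA-binding alignments above.
domain_1 = "PWTSQEKEIFMAKLAAFGKD-FRKIASFLD-HKTTADCVEFYYK"
domain_2 = "HWTDEEKSAFLQAVSSYGKD-FDMISRYVG-TRSRDQCKVFFS"

residues_1, end_1 = ungapped_span(domain_1, 813)
residues_2, end_2 = ungapped_span(domain_2, 1031)
print(end_1, end_2)
```

For the two rows above, the computed end coordinates are 854 and 1071, which agree with the End values reported for domains 1 and 2.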

Protein Features
Database          Entry ID           E-value    Start   End    InterPro ID   Description
SuperFamily       SSF46689           2.32E-13   797     858    IPR009057     Homeodomain-like
PROSITE profile   PS51293            14.387     809     860    IPR017884     SANT domain
SMART             SM00717            4.0E-7     810     858    IPR001005     SANT/Myb domain
Gene3D            G3DSA:1.10.10.60   6.9E-5     813     854    IPR009057     Homeodomain-like
PROSITE profile   PS51293            13.035     1027    1078   IPR017884     SANT domain
SMART             SM00717            4.5E-9     1028    1076   IPR001005     SANT/Myb domain
Gene3D            G3DSA:1.10.10.60   1.8E-6     1029    1072   IPR009057     Homeodomain-like
SuperFamily       SSF46689           5.38E-11   1029    1078   IPR009057     Homeodomain-like
Pfam              PF00249            1.4E-8     1031    1071   IPR001005     SANT/Myb domain
CDD               cd00167            7.91E-8    1032    1070   No hit        No description
Gene Ontology
GO Term      GO Category          GO Description
GO:0005634   Cellular Component   nucleus
GO:0003677   Molecular Function   DNA binding
Sequence
Protein Sequence    Length: 1722 aa
MPPEPLPWDR KDFYKERKHE RTQSLPQQPL TARWRESSSM SPYQHASFRE FTRWGSADFR  60
RPPGHGRQGS WHLFAEENGG HGYVPSRSSN KILDDENFRQ LDSCVDGKYS RNSRENNRGS  120
SSQRDWRGHS WENCNGSPST PGRPHLVNNE RRSVDDMPTY LSHTHSDFVN TWDQLQKSQH  180
DNKTIAVNGL GTGQKCQSEN FVGSIDWKPL KWTRSGSLSS RGSGFSHSSS SKSLGGVDSG  240
EGKLELQQKN LTPVQSPSGD AAACVTSPAP SDETSSRKKP RLAWGEGLAK YEKKKVEGPD  300
TSIDRAGAKI SVRNTEFNNF LSSNLADKSP RVLGFSDCAS PATPSSVACS SSPGVEEKSF  360
GKAANVDNDT SNLCGSPTLG SQNHLEGPSF NLEKLDINSI INMGSSLTNL LQADDPCTVD  420
SSFVRSTAIS KLLLWKSDVL KALEMTESEI DSLESELKLL KGDSRSRCPC PATSSSFPEE  480
HGKACGEQEA ASSLIPRPAP LQIDACGDVL VGKQPLCNGV LEEVNDDVKD GDIDSPGTAT  540
SKFMEPLSLE KAVSPSDVVK FHECSGDFGT VQLMSMGKVI LATGSGNAGT ATTISAEGSV  600
LKRIDNDAHV PESSNSDVGD ENVMYEMILA TNKELAHVAS EVFNKLLPKD QYNSEIGNVA  660
CTQSDSAIRN KIAIRKQYLR FKERVLTIKF KAFQNAWKED LRSPSMRKYR AKSQKKYEFS  720
LRSAHGGYQK HRSSIHSRLT SPAGNPILEP RAEMINFTSK LLLGSHGRLY RNALKMPALI  780
LDEKEKKVSR FISSNGLVED PCAIEKERAL INPWTSQEKE IFMAKLAAFG KDFRKIASFL  840
DHKTTADCVE FYYKNHKSEC FEKTKKNDLS KQQGKSAVNT YLLTSGKKRG RELNAASLDV  900
LGAASVIAAH AESGMRNRHT SGRILLRGRF DSKRSQLDDS IAERSSNFDI VGSDQDTVAA  960
DVLAGICGSF SSEAMSSCIT SSADPGEGYH HDWKCHKVDS VVKRPSTSDV LQNVDGDTCS  1020
DESCGEMESS HWTDEEKSAF LQAVSSYGKD FDMISRYVGT RSRDQCKVFF SKARKCLGLD  1080
LIHSRTRNMG TPMSDDANGG ETDTEDACVQ ESSVVCSEKL GSKVEEDLPS TIVSMNVDES  1140
DLTREANLQS DHNISEGNIE RLADHKDSVA AEVNFSNVDH TEPISECGAG DMDVDSNQAE  1200
SLHVQNNVAL ANISALENHV AEEGVSVAVS ASHGGTGDCH PSLDASVEPK SGAAVLSTEG  1260
FGNNLEAQET LSSKNVMDVR DTRCNAEIDS QVICRPDLDK SSGESIDKNS CLDFSFNSEG  1320
LRQVPLDLGS AGKPSILLFP NENFSAKNSA SHSDASQCEK ICNQDRLSAT LAYQGNEDKQ  1380
PNNAVSGHEP EHLSGKPSVD LAELQISTLK EMDIDIGHSQ LPEVKRLSTS GKGVTGLYLV  1440
QDYLQKCNGP KSPSEFPQLV QNLEQTNSRP KSHSRSLSDT EKPCRNGNVK LFGQILNSSS  1500
QDDGKIRFPE QSMKSSNLNF RGHNNVDGNA SFSKFDQNII FAPENVPRRS YGFWDGNRIQ  1560
TGLSSLPDSE ILVAKYPAAF VNYPASSSQM QLQASRTIVR NTDRNMNGVS VFTPREISSN  1620
NGVMDYQVYG GHDCTKVVVP FAMDMKRREM FSEMQRRNGF DAISNLQHQG RGMVGMNVVG  1680
TGVGGVVGGS CPNLSDPVAV LRMQYAKTEQ YGGQSGSIMR E*
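
The sequence is displayed in blocks of ten residues with a running position counter at the end of each line. A minimal Python sketch for recovering the contiguous sequence from this layout follows. It assumes that only letters and the terminal "*" are sequence characters; the function name `parse_numbered_block` is hypothetical.

```python
import re


def parse_numbered_block(text, keep_stop=False):
    """Join a block-formatted protein sequence into one string.

    Whitespace and the trailing position counters are discarded.
    The terminal '*' (stop symbol) is dropped unless keep_stop is True.
    """
    allowed = r"[^A-Za-z*]" if keep_stop else r"[^A-Za-z]"
    return re.sub(allowed, "", text)


# First two lines of the sequence block above.
example = """MPPEPLPWDR KDFYKERKHE RTQSLPQQPL TARWRESSSM SPYQHASFRE FTRWGSADFR  60
RPPGHGRQGS WHLFAEENGG HGYVPSRSSN KILDDENFRQ LDSCVDGKYS RNSRENNRGS  120"""

seq = parse_numbered_block(example)
print(len(seq))
```

For this two-line excerpt the joined string has 120 residues, matching the counter at the end of the second line.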
3D Structure
Structure
PDB ID   E-value   Query Start   Query End   Hit Start   Hit End   Description
4a69_C   2e-16     771           862         4           94        NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D   2e-16     771           862         4           94        NUCLEAR RECEPTOR COREPRESSOR 2
Regulation -- PlantRegMap
Source        Upstream Regulator   Target Gene
PlantRegMap   Retrieve             -
Annotation -- Protein
Source   Hit ID               E-value   Description
Refseq   XP_012454301.1       0.0       PREDICTED: uncharacterized protein LOC105776280 isoform X1
Refseq   XP_012454302.1       0.0       PREDICTED: uncharacterized protein LOC105776280 isoform X1
TrEMBL   A0A0D2VR78           0.0       A0A0D2VR78_GOSRA; Uncharacterized protein
STRING   Gorai.011G240800.1   0.0       (Gossypium raimondii)
Orthologous Group
Lineage   Orthologous Group ID   Taxa Number   Gene Number
Malvids   OGEM5260               27            44
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT3G52250.1   0.0       MYB family protein
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]