PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.001G028200.2
Common Name B456_001G028200, LOC105764005
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1012 aa    MW: 111491 Da    pI: 4.7146
Description MYB family protein
Gene Model
Gene Model ID         Type     Source   Coding Sequence
Gorai.001G028200.2    genome   JGI      View CDS
Signature Domain
Signature Domain
No.  Domain            Score  E-value   Start  End  HMM Start  HMM End
1    Myb_DNA-binding   61.2   2.2e-19   39     85   1          48
                        TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
     Myb_DNA-binding  1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48
                        +g WT+eEde+l +av+q+ g++Wk+Ia+++   Rt+ qc +rwqk+l
  Gorai.001G028200.2 39 KGQWTPEEDEILRKAVQQFKGKNWKKIAECFK-DRTDVQCLHRWQKVL 85
                        799****************************9.************986 PP

2    Myb_DNA-binding   67.1   3e-21     91     137  1          48
                         TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
     Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                         +g+W++eEdel++++v+++G+++W+tIa++++ gR +kqc++rw+++l
  Gorai.001G028200.2  91 KGPWSKEEDELIIELVNKFGPKNWSTIAQHLP-GRIGKQCRERWHNHL 137
                         79******************************.*************97 PP

3    Myb_DNA-binding   49.9   7.2e-16   143    185  1          45
                         TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
     Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                         + +WT+eE++ l++a++ +G++ W+  ++ ++ gRt++ +k++w+
  Gorai.001G028200.2 143 KEAWTQEEELALIRAHQVYGNK-WAELSKFLP-GRTDNAIKNHWN 185
                         579*******************.*********.***********8 PP

Protein Features
Database         Entry ID           E-value    Start  End  InterPro ID  Description
PROSITE profile  PS51294            18.826     34     85   IPR017930    Myb domain
SuperFamily      SSF46689           7.9E-19    37     99   IPR009057    Homeodomain-like
SMART            SM00717            3.4E-16    38     87   IPR001005    SANT/Myb domain
Pfam             PF00249            1.5E-16    39     85   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60   3.8E-24    41     96   IPR009057    Homeodomain-like
CDD              cd00167            9.44E-15   42     85   No hit       No description
PROSITE profile  PS51294            33.067     86     141  IPR017930    Myb domain
SuperFamily      SSF46689           1.72E-32   88     184  IPR009057    Homeodomain-like
SMART            SM00717            3.9E-20    90     139  IPR001005    SANT/Myb domain
Pfam             PF00249            6.1E-20    91     137  IPR001005    SANT/Myb domain
CDD              cd00167            1.12E-17   93     137  No hit       No description
Gene3D           G3DSA:1.10.10.60   2.7E-28    97     140  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60   7.4E-22    141    192  IPR009057    Homeodomain-like
SMART            SM00717            9.1E-16    142    190  IPR001005    SANT/Myb domain
PROSITE profile  PS51294            20.316     142    192  IPR017930    Myb domain
Pfam             PF00249            3.5E-14    143    186  IPR001005    SANT/Myb domain
CDD              cd00167            3.16E-12   145    185  No hit       No description
Gene Ontology
GO Term      GO Category          GO Description
GO:0003677   Molecular Function   DNA binding
Sequence
Protein Sequence    Length: 1012 aa
MDGDRTTSTP SVVPSISDGA QRMRAFHGRT SGPTRRSTKG QWTPEEDEIL RKAVQQFKGK  60
NWKKIAECFK DRTDVQCLHR WQKVLNPELV KGPWSKEEDE LIIELVNKFG PKNWSTIAQH  120
LPGRIGKQCR ERWHNHLNPA INKEAWTQEE ELALIRAHQV YGNKWAELSK FLPGRTDNAI  180
KNHWNSSVKK KVDSYVASGL LEQFQFPLLT NQSQSMPSSS SRIQRNVDDS GAKSRTEADD  240
ISECSQEPTM AGCSQTTSDL ANAAVHTREH SHLTEISGVG KEKNSCPAPC SEEYYPSLED  300
VSFSIPEIPG EAGYSTCGDY QFGLTNLPNP SSLELGPESS GFKNHCIDTS RCHEVMNVAL  360
QTSVGLNAPT SFINTVTTSD KQEHMLITDD ECCRVLFSET VTDGCFVSED LTQGYNMVES  420
SSQASDIQKS ETGALQSNCP SRSEVLPTSC CQPFVPPLIS VEDGTTLIYG RELGQLTGQP  480
FETQEQELTM NVRDGFICTS DDHTYGTDMQ ERSYLDKDSP KLVPVNTFGS ESNAMQTCPI  540
VDDKPNLPAE QDEGGLCYEP PRFPSLDIPF FSCDLISSGS DKQQEYSPLG IRQLMMSSMN  600
CISPFRLSDS PLWDDSPDAK LKSAAKTFTG TPSILKKRHH DLLSPLSERK CGKKLETDMT  660
SNLSKEFSCL DVMLDASGTG NTSQESPSEC KTKSGVFIEE KENLCQAVDQ EQYNGGDHTE  720
PLDDEGQKKD SNGINSQGDI EKEACVTDAK DKTYANASDK IVQRPPEVLV EHNLNDLLLF  780
SPDHVGLKAD RPLLSSSTLT PRNQGLASEC FSGNACIIVS SPTPQIKNSE SQSISSATLE  840
NLADNAGNGA AIENYNMFSE TPLKRSIESS SAWKSPWFIN SFVPGPRIDT EITIEDIGYL  900
MSPPDRSYDA IGLMKQLSEH TAATYADALE VLGNETPESI VKGRRSNNSK MDKENNQMET  960
QSHLASDILV ERRILDFSEC GTPGKGTENR KSSTAVSFSS PTSSYLLKGC R*
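
The Start and End columns of the Signature Domain table appear to be 1-based, inclusive positions in this protein sequence. The following is a minimal Python sketch, not part of the PlantTFDB record, that strips the spacing and line-end position counters from the sequence block and slices out the three Myb_DNA-binding regions. The helper names `clean_sequence` and `domain` are hypothetical, and only the first four lines of the block (residues 1-240) are pasted in to keep the example short.

```python
import re

# First four lines of the numbered sequence block from this record
# (residues 1-240); the full block can be pasted in the same way.
BLOCK = """
MDGDRTTSTP SVVPSISDGA QRMRAFHGRT SGPTRRSTKG QWTPEEDEIL RKAVQQFKGK  60
NWKKIAECFK DRTDVQCLHR WQKVLNPELV KGPWSKEEDE LIIELVNKFG PKNWSTIAQH  120
LPGRIGKQCR ERWHNHLNPA INKEAWTQEE ELALIRAHQV YGNKWAELSK FLPGRTDNAI  180
KNHWNSSVKK KVDSYVASGL LEQFQFPLLT NQSQSMPSSS SRIQRNVDDS GAKSRTEADD  240
"""


def clean_sequence(block: str) -> str:
    """Keep letters only: drops spaces, newlines, position counters and a '*' stop marker."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(BLOCK)

# (start, end) pairs copied from the Signature Domain table.
for start, end in [(39, 85), (91, 137), (143, 185)]:
    print(start, end, domain(seq, start, end))
```

Each printed fragment should match the corresponding `Gorai.001G028200.2` line in the domain alignments once the `-` gap characters are removed, which is a quick way to confirm that the coordinates were read correctly.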
3D Structure
Structure
PDB ID   E-value  Query Start  Query End  Hit Start  Hit End  Description
1h88_C   1e-70    39           192        6          159      MYB PROTO-ONCOGENE PROTEIN
1h89_C   1e-70    39           192        6          159      MYB PROTO-ONCOGENE PROTEIN
Regulation -- PlantRegMap
Source        Upstream Regulator   Target Gene
PlantRegMap   Retrieve             -
Annotation -- Protein
Source   Hit ID               E-value  Description
Refseq   XP_012437928.1       0.0      PREDICTED: myb-related protein 3R-1-like
Refseq   XP_012437996.1       0.0      PREDICTED: myb-related protein 3R-1-like
TrEMBL   A0A0D2LU27           0.0      A0A0D2LU27_GOSRA; Uncharacterized protein
STRING   Gorai.001G028200.1   0.0      (Gossypium raimondii)
Best hit in Arabidopsis thaliana
Hit ID        E-value  Description
AT4G32730.2   0.0      Homeodomain-like protein
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]