PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.001G207600.1
Common Name B456_001G207600, LOC105771448
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 418aa    MW: 46777 Da    PI: 8.1684
Description GRAS family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Gorai.001G207600.1  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    341.8  1.1e-104  35     416  1          374
                GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPil 91 
                         l++lLl+cA++ ss++l++a+++L ++s las +gd+mqRl a f+ ALa rl++ +++l k l+ ++ ++  +++  +a+ lf  v+P+l
  Gorai.001G207600.1  35 LIQLLLTCAKHASSSNLHRADECLRQISLLASVSGDSMQRLSAWFASALAVRLVKRWPGLHKVLNYTQLPK--QDQLGQAQPLFGRVCPYL 123
                         689*****************************************************************995..777788999********* PP

                GRAS  92 kfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlv 182
                          fs+ +  +++ +a+ ge+ +H +D++ +    W++Ll+++++  +g+p+l++T +++    +k  lee+g rL k Ae+lg+pf+f +l 
  Gorai.001G207600.1 124 GFSYAIISRTLIKAMTGERVIHLVDLGSGDANLWIPLLRSFSCLLDGQPHLKVTCMNA----NKAILEELGPRLVKEAEALGLPFQFAPL- 209
                         **********************************************************....99*************************5. PP

                GRAS 183 akrledleleeLrvkpgEalaVnlvlqlhrll..............desvsleserdevLklvkslsPkvvvvveqeadhnsesFlerfle 259
                           +l +l+l++L vk+gEala  ++l+lh+ll               +  + +++  ++L++++s sPk++ +ve+eadhn +++++rf+e
  Gorai.001G207600.1 210 NVSLRELTLDKLGVKSGEALAFISILNLHSLLaeddsvdahfshnkTNGIKDSKQMFRFLSTIRSSSPKLFFLVEKEADHNLNKLVDRFVE 300
                         889*************************************999976333444446778********************************* PP

                GRAS 260 aleyysalfdsleaklpres..eerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdg 348
                          l+yysa+fds++a+++ ++  +er ++E++ +g+ei+n+vaceg er+erhe++++W  r+++aGFkpv + ++ +++ak +++ ++ +g
  Gorai.001G207600.1 301 GLHYYSAVFDSVDATFGGNTssRERLVLEEM-FGKEIENIVACEGVEREERHERYGRWMVRFGQAGFKPVMMWHDSTEDAKQMVEACGRNG 390
                         ************9999876657999999999.*********************************************************** PP

                GRAS 349 yrveeesgslvlgWkdrpLvsvSaWr 374
                         y++ +e++sl+++W+drpL++vSaW+
  Gorai.001G207600.1 391 YKIVNERASLMICWHDRPLYAVSAWT 416
                         *************************6 PP

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   47.806    9      396  IPR005202    Transcription factor GRAS
Pfam             PF03514   4.0E-102  35     416  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 418 aa
MMRSSPKPEV TLSLALSPGS TLQEVVKPQE RGVRLIQLLL TCAKHASSSN LHRADECLRQ  60
ISLLASVSGD SMQRLSAWFA SALAVRLVKR WPGLHKVLNY TQLPKQDQLG QAQPLFGRVC  120
PYLGFSYAII SRTLIKAMTG ERVIHLVDLG SGDANLWIPL LRSFSCLLDG QPHLKVTCMN  180
ANKAILEELG PRLVKEAEAL GLPFQFAPLN VSLRELTLDK LGVKSGEALA FISILNLHSL  240
LAEDDSVDAH FSHNKTNGIK DSKQMFRFLS TIRSSSPKLF FLVEKEADHN LNKLVDRFVE  300
GLHYYSAVFD SVDATFGGNT SSRERLVLEE MFGKEIENIV ACEGVEREER HERYGRWMVR  360
FGQAGFKPVM MWHDSTEDAK QMVEACGRNG YKIVNERASL MICWHDRPLY AVSAWTC*
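The sequence block above carries spacing and running position counters, so it has to be cleaned before the coordinates reported elsewhere in this record can be applied to it. The following is a minimal sketch of that step, not a PlantTFDB tool: the names `RAW_BLOCK`, `clean_sequence` and `domain_slice` are hypothetical, and the assumption is that the record's Start and End values are 1-based and inclusive. It strips the block to bare residue letters and cuts out the GRAS region at 35 to 416.

```python
import re

# Sequence block copied from this record, including the grouping spaces,
# the running position counters, and the trailing "*" symbol.
RAW_BLOCK = """
MMRSSPKPEV TLSLALSPGS TLQEVVKPQE RGVRLIQLLL TCAKHASSSN LHRADECLRQ  60
ISLLASVSGD SMQRLSAWFA SALAVRLVKR WPGLHKVLNY TQLPKQDQLG QAQPLFGRVC  120
PYLGFSYAII SRTLIKAMTG ERVIHLVDLG SGDANLWIPL LRSFSCLLDG QPHLKVTCMN  180
ANKAILEELG PRLVKEAEAL GLPFQFAPLN VSLRELTLDK LGVKSGEALA FISILNLHSL  240
LAEDDSVDAH FSHNKTNGIK DSKQMFRFLS TIRSSSPKLF FLVEKEADHN LNKLVDRFVE  300
GLHYYSAVFD SVDATFGGNT SSRERLVLEE MFGKEIENIV ACEGVEREER HERYGRWMVR  360
FGQAGFKPVM MWHDSTEDAK QMVEACGRNG YKIVNERASL MICWHDRPLY AVSAWTC*
"""


def clean_sequence(block: str) -> str:
    """Remove whitespace and position counters, then drop a trailing '*'."""
    letters = re.sub(r"[\s\d]", "", block)
    return letters.rstrip("*")


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = clean_sequence(RAW_BLOCK)
gras = domain_slice(protein, 35, 416)

# Residue count, domain length, and the first and last residues of the domain.
print(len(protein), len(gras), gras[:6], gras[-4:])
```

The slice can be checked against the signature-domain alignment, whose query rows begin at residue 35 with LIQLLL and end at residue 416 with SAWT. Counting only residue letters gives one fewer than the stated length of 418, so the stated length appears to include the trailing `*`.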
3D Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  1e-47    27           416        11         379      Protein SCARECROW
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_012448371.1      0.0      PREDICTED: scarecrow-like protein 3
TrEMBL  A0A0D2M188          0.0      A0A0D2M188_GOSRA; Uncharacterized protein
STRING  Gorai.001G207600.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM2066156
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G50420.1  1e-104   scarecrow-like 3
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]