PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.009G289600.1
Common NameB456_009G289600, LOC105767612
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 455aa    MW: 51731.3 Da    PI: 7.4973
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.009G289600.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS310.53.7e-95714521374
                GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPil 91 
                         l++lLl  A+ v++++ + a   L++l + +s  gd++qR++ayf+ +L arl+   s  y+ +  ++t+    +ee+ a++ ++ vsP++
  Gorai.009G289600.1  71 LIHLLLITATSVDENNVNSALDNLTQLYQSVSLMGDSVQRVVAYFADGLVARLLTRKSPFYDMIMKEPTA----EEEFLAFTCLYRVSPYY 157
                         689****************************************************999***999888886....78889999********* PP

                GRAS  92 kfshltaNqaIleavege.....ervHiiDfdisqGlQWpaLlqaLasRp..egppslRiTgvgspesgskeeleetgerLakfAeel.gv 174
                         +f+h+taNqaI ea e+e     +++H+iDfd+ +G+QWp+L+q+L++++  ++  slR+Tg+g+    s  el+et++rL +fA+ + ++
  Gorai.009G289600.1 158 QFAHFTANQAIIEAYEKEeefnnRALHVIDFDVAYGFQWPSLIQSLSEKAsgGNRISLRLTGFGA----SFAELQETENRLVSFAKGFrNL 244
                         ****************9977777788***********************9634455*********....9****************9758* PP

                GRAS 175 pfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyys 265
                          fef+ l    l   +l +Lr k++E++aVnlv++l +l  +s+++++    +Lk v+s +P +v +veqe  ++  sFl rf+e+l+y++
  Gorai.009G289600.1 245 VFEFQGL----LRGSKLTNLRKKKNETVAVNLVFHLNTLN-NSMKISD----TLKSVRSHRPSIVMLVEQEGGRSPRSFLSRFMESLHYFA 326
                         ******8....7777889******************9996.6666666....*************************************** PP

                GRAS 266 alfdsleaklpreseerikvErellgreivnvvacegae....rrerhetlekWrerleeaGFkpvplsekaakqaklllr........kv 344
                         a+fdsl+  lp es er ++E+  lg+ei++++ c ++e    +  r e++e+W++r+e+ GF+  +ls+k   qaklll+        ++
  Gorai.009G289600.1 327 AMFDSLDDCLPLESAERLSIEKNHLGKEIKSMINCDKDEennkSSSRYEKMETWKSRMESHGFEGTKLSSKCLIQAKLLLKitthycplQC 417
                         ***********************************887633336789********************************983332222212 PP

                GRAS 345 k...sdgyrveee..sgslvlgWkdrpLvsvSaWr 374
                         +   + g+rv e+   ++l lgW+dr L+++SaW+
  Gorai.009G289600.1 418 EgevNGGFRVFERdeAKALSLGWQDRCLLTASAWQ 452
                         21222347777653355667**************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098548.8445419IPR005202Transcription factor GRAS
PfamPF035141.3E-9271452IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
Sequence ? help Back to Top
Protein Sequence    Length: 455 aa     Download sequence    Send to blast
MEDEPLNLSL AIMADPGGEK INRKRKRRVP LPSSYEGCEG KIFKLLQVRE EMLKLDYKRK  60
GVVEDGKALH LIHLLLITAT SVDENNVNSA LDNLTQLYQS VSLMGDSVQR VVAYFADGLV  120
ARLLTRKSPF YDMIMKEPTA EEEFLAFTCL YRVSPYYQFA HFTANQAIIE AYEKEEEFNN  180
RALHVIDFDV AYGFQWPSLI QSLSEKASGG NRISLRLTGF GASFAELQET ENRLVSFAKG  240
FRNLVFEFQG LLRGSKLTNL RKKKNETVAV NLVFHLNTLN NSMKISDTLK SVRSHRPSIV  300
MLVEQEGGRS PRSFLSRFME SLHYFAAMFD SLDDCLPLES AERLSIEKNH LGKEIKSMIN  360
CDKDEENNKS SSRYEKMETW KSRMESHGFE GTKLSSKCLI QAKLLLKITT HYCPLQCEGE  420
VNGGFRVFER DEAKALSLGW QDRCLLTASA WQCV*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A7e-557945127378Protein SCARECROW
5b3h_A7e-557945126377Protein SCARECROW
5b3h_D7e-557945126377Protein SCARECROW
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12227RKRKRR
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012442637.10.0PREDICTED: scarecrow-like protein 21
TrEMBLA0A0D2SA130.0A0A0D2SA13_GOSRA; Uncharacterized protein
STRINGGorai.009G289600.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM15761916
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G04890.13e-59SCARECROW-like 21
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]