PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.005G075100.1
Common NameB456_005G075100
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 457aa    MW: 52194.7 Da    PI: 7.6023
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.005G075100.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS287.73.2e-88824541374
                GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPil 91 
                         l++ Ll +A+ v++++++ a + L +l  +++  gd++qR++a+f+ +L ar++   s  y+ +  ++t+    ++++ a++ ++ vsP++
  Gorai.005G075100.1  82 LIHSLLIIATSVDKNNMNSALENLIQLYPTVTLMGDSVQRVVAHFADGLFARILTPKSPFYDMVMKEPTT----EQQFLAFTSLYRVSPYY 168
                         6899***************************************************9999**988777776....677788899******** PP

                GRAS  92 kfshltaNqaIleavege......ervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAee.lgvp 175
                         +f+h+taNqaI ea+e++      +++H+iDf++s+G+QWp+L+q+L++ +++  slR+Tg g+    s eel+et+ rL +fA+  +++ 
  Gorai.005G075100.1 169 QFAHFTANQAIIEAFEKDqetnnnRALHVIDFNVSYGFQWPSLIQSLSQ-SGKRVSLRLTGYGR----SLEELQETEARLVSFAKGfCNLV 254
                         ***************998667776788********************98.78899*********....9***************97369** PP

                GRAS 176 fefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysa 266
                         fef+ l  ++ ++l +++ r k++E++aVnlv++l +++    +     +++Lk v sl+P +v++veqe +    +Fl rf+e+l+y++a
  Gorai.005G075100.1 255 FEFQGL-LRSSSKLIINQ-REKKNETVAVNLVFHLSNFM----E----MSQTLKSVHSLKPSIVILVEQEGNPRVRNFLSRFMESLHYFAA 335
                         *****6.66666664555.7899***************9....3....4569*************************************** PP

                GRAS 267 lfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdg......... 348
                         +fdsl+  lp+es er ++Er  lg+ei+ ++  e+ +    +e++++W++++e+ GF+ ++ls+k   qaklll+    +          
  Gorai.005G075100.1 336 MFDSLDDCLPQESTERLSIERNHLGKEIKAMINSENLD----EENKGTWKNMMERHGFGGMKLSSKCLIQAKLLLKVRTHNYcplpcegen 422
                         *************************************9....8999*****************************8654332111111111 PP

                GRAS 349 ....yrveee.sgslv.lgWkdrpLvsvSaWr 374
                             +rv ++ +g ++ lgW+dr L+++SaW+
  Gorai.005G075100.1 423 tnggFRVFQRdEGKALsLGWQDRCLLTASAWQ 454
                         11126654334555555**************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098547.33956419IPR005202Transcription factor GRAS
PfamPF035141.1E-8582454IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
Sequence ? help Back to Top
Protein Sequence    Length: 457 aa     Download sequence    Send to blast
MEDRDEEEEH LNLSLAIVTD PNGENSRKRK RKLLNICNPL NPSHEGYFEG KIFKLLQVRE  60
EMLKLDHKRS KGLAENGKGL HLIHSLLIIA TSVDKNNMNS ALENLIQLYP TVTLMGDSVQ  120
RVVAHFADGL FARILTPKSP FYDMVMKEPT TEQQFLAFTS LYRVSPYYQF AHFTANQAII  180
EAFEKDQETN NNRALHVIDF NVSYGFQWPS LIQSLSQSGK RVSLRLTGYG RSLEELQETE  240
ARLVSFAKGF CNLVFEFQGL LRSSSKLIIN QREKKNETVA VNLVFHLSNF MEMSQTLKSV  300
HSLKPSIVIL VEQEGNPRVR NFLSRFMESL HYFAAMFDSL DDCLPQESTE RLSIERNHLG  360
KEIKAMINSE NLDEENKGTW KNMMERHGFG GMKLSSKCLI QAKLLLKVRT HNYCPLPCEG  420
ENTNGGFRVF QRDEGKALSL GWQDRCLLTA SAWQCV*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A1e-519045327378Protein SCARECROW
5b3h_A1e-519045326377Protein SCARECROW
5b3h_D1e-519045326377Protein SCARECROW
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12631RKRKRK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012482346.10.0PREDICTED: scarecrow-like protein 21
TrEMBLA0A0D2RH060.0A0A0D2RH06_GOSRA; Uncharacterized protein
STRINGGorai.005G075100.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM15761916
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G03450.14e-53RGA-like 2
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]