PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.003G130600.1
Common Name B456_003G130600, LOC105788811
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 770aa    MW: 86043.9 Da    PI: 6.6952
Description GRAS family protein
Gene Model
Gene Model ID       Type    Source
Gorai.003G130600.1  genome  JGI
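The MW value in the Protein Properties row above is the kind of figure that is usually derived from the sequence by summing average residue masses and adding one water. The sketch below illustrates that arithmetic only. The function name `average_mass` and the mass table are assumptions for illustration (standard average residue masses), not the database's documented method, and the result is not claimed to reproduce the reported 86043.9 Da exactly.

```python
# Average residue masses in daltons (free amino acid minus one water).
# Standard textbook values; assumed here, not taken from PlantTFDB.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def average_mass(seq: str) -> float:
    """Return the average molecular mass (Da) of a protein sequence.

    A trailing '*' (stop symbol) is ignored. Letters outside the 20
    standard amino acids raise KeyError.
    """
    seq = seq.strip().rstrip("*").upper()
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


if __name__ == "__main__":
    # Small sanity check on the dipeptide Gly-Gly.
    print(round(average_mass("GG"), 2))
```

Passing the full protein sequence from the Sequence section (with spaces and position counters removed) gives an estimate to compare with the reported value.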
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    382.1  6.7e-117  396    766  1          373
                GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPil 91 
                         l++lL+ cA+a++s+d+  a++l++++++++sp+gd  qRla+yf+ AL+arla++++++y +l +++ts    ++ l+a++++  v+P++
  Gorai.003G130600.1 396 LRTLLILCAQAITSNDNVTAKELIKQIRQHSSPYGDGSQRLAHYFVDALEARLAGTGTQIYTSLIAKRTS---AADMLKAYQVYISVCPFV 483
                         5789************************************************************999998...9***************** PP

                GRAS  92 kfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnv 180
                         k+  + aN+ I +a+e+++++HiiDf+i +G++WpaL++ La+Rp+gpp+lRiTg++ p++g   +e+++etg+rL k++e+ +vpfe+++
  Gorai.003G130600.1 484 KVPIIFANNYISKAAEKATKLHIIDFGIFYGFHWPALIHRLANRPGGPPKLRITGIEFPQPGfrPAEAVQETGRRLVKYCERYNVPFEYHA 574
                         *************************************************************9***************************** PP

                GRAS 181 lvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsl 271
                          +a+++e++++e+L+++++E++aVn+  +  +llde+v l+s+rd+vL+l+++++P+v+v++  + ++n++ F++rf eal ++salfd+ 
  Gorai.003G130600.1 575 -IAQKWETIRTEDLKINSDEVIAVNCLCRFRNLLDETVVLNSPRDTVLNLIRKINPDVFVHSVVNGSYNAPFFVTRFREALFHFSALFDMC 664
                         .7***************************************************************************************** PP

                GRAS 272 eaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgW 362
                         e+++++e++ r+++E+++ grei+n++aceg+er+er e++++W+ r ++aGF ++pl+ + +k +k  ++  +   ++v+ +  ++++gW
  Gorai.003G130600.1 665 ETNVSHEDNMRSMLEQKFYGREIMNIIACEGTERVERPESYKQWQVRNMRAGFVQLPLNPELMKRVKERVKARYHSDFMVDVDGRWMLQGW 755
                         ***********************************************************************999777************** PP

                GRAS 363 kdrpLvsvSaW 373
                         k+r +++ SaW
  Gorai.003G130600.1 756 KGRIIYASSAW 766
                         *********** PP
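The alignment above appears to follow the HMMER3 layout: a consensus line, a match line, the target sequence, and a posterior-probability (PP) line. In that layout a letter on the match line marks a residue identical to the consensus, and `+` marks a positively scoring substitution. Assuming that convention holds here, the following sketch tallies one match line. The helper name `summarize_match_line` is hypothetical.

```python
def summarize_match_line(match: str) -> dict:
    """Count identical and similar columns on an HMMER-style match line.

    Assumes letters mark identities, '+' marks positive-scoring
    substitutions, and spaces mark mismatches or gaps.
    """
    identical = sum(ch.isalpha() for ch in match)
    similar = match.count("+")
    return {"identical": identical, "similar": similar, "columns": len(match)}


if __name__ == "__main__":
    # Match line of the final alignment block (HMM positions 363-373).
    print(summarize_match_line("k+r +++ SaW"))
```

Leading indentation must be trimmed so that the match line starts at the same column as the aligned sequences. Otherwise the column count is inflated.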

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   67.592    370    747  IPR005202    Transcription factor GRAS
Pfam             PF03514   2.3E-114  396    766  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0009410  Biological Process  response to xenobiotic stimulus
GO:0045893  Biological Process  positive regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0005829  Cellular Component  cytosol
Sequence
Protein Sequence    Length: 770 aa
MDSHLTGFPH TVNGFKINDG YLLPNPNVYP KFEISDGVGS NDQSLDFSSL GVPFLPSLGL  60
GDSSFASILS MGSMGKEGDT FSPTDDTDVS DTVLKYISQV LLEEDMEEKP CMFHDSLALQ  120
AAEKSLYEVL GESYPPRDQA PVCVDPSVES PDNCSFGTSS DHSIHSGSSS CTSYSIESQW  180
NGDFSENNNR PSLLQTSIPE NFVFQSTVDP GSRFSSHSQN GSANNGNGFR GSPASEFLVP  240
NYFSQSELAL HFKRGFEEAS KFLPKGNQLN VGFKSNALTS ELKQKASNTV VKVESDRKEY  300
SPPRLIRKKS HEREDEDLEE RNNKQSAVLG DESELSDMFD KVLICAGRRG QSSSSTADET  360
LPNGPSKTLL PNEQTNGSNS GKARGKKQGK KKVVDLRTLL ILCAQAITSN DNVTAKELIK  420
QIRQHSSPYG DGSQRLAHYF VDALEARLAG TGTQIYTSLI AKRTSAADML KAYQVYISVC  480
PFVKVPIIFA NNYISKAAEK ATKLHIIDFG IFYGFHWPAL IHRLANRPGG PPKLRITGIE  540
FPQPGFRPAE AVQETGRRLV KYCERYNVPF EYHAIAQKWE TIRTEDLKIN SDEVIAVNCL  600
CRFRNLLDET VVLNSPRDTV LNLIRKINPD VFVHSVVNGS YNAPFFVTRF REALFHFSAL  660
FDMCETNVSH EDNMRSMLEQ KFYGREIMNI IACEGTERVE RPESYKQWQV RNMRAGFVQL  720
PLNPELMKRV KERVKARYHS DFMVDVDGRW MLQGWKGRII YASSAWIPA*
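The sequence block above is grouped in tens with a running position counter at the end of each full line, so it has to be flattened before it can be passed to other tools. A minimal sketch of that step follows. The helper name `parse_sequence_block` is hypothetical, and only the last two lines of the block are used as sample input.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a grouped, position-numbered sequence block into one string."""
    parts = []
    for line in block.splitlines():
        # Drop the trailing position counter (e.g. "  720"), if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces between the groups of ten residues.
        parts.append("".join(line.split()))
    return "".join(parts)


# Last two lines of the sequence block above, copied verbatim.
TAIL = """\
FDMCETNVSH EDNMRSMLEQ KFYGREIMNI IACEGTERVE RPESYKQWQV RNMRAGFVQL  720
PLNPELMKRV KERVKARYHS DFMVDVDGRW MLQGWKGRII YASSAWIPA*
"""

if __name__ == "__main__":
    seq = parse_sequence_block(TAIL)
    print(len(seq), seq[-4:])
```

The final line of the block carries 49 residue letters followed by `*`. With the twelve full lines of 60 before it, the displayed block holds 769 residues plus the stop symbol, which would be consistent with the reported length of 770 if the stop symbol is counted.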
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3h_A  3e-55   403          768        25         379      Protein SCARECROW
5b3h_D  3e-55   403          768        25         379      Protein SCARECROW
Expression -- Description
Source   Description
UniProt  TISSUE SPECIFICITY: Expressed in roots, shoots, flowers and siliques. {ECO:0000269|PubMed:10341448, ECO:0000269|PubMed:18500650}.
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_012471313.1      0.0      PREDICTED: scarecrow-like protein 33
Swissprot  Q9XE58              0.0      SCL14_ARATH; Scarecrow-like protein 14
TrEMBL     A0A0D2QK26          0.0      A0A0D2QK26_GOSRA; Uncharacterized protein
STRING     Gorai.003G130600.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM35827189
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G07530.1  0.0      SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]