PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.001G111500.2
Common Name B456_001G111500, LOC105791430
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 669 aa    MW: 74710.2 Da    pI: 7.0013
Description GRAS family protein
Gene Model
Gene Model ID        Type    Source
Gorai.001G111500.2   genome  JGI
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    380.7  1.8e-116  291    661  1          373
                GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPil 91 
                         l++lL+ cA+avs +d + a++l++++++++sp+gd  qRla++f+ AL+arla++++++y++l++++ts    ++ l+a++++  ++P++
  Gorai.001G111500.2 291 LRTLLILCAQAVSGDDGATAKELIKQIRQHSSPTGDGSQRLAQCFVDALEARLAGTGTHIYSSLAVKRTS---AADMLKAYQVYLSACPFM 378
                         5789***************************************************************999...9***************** PP

                GRAS  92 kfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnv 180
                         k++ + aN++I + +e+++++H+iDf+i +G+QWpaL++ La+Rp+gpp+lRiTg++ p++g   +e+++etg+rLa+++e+ +vpfefn+
  Gorai.001G111500.2 379 KMAIFFANNTIFKVAEKATTLHVIDFGIFYGFQWPALIHCLANRPGGPPKLRITGIEFPRPGfrPAEAVQETGHRLARYCERYNVPFEFNA 469
                         *************************************************************9*9*************************** PP

                GRAS 181 lvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsl 271
                          va+++e++++e+L++++++++aVn+ ++ ++llde+v l+s+rd vL+l+++++P+++v++  + ++n++ F++rf eal ++salfd+ 
  Gorai.001G111500.2 470 -VAQKWETIQTEDLKINSNDVIAVNCLFRFKNLLDETVVLNSPRDIVLNLIRKINPDIFVHSIVNGSYNAPFFVTRFREALFHFSALFDMS 559
                         .7***************************************************************************************** PP

                GRAS 272 eaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgW 362
                         e+++++e++ r+++E+++ g+ei+n+vaceg+er+er e +++W+ r  +aGF+++pl+ + +k+++  +++++   ++v+ +  ++++gW
  Gorai.001G111500.2 560 ETNISQEDNLRSMLEQKFYGQEIMNIVACEGTERVERPEAYKQWQVRSVRAGFTQLPLDPELMKKVRGKVKECYHSDFMVDVDGRWMLQGW 650
                         **************************************************************************777************** PP

                GRAS 363 kdrpLvsvSaW 373
                         k+r +++ SaW
  Gorai.001G111500.2 651 KGRIIYASSAW 661
                         *********** PP
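
The alignment blocks above pair the GRAS HMM consensus row with the query row. As a rough illustration of how such rows can be compared, the sketch below counts positions where the two rows carry the same residue, ignoring case and skipping gap characters. The function name `count_identities` is hypothetical and not part of PlantTFDB or HMMER; treating `.` and `-` as the gap characters is an assumption based on how the rows appear in this record.

```python
def count_identities(consensus: str, query: str) -> tuple[int, int]:
    """Return (identical, aligned) counts for two equal-length aligned rows.

    Positions where either row holds a gap character ('.' or '-') are skipped.
    Residues are compared case-insensitively.
    """
    if len(consensus) != len(query):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    aligned = 0
    for c, q in zip(consensus, query):
        if c in "-." or q in "-.":
            continue  # gap in one of the rows: not an aligned residue pair
        aligned += 1
        if c.upper() == q.upper():
            identical += 1
    return identical, aligned


# Final alignment block from this record (HMM positions 363-373).
hmm_row = "kdrpLvsvSaW"
query_row = "KGRIIYASSAW"
identical, aligned = count_identities(hmm_row, query_row)
print(f"{identical}/{aligned} identical positions")
```

The same function could be applied to each block in turn and the counts summed to estimate identity across the whole domain.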

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   66.836    265    642  IPR005202    Transcription factor GRAS
Pfam             PF03514   6.3E-114  291    661  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 669 aa
MEEKPCLFHD SLALQAAEKS LYEVLGESYP PRNRAPLCSG HSVESSPDDC SFRTSGDHST  60
YAGSSSNTSK SIDSRWNGDL GENNDKPSLF EASVPDNFVF QSSVNSFSQS SARFQKVTAS  120
NGKGLVGSNS NELAIPNYFS ESELALHFKK GVEEASKFLP KGNQLTFDFK SNAWTAELNQ  180
KAPVTVVEME SDWKEYSPHR LTGKKNHDRE DEDFEEGRNN KQSAVSGDES ELSDMFDKVL  240
ICAGRNEKSP ACGADETPRN GPSKLQPKEQ TNGSGKARGK KQGKKKEVVD LRTLLILCAQ  300
AVSGDDGATA KELIKQIRQH SSPTGDGSQR LAQCFVDALE ARLAGTGTHI YSSLAVKRTS  360
AADMLKAYQV YLSACPFMKM AIFFANNTIF KVAEKATTLH VIDFGIFYGF QWPALIHCLA  420
NRPGGPPKLR ITGIEFPRPG FRPAEAVQET GHRLARYCER YNVPFEFNAV AQKWETIQTE  480
DLKINSNDVI AVNCLFRFKN LLDETVVLNS PRDIVLNLIR KINPDIFVHS IVNGSYNAPF  540
FVTRFREALF HFSALFDMSE TNISQEDNLR SMLEQKFYGQ EIMNIVACEG TERVERPEAY  600
KQWQVRSVRA GFTQLPLDPE LMKKVRGKVK ECYHSDFMVD VDGRWMLQGW KGRIIYASSA  660
WVPASYPV*
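
The sequence above is laid out in groups of ten residues with a running position count at the end of each line and a trailing `*` for the stop. A minimal sketch of turning that layout into one contiguous string, and of pulling out a region by 1-based inclusive coordinates such as the Start/End values listed for the GRAS domain, might look like this. The helper names `parse_sequence_block` and `slice_region` are hypothetical and not part of any PlantTFDB tooling.

```python
def parse_sequence_block(text: str) -> str:
    """Join a grouped, position-numbered sequence block into one string.

    Purely numeric tokens (the running position counts) are dropped, and a
    trailing '*' stop marker is stripped from residue groups.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # running position count, not sequence
            residues.append(token.rstrip("*"))
    return "".join(residues)


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if start < 1 or end < start or end > len(seq):
        raise ValueError("coordinates out of range")
    return seq[start - 1:end]


# First line of the sequence block in this record.
sample = "MEEKPCLFHD SLALQAAEKS LYEVLGESYP PRNRAPLCSG HSVESSPDDC SFRTSGDHST  60"
seq = parse_sequence_block(sample)
print(len(seq))                  # number of residues parsed from the sample line
print(slice_region(seq, 1, 10))  # first ten residues
```

Applied to the full block, `slice_region(seq, 291, 661)` would return the span reported for the GRAS domain in the Signature Domain table.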
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  7e-54   273          665        1          382      Protein SCARECROW
Expression -- Description
Source   Description
UniProt  TISSUE SPECIFICITY: Expressed in roots, shoots, flowers and siliques. {ECO:0000269|PubMed:10341448, ECO:0000269|PubMed:18500650}.
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_012474953.1      0.0      PREDICTED: scarecrow-like protein 33
Swissprot  Q9XE58              0.0      SCL14_ARATH; Scarecrow-like protein 14
TrEMBL     A0A0D2N860          0.0      A0A0D2N860_GOSRA; Uncharacterized protein
STRING     Gorai.001G111500.1  0.0      (Gossypium raimondii)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G07530.1  0.0      SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]