PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_03284_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties    Length: 443 aa    MW: 48181.5 Da    pI: 5.589
Description GRAS family protein
Gene Model
Gene Model ID                 Type      Source    Coding Sequence
Cotton_A_03284_BGI-A2_v1.0    genome    BGI       View CDS
Signature Domain
No.    Domain    Score    E-value     Start    End    HMM Start    HMM End
1      GRAS      378.4    9.1e-116    81       440    3            374
                        GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfs 85 
                                  lLl+cAe+v+ ++le a+ lL ++s+l+sp g++ +R+ ayf+ AL+ar+++s+ ++y+ l  ++ + ++s++ ++al+ ++
  Cotton_A_03284_BGI-A2_v1.0  81 GLLLQCAECVAMDNLEDATDLLPEISQLSSPFGSSPERVGAYFAHALQARVVSSSLRTYSPLDNKSLTLTQSQKIFNALQSYN 163
                                 58*****************************************************************999************* PP

                        GRAS  86 evsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakf 168
                                  +sP++kfsh+taNqaI +a++ge++vH+iD+di+qGlQWp L++ LasR+++  s+RiTg+gs    s+e le tg+rLa+f
  Cotton_A_03284_BGI-A2_v1.0 164 SISPLVKFSHFTANQAIFQALSGEDCVHVIDLDIMQGLQWPGLFHILASRSKKIRSMRITGFGS----SSELLELTGKRLADF 242
                                 ****************************************************************....99************* PP

                        GRAS 169 AeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnse 251
                                 A++lg+pfef++l  k  +  +l++L v++ Ea++V++++  h l+d ++s       +L+l+  l+Pk++++veq+++h ++
  Cotton_A_03284_BGI-A2_v1.0 243 AASLGLPFEFHPLEGKIGNLTDLSQLGVRSSEAVVVHWMH--HCLYDITGSDLA----TLRLLTLLKPKLITIVEQDLSH-GG 318
                                 ************9777777889****************99..777766666666....9*********************.78 PP

                        GRAS 252 sFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaa 334
                                 sFl rf+eal+yysalfd+l  +l+ +s er++vE++l+g+ei+n+va  g +r+ + + +e+W e l+++GF++v+l+ + a
  Cotton_A_03284_BGI-A2_v1.0 319 SFLGRFVEALHYYSALFDALGDGLSVDSLERHTVEQQLFGNEIRNIVAVGGPKRTGEVK-VERWGEELRRVGFQTVSLGGNPA 400
                                 9****************************************************998887.9********************** PP

                        GRAS 335 kqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                                 +qa+lll +++++gy++ ee+g+l lgWkd +L+++SaW+
  Cotton_A_03284_BGI-A2_v1.0 401 AQASLLLGMFPWKGYTLLEENGCLKLGWKDLSLLTASAWQ 440
                                 ***************************************6 PP
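
The coordinates in the domain table are 1-based and inclusive, so span lengths need a +1. A minimal sketch of that arithmetic follows, with the values transcribed from the table and alignment above; the `DomainHit` class and its field names are hypothetical and are not part of any PlantTFDB export format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One row of the signature-domain table (hypothetical container)."""
    domain: str
    score: float
    evalue: float
    start: int      # first aligned residue in the protein (1-based, inclusive)
    end: int        # last aligned residue in the protein (1-based, inclusive)
    hmm_start: int  # first aligned position in the profile HMM
    hmm_end: int    # last aligned position in the profile HMM

    @property
    def query_span(self) -> int:
        """Number of protein residues covered by the hit."""
        return self.end - self.start + 1

    @property
    def hmm_span(self) -> int:
        """Number of HMM positions covered by the hit."""
        return self.hmm_end - self.hmm_start + 1

    def coverage(self, protein_length: int) -> float:
        """Fraction of the full-length protein covered by the hit."""
        return self.query_span / protein_length


# Values transcribed from the table above; 443 is the stated protein length.
hit = DomainHit("GRAS", 378.4, 9.1e-116, 81, 440, 3, 374)
print(hit.query_span, hit.hmm_span, round(hit.coverage(443), 3))
```

Under these assumptions the GRAS domain spans 360 of the 443 residues, which is most of the protein.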

Protein Features
Database           Entry ID    E-value     Start    End    InterPro ID    Description
PROSITE profile    PS50985     56.559      53       420    IPR005202      Transcription factor GRAS
Pfam               PF03514     3.1E-113    81       440    IPR005202      Transcription factor GRAS
Gene Ontology
GO Term       GO Category           GO Description
GO:0048366    Biological Process    leaf development
GO:0090610    Biological Process    bundle sheath cell fate specification
GO:0005634    Cellular Component    nucleus
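
Each Gene Ontology row has three fields: a GO identifier, one of the three GO categories, and a free-text description. A sketch of splitting a row into those fields with a regular expression is shown below. The `GoAnnotation` type and `parse_go_row` helper are hypothetical names, and the pattern assumes the separators between columns may be missing, as can happen when the page is copied as plain text.

```python
import re
from typing import NamedTuple


class GoAnnotation(NamedTuple):
    term: str
    category: str
    description: str


# The three GO categories as they are labelled in the table.
_CATEGORIES = "Biological Process|Molecular Function|Cellular Component"

# GO identifiers are "GO:" followed by seven digits; whitespace between
# fields is optional so that fused rows are still recognised.
_ROW = re.compile(rf"^(GO:\d{{7}})\s*({_CATEGORIES})\s*(.+)$")


def parse_go_row(line: str) -> GoAnnotation:
    """Split one GO table row into (term, category, description)."""
    match = _ROW.match(line.strip())
    if match is None:
        raise ValueError(f"not a GO table row: {line!r}")
    return GoAnnotation(*match.groups())


rows = [
    "GO:0048366Biological Processleaf development",
    "GO:0090610    Biological Process    bundle sheath cell fate specification",
    "GO:0005634Cellular Componentnucleus",
]
annotations = [parse_go_row(row) for row in rows]
for annotation in annotations:
    print(annotation)
```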
Sequence
Protein Sequence    Length: 443 aa
MLQSLVPQSP ISSNTTSSNP SSSMKSKRVE RDVSAATGED STADDPSNKR PNYSSGDKAA  60
AANEHDETVI EGESTGLRLL GLLLQCAECV AMDNLEDATD LLPEISQLSS PFGSSPERVG  120
AYFAHALQAR VVSSSLRTYS PLDNKSLTLT QSQKIFNALQ SYNSISPLVK FSHFTANQAI  180
FQALSGEDCV HVIDLDIMQG LQWPGLFHIL ASRSKKIRSM RITGFGSSSE LLELTGKRLA  240
DFAASLGLPF EFHPLEGKIG NLTDLSQLGV RSSEAVVVHW MHHCLYDITG SDLATLRLLT  300
LLKPKLITIV EQDLSHGGSF LGRFVEALHY YSALFDALGD GLSVDSLERH TVEQQLFGNE  360
IRNIVAVGGP KRTGEVKVER WGEELRRVGF QTVSLGGNPA AQASLLLGMF PWKGYTLLEE  420
NGCLKLGWKD LSLLTASAWQ PSD
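
The sequence is laid out in groups of ten residues with a running position counter at the end of each full line. The sketch below shows one way to turn that layout into a contiguous string and cut out the GRAS domain using the Start and End values (81 and 440) from the Signature Domain table. The helper names are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which matches the alignment rows above.

```python
import re

# Sequence block copied from this entry, including spaces and counters.
RAW_BLOCK = """
MLQSLVPQSP ISSNTTSSNP SSSMKSKRVE RDVSAATGED STADDPSNKR PNYSSGDKAA  60
AANEHDETVI EGESTGLRLL GLLLQCAECV AMDNLEDATD LLPEISQLSS PFGSSPERVG  120
AYFAHALQAR VVSSSLRTYS PLDNKSLTLT QSQKIFNALQ SYNSISPLVK FSHFTANQAI  180
FQALSGEDCV HVIDLDIMQG LQWPGLFHIL ASRSKKIRSM RITGFGSSSE LLELTGKRLA  240
DFAASLGLPF EFHPLEGKIG NLTDLSQLGV RSSEAVVVHW MHHCLYDITG SDLATLRLLT  300
LLKPKLITIV EQDLSHGGSF LGRFVEALHY YSALFDALGD GLSVDSLERH TVEQQLFGNE  360
IRNIVAVGGP KRTGEVKVER WGEELRRVGF QTVSLGGNPA AQASLLLGMF PWKGYTLLEE  420
NGCLKLGWKD LSLLTASAWQ PSD
"""


def parse_numbered_sequence(block: str) -> str:
    """Join the residue groups, dropping whitespace and position counters."""
    return "".join(re.findall(r"[A-Za-z]+", block)).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end given 1-based, inclusive coordinates."""
    return seq[start - 1:end]


sequence = parse_numbered_sequence(RAW_BLOCK)
gras_domain = slice_1based(sequence, 81, 440)

print(len(sequence), len(gras_domain))
print(gras_domain[:10], gras_domain[-4:])
```

The parsed length can be checked against the stated length of 443 aa, and the first and last residues of the slice can be compared with the query rows of the domain alignment.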
3D Structure
Structure
PDB ID    E-value    Query Start    Query End    Hit Start    Hit End    Description
5b3g_A    1e-163     85             441          25           380        Protein SCARECROW
Functional Description
Source     Description
UniProt    Probable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein
Source       Hit ID                E-value    Description
Refseq       XP_017626862.1        0.0        PREDICTED: scarecrow-like protein 23
Swissprot    Q9FHZ1                0.0        SCL23_ARATH; Scarecrow-like protein 23
TrEMBL       A0A1U8NRI9            0.0        A0A1U8NRI9_GOSHI; scarecrow-like protein 23
STRING       Gorai.004G063200.1    0.0        (Gossypium raimondii)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
MalvidsOGEM125122631
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT5G41920.1    1e-179     GRAS family protein
Publications
  1. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]
  2. Yoon EK, et al.
    Conservation and Diversification of the SHR-SCR-SCL23 Regulatory Network in the Development of the Functional Endodermis in Arabidopsis Shoots.
    Mol Plant, 2016. 9(8): p. 1197-1209
    [PMID:27353361]