PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CA07g08560
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family GRAS
Protein Properties Length: 440 aa    MW: 48650.4 Da    pI: 5.1957
Description GRAS family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
CA07g08560       genome    PEP       View CDS
Signature Domain
No.    Domain    Score    E-value     Start    End    HMM Start    HMM End
1      GRAS      369.7    3.9e-113    78       436    3            374
        GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshltaNqa 101
                  lLl+cAe+v+ ++le a +lL ++ el+sp g++ +R+aayf+eAL+ar+ +s  + y+ l+ ++ + ++s++ ++al+ ++ +sP++kfsh taNqa
  CA07g08560  78 GLLLQCAECVAMENLEDAGNLLPEIAELSSPFGSSAERVAAYFAEALSARIISSHLRFYSPLNLKALTLTHSQKLFTALQSYNTISPLIKFSHYTANQA 176
                 58************************************************************************************************* PP

        GRAS 102 IleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlvakrledleleeLrvkpgE 200
                 I++a+e e++vHiiD+di+qGlQWp L+q L+sR+++  s++iTgvgs    s e le+tg+rLa+fA+++g+pfef++l  k  +  +l++L vk +E
  CA07g08560 177 IYQALECEDHVHIIDLDIMQGLQWPGLFQILSSRSRKLRSIKITGVGS----SMELLESTGRRLAEFANSFGLPFEFKPLEGKIGHVRNLSQLGVKAEE 271
                 ************************************************....***************************8777777779********** PP

        GRAS 201 alaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvva 299
                 a++Vn+++  h l+d ++s       + +l+  l+Pk+++ veqe++h +++Fl rf+eal+yysalfd+l  +l++es er++vE++l+g ei+n+va
  CA07g08560 272 AIVVNWMH--HCLYDVTGSDLG----TFRLLTLLRPKLITTVEQELSH-GGNFLGRFVEALHYYSALFDALGDGLGEESIERHMVEQQLFGSEIRNIVA 363
                 *******9..888888888888....99********************.789*********************************************** PP

        GRAS 300 cegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                   g +r  +    e+W + ++++GF pv+ls + a+qa+lll +++ +gy++ ee+g+l lgWkd +L+++SaW+
  CA07g08560 364 VGGPKRSGEV-PIERWGDEFKRVGFLPVSLSGTPAAQASLLLGMFP-RGYTLVEENGCLKLGWKDLSLLTASAWQ 436
                 ****977665.6**********************************.***************************6 PP

Protein Features
Database           Entry ID    E-value     Start    End    InterPro ID    Description
PROSITE profile    PS50985     55.502      50       417    IPR005202      Transcription factor GRAS
Pfam               PF03514     1.3E-110    78       436    IPR005202      Transcription factor GRAS
Gene Ontology
GO Term       GO Category           GO Description
GO:0048366    Biological Process    leaf development
GO:0090610    Biological Process    bundle sheath cell fate specification
GO:0005634    Cellular Component    nucleus
Sequence
Protein Sequence    Length: 440 aa
MLYSLQISFI NHTTISSSSS TMSSKRSIVE FTPPATDEDP VFTKRPRHFP SSDREEDEEE  60
GEEVVHVDAD SIGLRLLGLL LQCAECVAME NLEDAGNLLP EIAELSSPFG SSAERVAAYF  120
AEALSARIIS SHLRFYSPLN LKALTLTHSQ KLFTALQSYN TISPLIKFSH YTANQAIYQA  180
LECEDHVHII DLDIMQGLQW PGLFQILSSR SRKLRSIKIT GVGSSMELLE STGRRLAEFA  240
NSFGLPFEFK PLEGKIGHVR NLSQLGVKAE EAIVVNWMHH CLYDVTGSDL GTFRLLTLLR  300
PKLITTVEQE LSHGGNFLGR FVEALHYYSA LFDALGDGLG EESIERHMVE QQLFGSEIRN  360
IVAVGGPKRS GEVPIERWGD EFKRVGFLPV SLSGTPAAQA SLLLGMFPRG YTLVEENGCL  420
KLGWKDLSLL TASAWQPCD*
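The domain coordinates listed in this record can be checked against the sequence block directly. The following is a minimal sketch, not part of PlantTFDB itself: the helper names `clean_sequence` and `subsequence` are hypothetical, and it assumes the Start/End columns are 1-based and inclusive. It strips the spacing, the position counters and the trailing `*` stop marker, then extracts the GRAS domain region (78 to 436).

```python
import re

# Protein sequence block copied from this record, including the
# position counters and the trailing '*' stop marker.
RAW = """
MLYSLQISFI NHTTISSSSS TMSSKRSIVE FTPPATDEDP VFTKRPRHFP SSDREEDEEE  60
GEEVVHVDAD SIGLRLLGLL LQCAECVAME NLEDAGNLLP EIAELSSPFG SSAERVAAYF  120
AEALSARIIS SHLRFYSPLN LKALTLTHSQ KLFTALQSYN TISPLIKFSH YTANQAIYQA  180
LECEDHVHII DLDIMQGLQW PGLFQILSSR SRKLRSIKIT GVGSSMELLE STGRRLAEFA  240
NSFGLPFEFK PLEGKIGHVR NLSQLGVKAE EAIVVNWMHH CLYDVTGSDL GTFRLLTLLR  300
PKLITTVEQE LSHGGNFLGR FVEALHYYSA LFDALGDGLG EESIERHMVE QQLFGSEIRN  360
IVAVGGPKRS GEVPIERWGD EFKRVGFLPV SLSGTPAAQA SLLLGMFPRG YTLVEENGCL  420
KLGWKDLSLL TASAWQPCD*
"""


def clean_sequence(raw: str) -> str:
    """Keep letters only: drops spaces, newlines, position counters and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}-{end} fall outside 1-{len(seq)}")
    return seq[start - 1:end]


SEQ = clean_sequence(RAW)

# GRAS domain coordinates taken from the Signature Domain table (Start 78, End 436).
GRAS_DOMAIN = subsequence(SEQ, 78, 436)

if __name__ == "__main__":
    print(len(SEQ), "residues after removing the stop marker")
    print("GRAS domain:", GRAS_DOMAIN[:10], "...", GRAS_DOMAIN[-10:])
```

The extracted region begins with GLLLQCAECV and ends with SAWQ, which agrees with the first and last query residues of the HMM alignment shown under Signature Domain. The cleaned sequence has 439 residues, so the listed length of 440 appears to count the trailing `*`.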
3D Structure
Structure
PDB ID    Evalue    Query Start    Query End    Hit Start    Hit End    Description
5b3g_A    1e-157    62             437          5            380        Protein SCARECROW
5b3h_A    1e-158    62             437          4            379        Protein SCARECROW
5b3h_D    1e-158    62             437          4            379        Protein SCARECROW
Functional Description
Source     Description
UniProt    Probable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein
Source       Hit ID            E-value    Description
Refseq       XP_016580400.1    0.0        PREDICTED: scarecrow-like protein 23
Swissprot    Q9FHZ1            1e-171     SCL23_ARATH; Scarecrow-like protein 23
TrEMBL       A0A1U8HAM3        0.0        A0A1U8HAM3_CAPAN; Protein SCARECROW
TrEMBL       A0A2G2Z0W9        0.0        A0A2G2Z0W9_CAPAN; scarecrow-like protein 23
STRING       XP_009804953.1    0.0        (Nicotiana sylvestris)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
AsteridsOGEA107452126
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT5G41920.1    1e-157     GRAS family protein
Publications
  1. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]
  2. Yoon EK, et al.
    Conservation and Diversification of the SHR-SCR-SCL23 Regulatory Network in the Development of the Functional Endodermis in Arabidopsis Shoots.
    Mol Plant, 2016. 9(8): p. 1197-1209
    [PMID:27353361]