PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PGSC0003DMP400052780
Common Name LOC102599068
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family GRAS
Protein Properties Length: 753 aa    MW: 85164.8 Da    pI: 5.5308
Description GRAS family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
PGSC0003DMP400052780  genome  PGSC    View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    358    1.4e-109  378    749  1          373
                  GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsP 89 
                           l++lL++cA+ v+++d + a + L+++++++ + gd+ qRla++f+ +L+arla+++++ly+al+p++ +    +e+l+a++++  ++P
  PGSC0003DMP400052780 378 LRTLLVSCAQSVAADDRRTAYEQLKQIRQHCFSIGDAYQRLASVFADGLEARLAGTGTQLYAALAPKKIT---AAEKLKAYQVYLSACP 463
                           6789****************************************************************99...9*************** PP

                  GRAS  90 ilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpf 176
                           + k+s + aN++I   +++++++H iDf+i +G+QWp L+q L++ p+gpp+lRiTg++ p++g   +e+le+tg+rLak++e+++vpf
  PGSC0003DMP400052780 464 FKKISIFFANKMIFHTASNARTLHLIDFGILYGFQWPILIQLLSEIPDGPPKLRITGIDLPQPGfrPAESLEQTGSRLAKYCERFKVPF 552
                           ***************************************************************9************************* PP

                  GRAS 177 efnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyys 265
                           e+n+++++++e+++le+L++ +gE++aVn+ ++ ++llde+v l+s+rd+vL l+++++P+++v +  + +++++ F++rf eal +ys
  PGSC0003DMP400052780 553 EYNAIATQNWENIKLEDLKLVSGETVAVNCLFRFKNLLDETVMLDSPRDAVLGLIRKMNPDIFVQAVINGSYSAPFFVTRFREALFHYS 641
                           *****999********************************************************************************* PP

                  GRAS 266 alfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveee 354
                           +lfd+++a+lpr++++r   E+e+ +re++nv+aceg+er+er et+++W+ r ++aGFk +pl+++ +++ ++ ++  + + +  +e+
  PGSC0003DMP400052780 642 TLFDMFDATLPRDDQQRLHFEQEFYRREAMNVIACEGSERVERPETYKQWQVRNMRAGFKILPLNQQLVQKLRCKVKAGYHRDFVFNED 730
                           ******************************************************************************99999****** PP

                  GRAS 355 sgslvlgWkdrpLvsvSaW 373
                            +++++gWk+r + + S+W
  PGSC0003DMP400052780 731 GKWMLQGWKGRVVCASSCW 749
                           ******************* PP

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   65.334    352    730  IPR005202    Transcription factor GRAS
Pfam             PF03514   4.9E-107  378    749  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0009410  Biological Process  response to xenobiotic stimulus
GO:0045893  Biological Process  positive regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0005829  Cellular Component  cytosol
Sequence
Protein Sequence    Length: 753 aa
MDPRFIPLSD PVNTLEFEDQ INLSSYEGSL NPPHSYNDDY VAFGVPYTAP SVDIGNFPPS  60
SNVSSEVDSP DDHDSDSLFK YLNQILMEEN IEDKPSMFHD PLALKAAEKS LYEALGKSYP  120
PSPYRTPYHV DHQFKSPSPD SIFQTSSDHS TSSSNAHSNS MDPHWIVDPG ESRLPLPVES  180
HPSEYSIQPL MQSNSERSHG SLNNINNLNV HMDSFLNPNA LSNMFTDSES ILQFKRGVEE  240
ANKFLPNVSQ FVVDLDKYTF PPKVEEVTKE AVVKVEKDER NHSPNGTKGR KHQYPEDSDF  300
EDERSNKHSA IYVEEEAELS EMFDRVLLCT DKGETICGDV KSEMPVDNSL DQNGQAHGSN  360
GGKTRAKKQG TKNEAVDLRT LLVSCAQSVA ADDRRTAYEQ LKQIRQHCFS IGDAYQRLAS  420
VFADGLEARL AGTGTQLYAA LAPKKITAAE KLKAYQVYLS ACPFKKISIF FANKMIFHTA  480
SNARTLHLID FGILYGFQWP ILIQLLSEIP DGPPKLRITG IDLPQPGFRP AESLEQTGSR  540
LAKYCERFKV PFEYNAIATQ NWENIKLEDL KLVSGETVAV NCLFRFKNLL DETVMLDSPR  600
DAVLGLIRKM NPDIFVQAVI NGSYSAPFFV TRFREALFHY STLFDMFDAT LPRDDQQRLH  660
FEQEFYRREA MNVIACEGSE RVERPETYKQ WQVRNMRAGF KILPLNQQLV QKLRCKVKAG  720
YHRDFVFNED GKWMLQGWKG RVVCASSCWV PA*
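
The sequence block above is grouped in tens with a running position count at the end of each line, while the domain tables give 1-based inclusive coordinates. A minimal Python sketch of turning such a block into a plain residue string and slicing it by those coordinates follows. The helper names `clean_sequence_block` and `slice_1based` are hypothetical and are not part of PlantTFDB; the demo uses only the last two lines of the block, which begin at residue 661.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one residue string."""
    parts = []
    for line in block.splitlines():
        # Keep letters and the stop symbol only; this drops the spaces
        # between groups and the position count at the end of the line.
        parts.append("".join(re.findall(r"[A-Za-z*]", line)))
    # Remove the trailing stop symbol, if any.
    return "".join(parts).rstrip("*")


def slice_1based(seq: str, start: int, end: int, first_residue: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_residue is the sequence position of seq[0], so an excerpt that
    does not begin at residue 1 can still be sliced with record coordinates.
    """
    return seq[start - first_residue : end - first_residue + 1]


# Last two lines of the sequence block (residues 661 onward).
tail_block = """FEQEFYRREA MNVIACEGSE RVERPETYKQ WQVRNMRAGF KILPLNQQLV QKLRCKVKAG  720
YHRDFVFNED GKWMLQGWKG RVVCASSCWV PA*"""

tail = clean_sequence_block(tail_block)

# Final row of the GRAS alignment: query residues 731 to 749.
domain_tail = slice_1based(tail, 731, 749, first_residue=661)
print(domain_tail)
```

Passing the full block with the default `first_residue=1` would allow the Start and End values from the Signature Domain table to be used directly.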
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3h_A  7e-43   361          751        1          379      Protein SCARECROW
5b3h_D  7e-43   361          751        1          379      Protein SCARECROW
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  PGSC0003DMP400052780
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975445  0.0      HG975445.1 Solanum pennellii chromosome ch06, complete genome.
Annotation -- Protein
Source     Hit ID                E-value  Description
Refseq     XP_006350781.1        0.0      PREDICTED: scarecrow-like protein 14
Swissprot  Q9XE58                0.0      SCL14_ARATH; Scarecrow-like protein 14
TrEMBL     M1CZ55                0.0      M1CZ55_SOLTU; Uncharacterized protein
STRING     PGSC0003DMT400077972  0.0      (Solanum tuberosum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA27624191
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G07530.1  0.0      SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]