PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PGSC0003DMP400047274
Common Name LOC102598749
Organism Solanum tuberosum
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family GRAS
Protein Properties Length: 748aa    MW: 84265.4 Da    PI: 6.1125
Description GRAS family protein
Gene Model
Gene Model ID           Type      Source    Coding Sequence
PGSC0003DMP400047274    genome    PGSC      View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      GRAS      382.2    6e-117     374      744    1            374
                  GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsP 89 
                           l++lL+ cA+av+ g+++ a++lL++++e +sp gd mqRla+yf+ +L+ar+a+s++++ykal +++ s    +++l+a++l+  ++P
  PGSC0003DMP400047274 374 LRTLLTLCAQAVAVGNQRTANELLKQIRESSSPMGDGMQRLAHYFADGLEARMAGSGTHIYKALITRPVS---ATDVLKAYHLLLAACP 459
                           5789***************************************************************999...99************** PP

                  GRAS  90 ilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpf 176
                           + ++s +  N++I++ +e++++vHiiD++i+ G+QWp L+q LasRp+gpp+lRiTg++ p++g   +e++eetg+rLa++Ae+++vpf
  PGSC0003DMP400047274 460 FRTISSFFSNKTIMNLAEKASTVHIIDIGIMWGFQWPGLIQRLASRPGGPPKLRITGIDFPNPGfrPAERVEETGKRLANYAESFKVPF 548
                           ***************************************************************9************************* PP

                  GRAS 177 efnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyys 265
                           efn+ +a+++e+++le+L++++gE+l+Vn+ ++  +llde+v ++s+rd +L+l+++l+P+v++    +  +n++ F  rf eal +ys
  PGSC0003DMP400047274 549 EFNA-IAQKWETVKLEDLKINKGEVLVVNCLYRFRNLLDETVVVNSPRDVFLNLIRRLNPDVFIQGTVNGGYNAPFFISRFREALFHYS 636
                           ****.7*********************************************************************************** PP

                  GRAS 266 alfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveee 354
                           +lfd+le+ +pre +er+ +E+ +lg+e++n++acegaer+er et+++W+ r+ +aGF+++pl+e+++       + ++++ + ++++
  PGSC0003DMP400047274 637 SLFDMLETIIPREVHERMLIEKNILGQEAMNAIACEGAERIERPETYKQWQVRILNAGFRQLPLDEEIMRMTTERFKVYDKN-FIIDND 724
                           **************************************************************************99999955.****** PP

                  GRAS 355 sgslvlgWkdrpLvsvSaWr 374
                           s++l++gWk+r  +++S W+
  PGSC0003DMP400047274 725 SEWLLQGWKGRISFALSTWK 744
                           *******************8 PP
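
The Start and End values reported for the GRAS domain appear to be 1-based, inclusive residue positions in the full protein. As a minimal sketch under that assumption, the snippet below cuts a region out of a sequence excerpt whose first residue position is known. The helper name `slice_region` and its `first_pos` parameter are illustrative and not part of PlantTFDB. The excerpt is residues 361-420 of the protein sequence listed in this record, and the ten residues starting at position 374 should match the start of the query line in the alignment.

```python
def slice_region(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive) from seq.

    first_pos is the position of seq[0] in the full protein, so the
    function also works on an excerpt. Both names are illustrative.
    """
    if start < first_pos or end < start or end - first_pos >= len(seq):
        raise ValueError("requested region lies outside the supplied sequence")
    return seq[start - first_pos : end - first_pos + 1]


# Residues 361-420 of PGSC0003DMP400047274, copied from the protein sequence.
excerpt = "QGKKKGGKRDVVDLRTLLTLCAQAVAVGNQRTANELLKQIRESSSPMGDGMQRLAHYFAD"

domain_start = slice_region(excerpt, 374, 383, first_pos=361)
print(domain_start)  # LRTLLTLCAQ
```

The printed residues agree with the first ten query residues of the alignment (LRTLLTLCAQ), which supports reading the coordinates as 1-based and inclusive.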

Protein Features
Database           Entry ID    E-value     Start    End    InterPro ID    Description
PROSITE profile    PS50985     67.492      348      725    IPR005202      Transcription factor GRAS
Pfam               PF03514     2.1E-114    374      744    IPR005202      Transcription factor GRAS
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 748 aa
MGQYHEAGSA IKLEDEDCSF FPDPNLINNL RVNDYFHEDY DPNLINNLRV SDNLVNRNVD  60
ISSPFQSDFG RNTLVPSTAD DFHEDYDFSD GVLKYINQML MEEDIEEKTC MFQESAALQA  120
AERSFYEVIG EKYPSSANCE KPSSLSQNER YAMDHYSGNG GRDSLLCPNW ILDLGEDDVS  180
HVPDGVALHP TSRSSNSLSG TAPDVPVDSP VSTLRIPDIF SDGESVMQFK KGMEEASKFL  240
PTGNSLLADV RYNVVGKELQ YKERKDAVVK VDKYGEKQYT ERSRGKKNTL HEGIVDLPEG  300
RNYKQSAVFS ESTVRSEMFD RVLLCSGGKN ESALREALQA ISRQNASKNG PSKGANGKKL  360
QGKKKGGKRD VVDLRTLLTL CAQAVAVGNQ RTANELLKQI RESSSPMGDG MQRLAHYFAD  420
GLEARMAGSG THIYKALITR PVSATDVLKA YHLLLAACPF RTISSFFSNK TIMNLAEKAS  480
TVHIIDIGIM WGFQWPGLIQ RLASRPGGPP KLRITGIDFP NPGFRPAERV EETGKRLANY  540
AESFKVPFEF NAIAQKWETV KLEDLKINKG EVLVVNCLYR FRNLLDETVV VNSPRDVFLN  600
LIRRLNPDVF IQGTVNGGYN APFFISRFRE ALFHYSSLFD MLETIIPREV HERMLIEKNI  660
LGQEAMNAIA CEGAERIERP ETYKQWQVRI LNAGFRQLPL DEEIMRMTTE RFKVYDKNFI  720
IDNDSEWLLQ GWKGRISFAL STWKAAY*
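
The sequence above is laid out in groups of ten residues with a running position counter at the end of each full line and a trailing `*` for the stop codon. As a rough sketch of how such a block could be turned into a plain residue string, the function below strips the counters, the group spacing, and the stop marker. The function name `clean_sequence_block` is hypothetical, and the example uses only the first and last lines of the block rather than the whole sequence.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Join a position-numbered, space-grouped protein block into one string."""
    residues = []
    for line in block.splitlines():
        # Drop the trailing running position counter (e.g. "  60"), if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces that separate the groups of ten residues.
        residues.append("".join(line.split()))
    # A trailing "*" marks the stop codon and is not a residue.
    return "".join(residues).rstrip("*")


# First and last lines of the sequence block, used as a short excerpt.
excerpt_block = """\
MGQYHEAGSA IKLEDEDCSF FPDPNLINNL RVNDYFHEDY DPNLINNLRV SDNLVNRNVD  60
IDNDSEWLLQ GWKGRISFAL STWKAAY*"""

cleaned = clean_sequence_block(excerpt_block)
print(len(cleaned))  # 87
```

Applied to the complete block, the same function would give a string suitable for length checks or for submission to a search tool.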
3D Structure
Structure
PDB ID    E-value    Query Start    Query End    Hit Start    Hit End    Description
5b3g_A    1e-53      381            743          26           378        Protein SCARECROW
5b3h_A    9e-54      381            743          25           377        Protein SCARECROW
5b3h_D    9e-54      381            743          25           377        Protein SCARECROW
Functional Description
Source     Description
UniProt    Probable transcription factor involved in plant development. {ECO:0000250}.
Cis-element
Source         Link
PlantRegMap    PGSC0003DMP400047274
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    HG975444    0.0        HG975444.1 Solanum pennellii chromosome ch05, complete genome.
Annotation -- Protein
Source       Hit ID                  E-value    Description
Refseq       XP_006355702.1          0.0        PREDICTED: scarecrow-like protein 14
Refseq       XP_006355703.1          0.0        PREDICTED: scarecrow-like protein 14
Swissprot    Q9XE58                  0.0        SCL14_ARATH; Scarecrow-like protein 14
TrEMBL       M1CLG2                  0.0        M1CLG2_SOLTU; Uncharacterized protein
STRING       PGSC0003DMT400069949    0.0        (Solanum tuberosum)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
AsteridsOGEA27624191
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G07530.1    0.0        SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]