PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PGSC0003DMP400016289
Organism Solanum tuberosum
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family GRAS
Protein Properties Length: 413aa    MW: 46992.8 Da    PI: 6.0973
Description GRAS family protein
Gene Model
Gene Model ID         Type    Source
PGSC0003DMP400016289  genome  PGSC
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GRAS    266.1  1.2e-81  23     408  1          373
                  GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsP 89 
                           l++ Ll  A+ ++++   la + L +l ++++  gd++qR+aayf+ +L arl+   s  y+ +   +t+    +ee+ a++ f++vsP
  PGSC0003DMP400016289  23 LIHSLLISATSIDENTIDLAIENLHELYQNVTLFGDSIQRMAAYFADGLIARLLTRKSPFYDMIMKPPTQ----EEEFLAFTQFYKVSP 107
                           57899999***********************************************999999988766665....688889999****** PP

                  GRAS  90 ilkfshltaNqaIleavege.....ervHiiDf.disqGlQWpaLlqaLasRp..egppslRiTgvgspesgskeeleetgerLakfAe 170
                           +++f+h+taNqaIle +e+e       +H+iDf dis+G+QWp+L+q+L++ +   +  sl+iTg g+      +el+et+ rL +fA+
  PGSC0003DMP400016289 108 FYQFAHFTANQAILEVFEKEleynnGLLHVIDFfDISYGFQWPSLIQSLSENAttLNRISLKITGYGK----TIDELRETETRLVSFAK 192
                           *******************998887889****55****************998545566*********....99*************** PP

                  GRAS 171 el.gvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerfl 258
                            + +++fef+ +      + +l++L  +++E+la+nl+++l +l      + s+++++Lk v +l P +v++veqe     ++F+ rf+
  PGSC0003DMP400016289 193 GFrNLSFEFQGV---LSGSYKLSNLTKRKNETLAINLMFHLNSLS-----TYSKISKTLKEVHDLCPSIVTIVEQEGCKIPQTFMPRFM 273
                           9758*******7...34677889****************999997.....3444568*******************9999********* PP

                  GRAS 259 ealeyysalfdsleaklpreseerikvErellgreivnvva......cegaerrerhetlekWrerleeaGFkpvplsekaakqaklll 341
                           ++l+y++a+fdsl+  lp +s er ++E+  lg+ei+nv+       +++++r  ++e++e+ + r+e+ GF  +pls k+ +qakll+
  PGSC0003DMP400016289 274 DSLHYFAAMFDSLDDCLPIDSIERLSIEKNHLGKEIKNVLNydeggnNNNSSRYDDNEQMETCKGRMESHGFLGIPLSCKNIMQAKLLM 362
                           ****************************************732222245577999*********************************9 PP

                  GRAS 342 rkvksdgyrve..............eesgslvlgWkdrpLvsvSaW 373
                           +  +    +++              e+  ++ l W+dr++v++SaW
  PGSC0003DMP400016289 363 KIRSYSNSTIQidgginggfrvfeiEDPRAISLAWQDRSIVTASAW 408
                           87774446666446666665555655566777************** PP

Protein Features
3D Structure
Database         Entry ID  E-value  Start  End  InterPro ID  Description
PROSITE profile  PS50985   41.844   1      375  IPR005202    Transcription factor GRAS
Pfam             PF03514   4.0E-79  23     408  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 413 aa
MMLKNDVKKK GHIVVEDGKG LHLIHSLLIS ATSIDENTID LAIENLHELY QNVTLFGDSI  60
QRMAAYFADG LIARLLTRKS PFYDMIMKPP TQEEEFLAFT QFYKVSPFYQ FAHFTANQAI  120
LEVFEKELEY NNGLLHVIDF FDISYGFQWP SLIQSLSENA TTLNRISLKI TGYGKTIDEL  180
RETETRLVSF AKGFRNLSFE FQGVLSGSYK LSNLTKRKNE TLAINLMFHL NSLSTYSKIS  240
KTLKEVHDLC PSIVTIVEQE GCKIPQTFMP RFMDSLHYFA AMFDSLDDCL PIDSIERLSI  300
EKNHLGKEIK NVLNYDEGGN NNNSSRYDDN EQMETCKGRM ESHGFLGIPL SCKNIMQAKL  360
LMKIRSYSNS TIQIDGGING GFRVFEIEDP RAISLAWQDR SIVTASAWHC VL*
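For downstream use, the listing above can be reduced to a plain residue string and cut to the GRAS domain coordinates given in the Signature Domain table (Start 23, End 408). The Python sketch below is illustrative only and is not part of PlantTFDB; the helper names clean_sequence and domain_slice are hypothetical, and it assumes the coordinates are 1-based and inclusive.

```python
import re

# Sequence block copied from this entry, including spacing, position
# counters and the trailing stop symbol.
RAW = """
MMLKNDVKKK GHIVVEDGKG LHLIHSLLIS ATSIDENTID LAIENLHELY QNVTLFGDSI  60
QRMAAYFADG LIARLLTRKS PFYDMIMKPP TQEEEFLAFT QFYKVSPFYQ FAHFTANQAI  120
LEVFEKELEY NNGLLHVIDF FDISYGFQWP SLIQSLSENA TTLNRISLKI TGYGKTIDEL  180
RETETRLVSF AKGFRNLSFE FQGVLSGSYK LSNLTKRKNE TLAINLMFHL NSLSTYSKIS  240
KTLKEVHDLC PSIVTIVEQE GCKIPQTFMP RFMDSLHYFA AMFDSLDDCL PIDSIERLSI  300
EKNHLGKEIK NVLNYDEGGN NNNSSRYDDN EQMETCKGRM ESHGFLGIPL SCKNIMQAKL  360
LMKIRSYSNS TIQIDGGING GFRVFEIEDP RAISLAWQDR SIVTASAWHC VL*
"""


def clean_sequence(raw: str) -> str:
    """Keep residue letters only: drops whitespace, counters and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
gras = domain_slice(seq, 23, 408)  # coordinates from the Signature Domain table

print(len(seq), len(gras))
print(gras[:8], gras[-3:])
```

With the sequence as listed, the cleaned string holds 412 residue letters, which suggests that the stated length of 413 counts the terminal `*`. The slice begins LIHSLLIS and ends SAW, matching the first and last query residues in the domain alignment.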
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  3e-48    4            408        4          378      Protein SCARECROW
5b3h_A  3e-48    4            408        3          377      Protein SCARECROW
5b3h_D  3e-48    4            408        3          377      Protein SCARECROW
Cis-element
Source       Link
PlantRegMap  PGSC0003DMP400016289
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AC171734  0.0      AC171734.2 Solanum lycopersicum chromosome 11 clone C11HBa0064J13, complete sequence.
Annotation -- Protein
Source  Hit ID                E-value  Description
Refseq  XP_006351165.1        0.0      PREDICTED: scarecrow-like protein 21
TrEMBL  M1AJ34                0.0      M1AJ34_SOLTU; Uncharacterized protein
STRING  PGSC0003DMT400023877  0.0      (Solanum tuberosum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA123541823
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G04890.1  4e-51    SCARECROW-like 21
Publications
  1. Xu X, et al.
    Genome sequence and analysis of the tuber crop potato.
    Nature, 2011. 475(7355): p. 189-95
    [PMID:21743474]