PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CA04g11230
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family GRAS
Protein Properties Length: 475 aa    MW: 54426.3 Da    pI: 6.7004
Description GRAS family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
CA04g11230     genome  PEP     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GRAS    291.7  2e-89    87     470  1          373
        GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshltaN 99 
                 l++ Ll  A+ +++++  la   L +l +++s +gd++qR+aayf+ +L arl+   s  y+ +   +t+    +ee+ a++ f++vsP+++f+h+taN
  CA04g11230  87 LIHSLLISATSINENSIDLAVDNLRELYQNVSLKGDSVQRVAAYFADGLIARLLTRKSPFYDMIMKPPTQ----EEEFLAFTEFYKVSPFYQFAHFTAN 181
                 57899999***********************************************999999988766665....68889999***************** PP

        GRAS 100 qaIleavege.....ervHiiDfdisqGlQWpaLlqaLasRp..egppslRiTgvgspesgskeeleetgerLakfAeel.gvpfefnvlvakrledle 190
                 qaIle++e+e       +H+iDfdis+G+QWp+L+q+L++ +  ++  slRiTg g+      +el+et+ rL +fA+ + +++fef+ +    l+  +
  CA04g11230 182 QAILESFEKEleynnGLLHVIDFDISYGFQWPSLIQSLSENAttSNRISLRITGYGK----TIDELRETETRLVSFAKCFrNLSFEFQGV----LSRYK 272
                 *********998887889*********************988656677*********....99***************9758*******7....88999 PP

        GRAS 191 leeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErel 289
                 l++L  +++E+laVnl+++l +l      + s+++e+Lk v +l P +v++veqe     ++F+ rf+e+l+y++a+fdsl+  lp +s +r ++E+  
  CA04g11230 273 LSNLTKRKNETLAVNLIFHLNSLS-----TYSKISETLKEVHDLCPSIVTIVEQEGCKIPQTFMPRFMESLHYFAAMFDSLDDCLPIDSIKRLSIEKNH 366
                 9*****************999997.....4444669*******************9999**************************************** PP

        GRAS 290 lgreivnvvacegae.......rrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk.sdgyrve.............eesgslvlgWkdrpL 367
                 lg+ei+n++   ++e       +  r e++e+W+ r+e+ GF  +pls k+ +qaklll+  + ++  +++             e+  ++ l W+dr++
  CA04g11230 367 LGKEIKNMLNYDHKEgannnvySSSRYEQMETWKGRMESHGFLGIPLSCKNIMQAKLLLKIRShFN-STIQldggnggfrvleiEDPRAISLAWQDRSI 464
                 *********7655442223333678**********************************9877444.555544554444444555566677******** PP

        GRAS 368 vsvSaW 373
                 v++SaW
  CA04g11230 465 VTASAW 470
                 ****** PP

Protein Features
3D Structure
Database         Entry ID  E-value  Start  End  InterPro ID  Description
PROSITE profile  PS50985   47.094   61     438  IPR005202    Transcription factor GRAS
Pfam             PF03514   6.9E-87  87     470  IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 475 aa
MGDIVDHEEE FLSLRLAILN NSSCCDQRNI NKIKKRKRRE LEFINTWDLN ESCEGKIFSL  60
LELRESMLKI DVKKKGGVVE DGKGLHLIHS LLISATSINE NSIDLAVDNL RELYQNVSLK  120
GDSVQRVAAY FADGLIARLL TRKSPFYDMI MKPPTQEEEF LAFTEFYKVS PFYQFAHFTA  180
NQAILESFEK ELEYNNGLLH VIDFDISYGF QWPSLIQSLS ENATTSNRIS LRITGYGKTI  240
DELRETETRL VSFAKCFRNL SFEFQGVLSR YKLSNLTKRK NETLAVNLIF HLNSLSTYSK  300
ISETLKEVHD LCPSIVTIVE QEGCKIPQTF MPRFMESLHY FAAMFDSLDD CLPIDSIKRL  360
SIEKNHLGKE IKNMLNYDHK EGANNNVYSS SRYEQMETWK GRMESHGFLG IPLSCKNIMQ  420
AKLLLKIRSH FNSTIQLDGG NGGFRVLEIE DPRAISLAWQ DRSIVTASAW HCVL*
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  3e-50    95           470        27         378      Protein SCARECROW
5b3h_A  3e-50    95           470        26         377      Protein SCARECROW
5b3h_D  3e-50    95           470        26         377      Protein SCARECROW
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    33     38   KKRKRR
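The NLS coordinates and the GRAS domain start can be cross-checked against the protein sequence listed in this record. The following is a minimal sketch, not part of PlantTFDB: the helper names `clean_sequence` and `locate` are hypothetical, and only the first two lines of the sequence block are used. It strips the spacing and position counters from the block and reports where each motif first occurs.

```python
import re

# First two lines of the protein sequence block, copied from this record.
RAW_BLOCK = """
MGDIVDHEEE FLSLRLAILN NSSCCDQRNI NKIKKRKRRE LEFINTWDLN ESCEGKIFSL  60
LELRESMLKI DVKKKGGVVE DGKGLHLIHS LLISATSINE NSIDLAVDNL RELYQNVSLK  120
"""


def clean_sequence(block):
    """Keep residue letters only: drops spaces, position counters, and '*'."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def locate(seq, motif):
    """Return the 0-based inclusive (start, end) of the first match, or (-1, -1)."""
    start = seq.find(motif)
    if start < 0:
        return (-1, -1)
    return (start, start + len(motif) - 1)


seq = clean_sequence(RAW_BLOCK)

# NLS sequence from the NLS table.
nls_span = locate(seq, "KKRKRR")

# First six residues of the GRAS domain hit, taken from the alignment.
domain_span = locate(seq, "LIHSLL")

print("NLS (0-based, inclusive):", nls_span)
print("GRAS hit start (1-based):", domain_span[0] + 1)
```

In this sketch the NLS motif is found at 0-based positions 33 to 38, which matches the NLS table if its coordinates are read as zero-based, while the domain hit begins at 1-based position 87, which matches the Signature Domain table. The apparent difference in coordinate convention is an observation from this record only, not documented behavior of the database.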
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AC171734  0.0      AC171734.2 Solanum lycopersicum chromosome 11 clone C11HBa0064J13, complete sequence.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_016571642.1      0.0      PREDICTED: scarecrow-like protein 21
TrEMBL  A0A1U8GK51          0.0      A0A1U8GK51_CAPAN; scarecrow-like protein 21
STRING  Solyc11g017100.1.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA123541823
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G48150.2  1e-52    GRAS family protein