PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CCG026550.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; Populus
Family GRAS
Protein Properties Length: 463 aa    MW: 52640 Da    pI: 6.9581
Description GRAS family protein
Gene Model
Gene Model ID    Type      Source
CCG026550.1      genome    LZU
Signature Domain
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      GRAS      309.8    6e-95      83       460    1            373
         GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlta 98 
                  l++lLl  A+a ++++   a + L++l + +s +gd++qR++ayf+ +Laarl+ + s  y+ +  ++ts    +ee+ a++ ++ vsP+++++h+ta
  CCG026550.1  83 LIHLLLITATAADENNVGSALENLTELYQSVSFTGDSVQRVVAYFADGLAARLLTKKSPFYDMIMKEPTS----EEEFLAFTDLYRVSPYYQLAHFTA 176
                  689**********************************************************998888875....788889999*************** PP

         GRAS  99 NqaIleavege.....ervHiiDfdisqGlQWpaLlqaLasRp..egppslRiTgvgspesgskeeleetgerLakfAeel.gvpfefnvlvakrled 188
                  NqaIlea e+e     +++H+iDfd+s+G+QWp+L+q+L++++  ++  slR+Tg+g+    s eel+et++rL +fA+ + ++ fef+ l    l  
  CCG026550.1 177 NQAILEAYEKEednnnRALHVIDFDVSYGFQWPSLIQSLSEKAssGNRISLRVTGFGK----SVEELQETESRLVSFAKGFrNLVFEFQGL----LRG 266
                  *********9988887889**********************99745556*********....9****************9758*******8....666 PP

         GRAS 189 leleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvE 286
                   +l +Lr k++E++aVnlv++l +l  +s+++++    +Lk ++sl+P +vv++eqe +++  sFl rf+e+l+y++a+fdsl+  lp es+er ++E
  CCG026550.1 267 SKLINLRKKKNETVAVNLVFHLNTLN-DSLKISD----TLKSIRSLNPSIVVLAEQEGSRSPRSFLSRFMESLHYFAAMFDSLDDFLPLESSERLSIE 359
                  677899****************9995.8888888....************************************************************ PP

         GRAS 287 rellgreivnvvacegae.rrerhetlekWrerleeaGFkpvplsekaakqaklllr........kvksdg.....yrveee.sgslv.lgWkdrpLv 368
                  +  lg+ei++++   ++e    r  ++ekW+ r+e  GF  ++ls+k   qaklll+        +  +dg     ++v e+ +g ++ l W+dr L+
  CCG026550.1 360 KNHLGKEIKSMLNYDKDEaNCPRYDKMEKWKGRMEGHGFAGMKLSSKSLIQAKLLLKirthycplQ--FDGesgggFKVFERdDGKAIsLVWQDRCLI 455
                  *************999996789*********************************98333322221..222333447776543777778********* PP

         GRAS 369 svSaW 373
                  ++SaW
  CCG026550.1 456 TASAW 460
                  ***** PP

Protein Features
Database           Entry ID    E-value    Start    End    InterPro ID    Description
PROSITE profile    PS50985     49.341     57       428    IPR005202      Transcription factor GRAS
Pfam               PF03514     2.1E-92    83       460    IPR005202      Transcription factor GRAS
Sequence
Protein Sequence    Length: 463 aa
MEDKEEEELL NLSLAIVTDS SGGDMRRKRK RSRADHVFNP LMNGYEGCSE AKIYRLLQMR  60
EQMIKLDHKK KAAVEETGKG LHLIHLLLIT ATAADENNVG SALENLTELY QSVSFTGDSV  120
QRVVAYFADG LAARLLTKKS PFYDMIMKEP TSEEEFLAFT DLYRVSPYYQ LAHFTANQAI  180
LEAYEKEEDN NNRALHVIDF DVSYGFQWPS LIQSLSEKAS SGNRISLRVT GFGKSVEELQ  240
ETESRLVSFA KGFRNLVFEF QGLLRGSKLI NLRKKKNETV AVNLVFHLNT LNDSLKISDT  300
LKSIRSLNPS IVVLAEQEGS RSPRSFLSRF MESLHYFAAM FDSLDDFLPL ESSERLSIEK  360
NHLGKEIKSM LNYDKDEANC PRYDKMEKWK GRMEGHGFAG MKLSSKSLIQ AKLLLKIRTH  420
YCPLQFDGES GGGFKVFERD DGKAISLVWQ DRCLITASAW HCV
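The blocked layout above (groups of ten residues, with a running position counter at the end of each full line) can be flattened back into a plain string before further analysis. The following is a minimal Python sketch, not part of PlantTFDB: the helper name `parse_blocked_sequence` is hypothetical, and it assumes that every non-letter character in the block is either spacing or a position counter. The domain slice uses the Start and End values from the Signature Domain table, read as 1-based inclusive positions.

```python
import re

# Protein sequence block copied verbatim from this record.
RAW = """\
MEDKEEEELL NLSLAIVTDS SGGDMRRKRK RSRADHVFNP LMNGYEGCSE AKIYRLLQMR  60
EQMIKLDHKK KAAVEETGKG LHLIHLLLIT ATAADENNVG SALENLTELY QSVSFTGDSV  120
QRVVAYFADG LAARLLTKKS PFYDMIMKEP TSEEEFLAFT DLYRVSPYYQ LAHFTANQAI  180
LEAYEKEEDN NNRALHVIDF DVSYGFQWPS LIQSLSEKAS SGNRISLRVT GFGKSVEELQ  240
ETESRLVSFA KGFRNLVFEF QGLLRGSKLI NLRKKKNETV AVNLVFHLNT LNDSLKISDT  300
LKSIRSLNPS IVVLAEQEGS RSPRSFLSRF MESLHYFAAM FDSLDDFLPL ESSERLSIEK  360
NHLGKEIKSM LNYDKDEANC PRYDKMEKWK GRMEGHGFAG MKLSSKSLIQ AKLLLKIRTH  420
YCPLQFDGES GGGFKVFERD DGKAISLVWQ DRCLITASAW HCV
"""


def parse_blocked_sequence(text: str) -> str:
    """Remove spacing and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


seq = parse_blocked_sequence(RAW)

# Length stated in the record header.
assert len(seq) == 463

# GRAS domain span from the Signature Domain table (1-based, inclusive),
# converted to a Python slice (0-based, end-exclusive).
DOMAIN_START, DOMAIN_END = 83, 460
gras_domain = seq[DOMAIN_START - 1:DOMAIN_END]

print(len(seq), len(gras_domain))  # 463 378
```

The slice begins with `LIHLLL` and ends with `TASAW`, which agrees with the first and last query rows of the alignment in the Signature Domain section.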
3D Structure
Structure
PDB ID    E-value    Query Start    Query End    Hit Start    Hit End    Description
5b3g_A    4e-52      91             460          27           378        Protein SCARECROW
5b3h_A    4e-52      91             460          26           377        Protein SCARECROW
5b3h_D    4e-52      91             460          26           377        Protein SCARECROW
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      25       30     RRKRKR
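As a cross-check on the coordinates in the table above, the sketch below searches the N-terminal part of the protein sequence for the listed motif. It is an illustration only: `find_motif` is a hypothetical helper, and reading Start and End as 0-based inclusive offsets is an inference from this one record, not something taken from PlantTFDB documentation.

```python
# First 60 residues of CCG026550.1, copied from the Sequence section of this record.
N_TERMINUS = "MEDKEEEELLNLSLAIVTDSSGGDMRRKRKRSRADHVFNPLMNGYEGCSEAKIYRLLQMR"


def find_motif(sequence: str, motif: str):
    """Return (start, end) of the first match as 0-based inclusive offsets, or None."""
    start = sequence.find(motif)
    if start == -1:
        return None
    return start, start + len(motif) - 1


span = find_motif(N_TERMINUS, "RRKRKR")
print(span)  # (25, 30)

# The same span expressed as 1-based residue positions.
one_based = (span[0] + 1, span[1] + 1)
print(one_based)  # (26, 31)
```

Under that assumption the motif occupies residues 26 to 31 when counting from 1, so the table values should be shifted by one before being compared with the 1-based coordinates used in the domain and structure tables.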
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    AC213423    0.0        AC213423.1 Populus trichocarpa clone POP042-P09, complete sequence.
Annotation -- Protein
Source    Hit ID                E-value    Description
Refseq    XP_011038159.1        0.0        PREDICTED: scarecrow-like protein 21
TrEMBL    B9MW61                0.0        B9MW61_POPTR; GRAS family protein
STRING    POPTR_0013s11930.1    0.0        (Populus trichocarpa)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Fabids     OGEF62002952
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G04890.1    4e-54      SCARECROW-like 21