PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_A02G0620
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 456aa    MW: 52205.8 Da    PI: 7.5842
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_A02G0620genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS284.82.4e-87824541374
         GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlta 98 
                  l++ Ll +A+ v++ +++ a + L +l  ++s  gd++qR++a+f+ +L ar++   s  y+ +  ++++    ++++ a++ ++ vsP+++f+h+ta
  Gh_A02G0620  82 LIHSLLIIATSVDKYNMNSALENLIQLYPTVSLMGDSVQRVVAHFADGLFARILTPKSPFYDMVMKEPST----EQQFLAFTSLYRVSPYYQFAHFTA 175
                  6899***************************************************999999988777776....677788899*************** PP

         GRAS  99 NqaIleavege......ervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAee.lgvpfefnvlvakrledl 189
                  Nq I ea+e++      +++H+iDf++s+G+QWp+L+q+L++ +++  slR+Tg g+    s eel+et+ rL +fA+  +++ fef+ l  ++ ++l
  Gh_A02G0620 176 NQVIIEAFEKDqetnnnRALHVIDFNVSYGFQWPSLIQSLSQ-SGKRVSLRLTGYGR----SLEELQETEARLVSFAKGfCNLVFEFQGL-LRSSSKL 267
                  ********998667776788********************98.78899*********....9***************97369*******6.6666666 PP

         GRAS 190 eleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvEr 287
                   +++ r k++E++aVnlv++l +++    +     +++Lk v sl+P +v++veqe +    +Fl rf+e+l+y++a+fdsl+  lp+es er ++Er
  Gh_A02G0620 268 IINQ-REKKNETVAVNLVFHLSNFM----E----MSQTLKSVHSLKPSIVILVEQEGNPRVRNFLSRFMESLHYFAAMFDSLDDCLPQESTERLSIER 356
                  4555.7899***************9....3....4569************************************************************ PP

         GRAS 288 ellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdg.............yrveee.sgslv.lgWkdrpLvsv 370
                    lg+ei+ ++  e+ +    +e++++W++++e+ GF+ ++ls+k   qaklll+    +              +rv ++ +g ++ lgW+dr L+++
  Gh_A02G0620 357 NHLGKEIKAMINSENLD----EENKGTWKNMMERHGFGGMKLSSKCLIQAKLLLKVRTHNYcplpcegentnggFRVFQRdEGKALsLGWQDRCLLTA 450
                  ****************9....8999*****************************865433211111111111126654334555555*********** PP

         GRAS 371 SaWr 374
                  SaW+
  Gh_A02G0620 451 SAWQ 454
                  ***6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098546.74956419IPR005202Transcription factor GRAS
PfamPF035148.4E-8582454IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 456 aa     Download sequence    Send to blast
MEDRDEEEEH LNLSLAIVTD PNGENSRKRK RKLLNVCNPL NPSLEGYFEG KIFKLLQVRE  60
EMLKLDHKRS KGLAENGKGL HLIHSLLIIA TSVDKYNMNS ALENLIQLYP TVSLMGDSVQ  120
RVVAHFADGL FARILTPKSP FYDMVMKEPS TEQQFLAFTS LYRVSPYYQF AHFTANQVII  180
EAFEKDQETN NNRALHVIDF NVSYGFQWPS LIQSLSQSGK RVSLRLTGYG RSLEELQETE  240
ARLVSFAKGF CNLVFEFQGL LRSSSKLIIN QREKKNETVA VNLVFHLSNF MEMSQTLKSV  300
HSLKPSIVIL VEQEGNPRVR NFLSRFMESL HYFAAMFDSL DDCLPQESTE RLSIERNHLG  360
KEIKAMINSE NLDEENKGTW KNMMERHGFG GMKLSSKCLI QAKLLLKVRT HNYCPLPCEG  420
ENTNGGFRVF QRDEGKALSL GWQDRCLLTA SAWQCV
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A7e-509045327378Protein SCARECROW
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12631RKRKRK
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016743284.10.0PREDICTED: scarecrow-like protein 23
TrEMBLA0A1U8NZ710.0A0A1U8NZ71_GOSHI; scarecrow-like protein 23
TrEMBLA0A2P5XYH80.0A0A2P5XYH8_GOSBA; Uncharacterized protein
STRINGGorai.005G075100.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM15761916
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G03450.11e-52RGA-like 2