PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_D02G0672
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 456aa    MW: 52204.8 Da    PI: 7.3811
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_D02G0672genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS283.27.5e-87824541374
         GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlta 98 
                  l++ Ll +A+ v++++++ a + L +l  +++  gd++qR++a+f+ +L ar++   s  y+ +  ++t+    ++++ a++ ++ vsP+++f+h+t 
  Gh_D02G0672  82 LIHSLLIIATSVDKNNMNSALENLIQLYPTVTLMGDSVQRVVAHFADGLFARILTPKSPFYDMVMKEPTT----EQQFLAFTSLYRVSPYYQFAHFTV 175
                  6899***************************************************9999**988777776....677788899*************** PP

         GRAS  99 NqaIleavege......ervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAee.lgvpfefnvlvakrledl 189
                  NqaI ea+e++      +++H+iDf++s+G+QWp+L+q+L++ +++  slR+Tg g+    s eel+et+ rL +fA+  +++ fef+ l  ++ ++l
  Gh_D02G0672 176 NQAIIEAFEKDqetnnnRALHVIDFNVSYGFQWPSLIQSLSQ-SGKRVSLRLTGYGR----SLEELQETEARLVSFAKGfCNLVFEFQGL-LRSSSKL 267
                  ********998667776788********************98.78899*********....9***************97369*******6.6666666 PP

         GRAS 190 eleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvEr 287
                   +++ r k++E++aVnlv++l +++    +     +++Lk v sl+P +v++veqe +    +Fl rf+e+l+y++a+fdsl+  lp+es er ++Er
  Gh_D02G0672 268 IINQ-REKKNETVAVNLVFHLSNFM----E----MSQTLKSVHSLKPSIVILVEQEGNPRVRNFLSRFMESLHYVAAMFDSLDDCLPQESTERLSIER 356
                  4555.7899***************9....3....4569************************************************************ PP

         GRAS 288 ellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdg.............yrveee.sgslv.lgWkdrpLvsv 370
                    lg+ei+ ++  e+ +    +e++++W++++e+ GF+ ++ls+k   qaklll+    +              +rv ++ +g ++ lgW+dr L+++
  Gh_D02G0672 357 NHLGKEIKAMINSENLD----EENKGTWKNMMERHGFGGMKLSSKCLIQAKLLLKVRTHNYcplpcegentnggFRVFQRdEGKALsLGWQDRCLLTA 450
                  ****************9....8999*****************************865433211111111111126654334555555*********** PP

         GRAS 371 SaWr 374
                  SaW+
  Gh_D02G0672 451 SAWQ 454
                  ***6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098546.47156419IPR005202Transcription factor GRAS
PfamPF035142.6E-8482454IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 456 aa     Download sequence    Send to blast
MEDKDEEEEH LNLSLAIVTD PNEENSRKRK RKLLNVCNPL NPSHEGYFEG KIFKLLQVRE  60
EMLKLDHKRS KGLAENGKGL HLIHSLLIIA TSVDKNNMNS ALENLIQLYP TVTLMGDSVQ  120
RVVAHFADGL FARILTPKSP FYDMVMKEPT TEQQFLAFTS LYRVSPYYQF AHFTVNQAII  180
EAFEKDQETN NNRALHVIDF NVSYGFQWPS LIQSLSQSGK RVSLRLTGYG RSLEELQETE  240
ARLVSFAKGF CNLVFEFQGL LRSSSKLIIN QREKKNETVA VNLVFHLSNF MEMSQTLKSV  300
HSLKPSIVIL VEQEGNPRVR NFLSRFMESL HYVAAMFDSL DDCLPQESTE RLSIERNHLG  360
KEIKAMINSE NLDEENKGTW KNMMERHGFG GMKLSSKCLI QAKLLLKVRT HNYCPLPCEG  420
ENTNGGFRVF QRDEGKALSL GWQDRCLLTA SAWQCV
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A2e-509045327378Protein SCARECROW
5b3h_A2e-509045326377Protein SCARECROW
5b3h_D2e-509045326377Protein SCARECROW
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
12631RKRKRK
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016691031.10.0PREDICTED: scarecrow-like protein 21
TrEMBLA0A1U8JL750.0A0A1U8JL75_GOSHI; scarecrow-like protein 21
STRINGGorai.005G075100.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM15761916
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G03450.16e-52RGA-like 2