PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gorai.007G221900.1
Common Name B456_007G221900, LOC105804008
Organism Gossypium raimondii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 406 aa    MW: 45108.7 Da    pI: 5.0263
Description GRAS family protein
Gene Model
Gene Model ID         Type     Source   Coding Sequence
Gorai.007G221900.1    genome   JGI      View CDS
Signature Domain
No.   Domain   Score   E-value   Start   End   HMM Start   HMM End
1     GRAS     314     3.2e-96   2       402   2           373
                GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalkl.....fsev 87 
                         ++lL++cA+a++s+d++laq++L+ l+++a pdgd++qRl+  f++AL  r a+  s+++k l++ ++++ n s  ++ +++     f+++
  Gorai.007G221900.1   2 EQLLVHCANAIESNDATLAQQILWVLNNIAPPDGDSNQRLTCAFLRALIVRAAK--SGTCKMLAAMANAHCNLSIDIHTFSVielasFVDL 90 
                         79****************************************************..8899999888888888666666665556688**** PP

                GRAS  88 sPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg....skeeleetgerLakfAeelgv 174
                         +P+++f++ +aN aIleaveg + +Hi+D+++++++Q p+L++a+asR egpp +++T+ g ++++       ++ee+g++L +fA++ +v
  Gorai.007G221900.1  91 TPWHRFGFTAANAAILEAVEGYSVIHIVDLSLTHCMQIPTLIDAIASRLEGPPLVKLTVAGGATEDvppmLDLSYEELGSKLINFARSRNV 181
                         *************************************************************99988998888899**************** PP

                GRAS 175 pfefnvl...vakrledleleeLrvkp......gEalaVnlvlqlhrll................desvsleserdevLklvkslsPkvvv 240
                          +ef+++   +a+ +++l +e+Lrv++      gEal++n++++lh+l+                 e+ s++s r  +Lk++++l P+vvv
  Gorai.007G221900.1 182 VLEFRAIpstYADGFSSL-IEQLRVQHlvyaesGEALVINCHMMLHYLPeetlpplsnvnsnpysFEPSSTQSLRAMFLKALRGLDPTVVV 271
                         *******99844455555.55555555555559*******************************988888889999*************** PP

                GRAS 241 vveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplse 331
                         +v+++ad +s++++ r+  a++y +  +d++++ lp+ s++r+++E+  + ++i+nv+a+eg +r+er e +++W +r+++aGF+ v+++e
  Gorai.007G221900.1 272 LVDEDADFTSNNLVCRLRAAFNYLWIPYDTVDTFLPQGSKQRQWYEAD-ICWKIENVIAHEGLQRVERLEPKSRWVQRMRNAGFRGVSFGE 361
                         ************************************************.****************************************** PP

                GRAS 332 kaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaW 373
                         +a +++k++l +++  g+ +++e++ lvl+Wk++++v+++aW
  Gorai.007G221900.1 362 EAISEVKTMLDEHA-AGWGLKKEEDDLVLTWKGHNVVFATAW 402
                         **************.8999*********************** PP
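
The Start/End and HMM Start/HMM End columns in the domain table are read here as 1-based, inclusive coordinates. That is the usual convention for HMMER-style domain tables, but it is an assumption and not something this record states. Under that assumption, a minimal Python sketch for cutting a region out of a protein sequence could look like the following; the function name `slice_domain` is hypothetical and not part of any PlantTFDB tool.

```python
def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based and inclusive.

    Assumption: the coordinates follow the 1-based inclusive convention;
    the record itself does not say so explicitly.
    """
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(
            f"coordinates {start}-{end} fall outside 1-{len(sequence)}"
        )
    # Python slices are 0-based and end-exclusive, so only the start shifts by one.
    return sequence[start - 1:end]


# First ten residues of the protein sequence listed in this record.
n_terminus = "MEQLLVHCAN"
print(slice_domain(n_terminus, 2, 10))  # EQLLVHCAN
```

The printed fragment begins with the same residues as the query row of the alignment, which starts at position 2.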

Protein Features
Database          Entry ID   E-value   Start   End   InterPro ID   Description
PROSITE profile   PS50985    47.138    1       384   IPR005202     Transcription factor GRAS
Pfam              PF03514    1.1E-93   2       402   IPR005202     Transcription factor GRAS
Gene Ontology
GO Term      GO Category          GO Description
GO:0006355   Biological Process   regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 406 aa
MEQLLVHCAN AIESNDATLA QQILWVLNNI APPDGDSNQR LTCAFLRALI VRAAKSGTCK  60
MLAAMANAHC NLSIDIHTFS VIELASFVDL TPWHRFGFTA ANAAILEAVE GYSVIHIVDL  120
SLTHCMQIPT LIDAIASRLE GPPLVKLTVA GGATEDVPPM LDLSYEELGS KLINFARSRN  180
VVLEFRAIPS TYADGFSSLI EQLRVQHLVY AESGEALVIN CHMMLHYLPE ETLPPLSNVN  240
SNPYSFEPSS TQSLRAMFLK ALRGLDPTVV VLVDEDADFT SNNLVCRLRA AFNYLWIPYD  300
TVDTFLPQGS KQRQWYEADI CWKIENVIAH EGLQRVERLE PKSRWVQRMR NAGFRGVSFG  360
EEAISEVKTM LDEHAAGWGL KKEEDDLVLT WKGHNVVFAT AWLPA*
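
The sequence block is laid out in groups of ten residues with a running position counter at the end of each full line and a `*` after the last residue. A minimal Python sketch for turning such a block into a plain residue string, and for estimating a molecular weight from it, might look like this. The helper names `clean_sequence` and `approximate_mass` are hypothetical. The mass table uses standard average residue masses, and the method PlantTFDB used for its reported MW is not stated in this record, so the estimate is not guaranteed to reproduce that figure.

```python
import re

# Standard average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.01528  # one water per chain for the free termini


def clean_sequence(block: str) -> str:
    """Drop spaces, newlines, position counters and the trailing '*' marker."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def approximate_mass(sequence: str) -> float:
    """Sum average residue masses plus one water.

    Raises KeyError on letters outside the 20 standard amino acids.
    """
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in sequence) + WATER_MASS


# Last line of the sequence block in this record.
last_line = "EEAISEVKTM LDEHAAGWGL KKEEDDLVLT WKGHNVVFAT AWLPA*"
residues = clean_sequence(last_line)
print(len(residues))  # 45
print(round(approximate_mass(residues), 1))
```

Passing the whole block (all seven lines) to `clean_sequence` gives the full-length protein in the same way.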
3D Structure
Structure
PDB ID   E-value   Query Start   Query End   Hit Start   Hit End   Description
5b3g_B   2e-45     2             404         88          474       Protein SHORT-ROOT
5b3h_B   7e-46     2             404         34          420       Protein SHORT-ROOT
5b3h_E   7e-46     2             404         34          420       Protein SHORT-ROOT
Expression -- Description
Source    Description
UniProt   TISSUE SPECIFICITY: Expressed in seedlings, leaves and flowers. {ECO:0000269|PubMed:18500650}.
Functional Description
Source    Description
UniProt   Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source        Upstream Regulator   Target Gene
PlantRegMap   Retrieve             -
Annotation -- Nucleotide
Source    Hit ID     E-value   Description
GenBank   GQ393538   1e-100    GQ393538.1 Gossypium hirsutum cultivar Deltapine 33 B clone MONCS0503 SSR marker CGR5533 genomic sequence.
Annotation -- Protein
Source      Hit ID               E-value   Description
Refseq      XP_012491940.1       0.0       PREDICTED: scarecrow-like protein 32 isoform X2
Swissprot   Q9SN22               1e-179    SCL32_ARATH; Scarecrow-like protein 32
TrEMBL      A0A0D2TJJ7           0.0       A0A0D2TJJ7_GOSRA; Uncharacterized protein
STRING      Gorai.007G221900.1   0.0       (Gossypium raimondii)
Orthologous Group
Lineage   Orthologous Group ID   Taxa Number   Gene Number
Malvids   OGEM5445               27            50
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT3G49950.1   0.0       GRAS family protein
Publications
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]