PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_30654_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 758 aa    MW: 85549.7 Da    pI: 4.9958
Description GRAS family protein
Gene Model
Gene Model ID               Type    Source
Cotton_A_30654_BGI-A2_v1.0  genome  BGI
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    356.1  5.3e-109  381    755  1          373
                        GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalkl 83 
                                 l++lL+ cA+avs++d + a++lL++++e++sp gd++qRla  f+ +L+arl +s+  +    ++ +++ ++ ++ l+a+k+
  Cotton_A_30654_BGI-A2_v1.0 381 LRTLLILCAQAVSADDRRTASELLKQIKEHSSPLGDANQRLAYIFADGLEARLDGSGALIHVFYASLASKMTTAADILKAYKA 463
                                 5789***************************************************777776666777777777********** PP

                        GRAS  84 fsevsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetger 164
                                 +   +P+ k++ l aN+ I+  +e+ + +Hi+Df+i +G+QWp L+q L++Rp+gpp+lRiTg++ p+ g   +e++eetg+r
  Cotton_A_30654_BGI-A2_v1.0 464 YLCSCPFTKLAILFANKSIYHMAEKTSVLHIVDFGILYGFQWPILIQHLSTRPGGPPKLRITGIEIPQRGfrPAERIEETGRR 546
                                 ********************************************************************99************* PP

                        GRAS 165 LakfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqead 247
                                 Lak++e+++vpfe+n+++++++e++++e+++++++E+laVn+ ++ h+llde+++++ +r+++Lkl+++++P+++v++  +  
  Cotton_A_30654_BGI-A2_v1.0 547 LAKYCERFNVPFEYNPIAVEHWETIQIEDIKIDSNEMLAVNSLFRFHNLLDETADVDCPRNAMLKLIRKMKPDIFVHSIVNGA 629
                                 *********************************************************************************** PP

                        GRAS 248 hnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvpls 330
                                 +n++ F++rf e l + sa+fd +e++lpre+ +r + Ere+ gre++nv+aceg++r++r et+++W+ r  + GFkp+pl+
  Cotton_A_30654_BGI-A2_v1.0 630 YNAPFFVTRFKEVLFHISAVFDVFENTLPREEPARLMFEREFYGREAMNVIACEGSARVQRPETYKQWQIRTLREGFKPLPLD 712
                                 *********************************************************************************** PP

                        GRAS 331 ekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaW 373
                                 ++ +k ++  l+  + + + ++e+++++++gWk+r L+  S+W
  Cotton_A_30654_BGI-A2_v1.0 713 QELMKIIRDKLKAWYHKDFVIDEDNHWMLQGWKGRILYGSSCW 755
                                 ***************888************************* PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   64.834    355    736  IPR005202    Transcription factor GRAS
Pfam             PF03514   1.8E-106  381    755  IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 758 aa
MDPNSIEISD YLNCFKVEDH TFHNGFEFNV PSPDLNFMNM NVPFIPLDSD PGINVPSITA  60
SSDGSPFSAS TGWSPLGESY SPPSDSDSTD PVLKYISQML MEENMEDKPY MFNDYLALED  120
TEKSLYDALV SNIIQPVKVE SPDCNLFGTN GHSDASISSR SGTSDHINPR GIGEVGGPDP  180
SLLRAPYSLQ PDLQQSSSQF SVDSVNSLSN IGNGLMESSV SELLVKNIFS DKESVLQFQR  240
GFEEASKFIP SSEQLVIDLE SSTFAVGKKV DVPKVVVKVE KDEREISSNG LTGRKNHERD  300
DWELEDERSN KQSATYTEES DLSEVFDKVL LCTEGKTMCG IDQTVQHGET DSSQHKEQLD  360
GSIVGRNRSK RRGKKKEVVD LRTLLILCAQ AVSADDRRTA SELLKQIKEH SSPLGDANQR  420
LAYIFADGLE ARLDGSGALI HVFYASLASK MTTAADILKA YKAYLCSCPF TKLAILFANK  480
SIYHMAEKTS VLHIVDFGIL YGFQWPILIQ HLSTRPGGPP KLRITGIEIP QRGFRPAERI  540
EETGRRLAKY CERFNVPFEY NPIAVEHWET IQIEDIKIDS NEMLAVNSLF RFHNLLDETA  600
DVDCPRNAML KLIRKMKPDI FVHSIVNGAY NAPFFVTRFK EVLFHISAVF DVFENTLPRE  660
EPARLMFERE FYGREAMNVI ACEGSARVQR PETYKQWQIR TLREGFKPLP LDQELMKIIR  720
DKLKAWYHKD FVIDEDNHWM LQGWKGRILY GSSCWVPA
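
The formatted block above mixes residues with spaces and running position counts, so it has to be cleaned before positions such as the GRAS domain boundaries (381 to 755 in the Signature Domain alignment) can be looked up. The following is a minimal Python sketch of that cleanup, not part of PlantTFDB itself. The helper names `parse_sequence` and `region` are hypothetical, and the sketch assumes that reported coordinates are 1-based and inclusive.

```python
import re

# Sequence block copied from the record, including the spaces between
# 10-residue groups and the running position counts at the line ends.
RAW_BLOCK = """
MDPNSIEISD YLNCFKVEDH TFHNGFEFNV PSPDLNFMNM NVPFIPLDSD PGINVPSITA  60
SSDGSPFSAS TGWSPLGESY SPPSDSDSTD PVLKYISQML MEENMEDKPY MFNDYLALED  120
TEKSLYDALV SNIIQPVKVE SPDCNLFGTN GHSDASISSR SGTSDHINPR GIGEVGGPDP  180
SLLRAPYSLQ PDLQQSSSQF SVDSVNSLSN IGNGLMESSV SELLVKNIFS DKESVLQFQR  240
GFEEASKFIP SSEQLVIDLE SSTFAVGKKV DVPKVVVKVE KDEREISSNG LTGRKNHERD  300
DWELEDERSN KQSATYTEES DLSEVFDKVL LCTEGKTMCG IDQTVQHGET DSSQHKEQLD  360
GSIVGRNRSK RRGKKKEVVD LRTLLILCAQ AVSADDRRTA SELLKQIKEH SSPLGDANQR  420
LAYIFADGLE ARLDGSGALI HVFYASLASK MTTAADILKA YKAYLCSCPF TKLAILFANK  480
SIYHMAEKTS VLHIVDFGIL YGFQWPILIQ HLSTRPGGPP KLRITGIEIP QRGFRPAERI  540
EETGRRLAKY CERFNVPFEY NPIAVEHWET IQIEDIKIDS NEMLAVNSLF RFHNLLDETA  600
DVDCPRNAML KLIRKMKPDI FVHSIVNGAY NAPFFVTRFK EVLFHISAVF DVFENTLPRE  660
EPARLMFERE FYGREAMNVI ACEGSARVQR PETYKQWQIR TLREGFKPLP LDQELMKIIR  720
DKLKAWYHKD FVIDEDNHWM LQGWKGRILY GSSCWVPA
"""


def parse_sequence(block: str) -> str:
    """Keep only uppercase residue letters; drop spaces, digits, newlines."""
    return re.sub(r"[^A-Z]", "", block)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


sequence = parse_sequence(RAW_BLOCK)
gras_domain = region(sequence, 381, 755)  # boundaries from the GRAS alignment

print(len(sequence))     # 758
print(len(gras_domain))  # 375
print(gras_domain[:10])  # LRTLLILCAQ
```

The extracted region begins and ends with the same residues as the query line of the GRAS alignment, which is a quick way to confirm that the coordinate convention assumed here matches the record.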
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5b3h_A  4e-45    370          757        7          379      Protein SCARECROW
5b3h_D  4e-45    370          757        7          379      Protein SCARECROW
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    365    370  RNRSKR
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_017639189.1      0.0      PREDICTED: scarecrow-like protein 33
Swissprot  Q9XE58              0.0      SCL14_ARATH; Scarecrow-like protein 14
TrEMBL     A0A2P5W8M7          0.0      A0A2P5W8M7_GOSBA; Uncharacterized protein
STRING     Gorai.003G130500.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM35827189
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G07530.1  0.0      SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]