PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_34382_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 767 aa    MW: 85500.8 Da    pI: 5.9544
Description GRAS family protein
Gene Model
Gene Model ID                 Type     Source   Coding Sequence
Cotton_A_34382_BGI-A2_v1.0    genome   BGI      View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    381.6  9.2e-117  390    760  1          373
                        GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalkl 83 
                                 l++lL+ cA+avs +d + a++l++++++++sp+gd  qRla++f+ AL+arla++++++y++l++++ts    ++ l+a+++
  Cotton_A_34382_BGI-A2_v1.0 390 LRTLLILCAQAVSGDDGATAKELIKQIRQHSSPTGDGSQRLAQCFVDALEARLAGTGTHIYSSLAVKRTS---AADMLKAYQV 469
                                 5789***************************************************************999...9********* PP

                        GRAS  84 fsevsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetger 164
                                 +  ++P++k++ + aN++I + +e+++++H+iDf+i +G+QWpaL++ La+Rp+gpp+lRiTg++ p++g   +e+++etg+r
  Cotton_A_34382_BGI-A2_v1.0 470 YLSACPFMKMAIFFANNTIFKVAEKATTLHVIDFGIFYGFQWPALIHCLANRPGGPPKLRITGIEFPRPGfrPAEAVQETGHR 552
                                 *********************************************************************9*9*********** PP

                        GRAS 165 LakfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqead 247
                                 La+++e+ +vpfefn+ va+++e++++e+L+++++E++aVn+ ++ ++llde+v l+s+rd vL+l+++++P+++v++  + +
  Cotton_A_34382_BGI-A2_v1.0 553 LARYCERYNVPFEFNA-VAQKWETIQTEDLKINSNEVIAVNCLFRFKNLLDETVVLNSPRDIVLNLIRKINPDIFVHSIVNGS 634
                                 ****************.7***************************************************************** PP

                        GRAS 248 hnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvpls 330
                                 +n++ F++rf eal ++salfd+ e+++++e++ r+++E+++ ++ei+n+vaceg+er+er e +++W+ r  +aGF+++pl+
  Cotton_A_34382_BGI-A2_v1.0 635 YNAPFFVTRFREALFHFSALFDMSETNISQEDNLRTMLEQKFYRQEIMNIVACEGTERVERPEAYKQWQVRSVRAGFTQLPLD 717
                                 *********************************************************************************** PP

                        GRAS 331 ekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaW 373
                                  + +k+++  +++++   ++v+ +  ++++gWk+r +++ SaW
  Cotton_A_34382_BGI-A2_v1.0 718 PELMKKVRGKVKECYHSDFMVDVDGRWMLQGWKGRIIYASSAW 760
                                 ***************777************************* PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   66.713    364    741  IPR005202    Transcription factor GRAS
Pfam             PF03514   3.2E-114  390    760  IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 767 aa
MGSEFNSING FKFHNGFSMP YSNGCPKSDN SNGVRSNDPS LDLSSVGAPF LPSLGLNNSS  60
TYASFFSTDK EGDSSSPSDD GDFSDTVLKY ISQVLLEEDM EEKPCMFHDS LALQAAEKSL  120
YEVLGESYPP RNRAPLCSGH SVESSPDDCS FRTSGDHSTY TGSSSNTSKS IDSRWNGDFG  180
ENNDKPSLFE ASVPDNFVFQ SSVNSFSQSS ARFQKGTASN GKGLVGSISN ELAIPNYFSE  240
SELALHFKKG VEEASKFLPK GNQLTIDFTS NAWTAELNQK APVTVVEMES DWKEYSPYRL  300
TGKKNHDRED EDFEEGRNNK QSAVSGDESE LSDMFDKVLI CAGRNEKSPA CGADETPRNG  360
PSKLRPKEQT NESGKARGKK QGKKKEVVDL RTLLILCAQA VSGDDGATAK ELIKQIRQHS  420
SPTGDGSQRL AQCFVDALEA RLAGTGTHIY SSLAVKRTSA ADMLKAYQVY LSACPFMKMA  480
IFFANNTIFK VAEKATTLHV IDFGIFYGFQ WPALIHCLAN RPGGPPKLRI TGIEFPRPGF  540
RPAEAVQETG HRLARYCERY NVPFEFNAVA QKWETIQTED LKINSNEVIA VNCLFRFKNL  600
LDETVVLNSP RDIVLNLIRK INPDIFVHSI VNGSYNAPFF VTRFREALFH FSALFDMSET  660
NISQEDNLRT MLEQKFYRQE IMNIVACEGT ERVERPEAYK QWQVRSVRAG FTQLPLDPEL  720
MKKVRGKVKE CYHSDFMVDV DGRWMLQGWK GRIIYASSAW VPASYPV
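
The numbered layout above (ten-residue groups with a running position counter at the end of each full line) can be converted back into a plain sequence string with a few lines of Python. The sketch below is illustrative only: the function names parse_numbered_sequence and slice_domain are hypothetical and not part of PlantTFDB. It assumes that coordinates such as the GRAS domain's Start 390 and End 760 are 1-based and inclusive, which is consistent with the alignment shown in the Signature Domain section.

```python
def parse_numbered_sequence(block: str) -> str:
    """Join space-grouped residues into one string, dropping position counters."""
    residues = []
    for line in block.splitlines():
        # Residue groups are alphabetic; the counter at the end of a line is numeric.
        residues.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(residues)


def slice_domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating coordinates as 1-based and inclusive."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


if __name__ == "__main__":
    # Toy input: only the first line of the record, not the full protein.
    sample = "MGSEFNSING FKFHNGFSMP YSNGCPKSDN SNGVRSNDPS LDLSSVGAPF LPSLGLNNSS  60\n"
    seq = parse_numbered_sequence(sample)
    print(len(seq))
    print(slice_domain(seq, 1, 10))
```

Applied to the full 13-line block, parse_numbered_sequence should return a string whose length can be checked against the stated 767 aa, and slice_domain(seq, 390, 760) would then give the region reported for the GRAS domain.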
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  3e-53   379          764        8          382      Protein SCARECROW
5b3h_A  3e-53   379          764        7          381      Protein SCARECROW
5b3h_D  3e-53   379          764        7          381      Protein SCARECROW
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_017632253.1      0.0      PREDICTED: scarecrow-like protein 33
Swissprot  Q9XE58              0.0      SCL14_ARATH; Scarecrow-like protein 14
TrEMBL     A0A1U8MQ12          0.0      A0A1U8MQ12_GOSHI; scarecrow-like protein 33
STRING     Gorai.001G111500.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM358               27           189
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G07530.1  0.0      SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]