PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_12754_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 554aa    MW: 61822.1 Da    PI: 4.8988
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_12754_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS446.91.4e-1361865543374
                        GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfs 85 
                                 + L+ecA ++++g+ e a+a++++l++l+s +gdp qR+aay++e+Laar+a s++ lykal+++e +   ss++laa+++++
  Cotton_A_12754_BGI-A2_v1.0 186 QMLIECAAVLAEGNIEGASAIINELRQLVSVHGDPPQRIAAYMVEGLAARVAASGKYLYKALRCKEPP---SSDRLAAMQILF 265
                                 79*****************************************************************9...9*********** PP

                        GRAS  86 evsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLa 166
                                 ev+P++kf++++aN aI ea++ e+rvHiiDfdi+qG Q+++L+q++a+ p++pp+lR+Tgv++p+s+   +  le +g rL+
  Cotton_A_12754_BGI-A2_v1.0 266 EVCPCFKFGFMAANGAIIEAFKDEKRVHIIDFDINQGSQYITLIQSIAKLPGKPPHLRLTGVDDPDSVqrLNGGLEIIGLRLE 348
                                 ******************************************************************99777889********* PP

                        GRAS 167 kfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhn 249
                                 k+Ae lgv+fef++ va+r++ +++++L+++pgEal+Vn+++qlh+++desvs+ ++rd++L++vksl+Pk+v+vveq++++n
  Cotton_A_12754_BGI-A2_v1.0 349 KLAEVLGVSFEFQA-VASRTSIVTPSMLDCRPGEALVVNFAFQLHHMPDESVSTINQRDQLLRMVKSLNPKLVTVVEQDVNTN 430
                                 **************.7******************************************************************* PP

                        GRAS 250 sesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsek 332
                                 +++F+ rf+ea +yys++f+sl+++lpres++r++vEr++l+r+ivn+vaceg+er+er e ++kWr+r+++aGF++ p+s +
  Cotton_A_12754_BGI-A2_v1.0 431 TSPFFPRFIEAYSYYSTVFESLDVTLPRESQDRMNVERQCLARDIVNIVACEGEERIERYEVAGKWRARMTMAGFTSCPMSPN 513
                                 *********************************************************************************** PP

                        GRAS 333 aakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                                 + + ++ l+r++ ++ y+++e+ g+l +gW++++L+++SaWr
  Cotton_A_12754_BGI-A2_v1.0 514 VIDMIRKLIREYCDR-YKLKEDLGALHFGWEGKSLIVASAWR 554
                                 *************66.*************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098565.99158535IPR005202Transcription factor GRAS
PfamPF035144.8E-134186554IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 554 aa     Download sequence    Send to blast
MSLVTSTEPA ITACRNTKLY SIQGSGDSSG LSTQMFGSDK HKPRCITDSY SCESYEKSFV  60
GSPSEELMHP SRSDVAESSI RQQNVSSYQP RDYLEVQSAD TLDHDTDKMK LMLQELERDL  120
LGDNDVDVGD MFGTDLNMEI DGEWSDPVRT EPIHESPKES SSSESNLSSI SSNKEASHFS  180
SRTPKQMLIE CAAVLAEGNI EGASAIINEL RQLVSVHGDP PQRIAAYMVE GLAARVAASG  240
KYLYKALRCK EPPSSDRLAA MQILFEVCPC FKFGFMAANG AIIEAFKDEK RVHIIDFDIN  300
QGSQYITLIQ SIAKLPGKPP HLRLTGVDDP DSVQRLNGGL EIIGLRLEKL AEVLGVSFEF  360
QAVASRTSIV TPSMLDCRPG EALVVNFAFQ LHHMPDESVS TINQRDQLLR MVKSLNPKLV  420
TVVEQDVNTN TSPFFPRFIE AYSYYSTVFE SLDVTLPRES QDRMNVERQC LARDIVNIVA  480
CEGEERIERY EVAGKWRARM TMAGFTSCPM SPNVIDMIRK LIREYCDRYK LKEDLGALHF  540
GWEGKSLIVA SAWR
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3h_A5e-6319055324377Protein SCARECROW
5b3h_D5e-6319055324377Protein SCARECROW
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017642061.10.0PREDICTED: scarecrow-like protein 1
SwissprotQ9SDQ30.0SCL1_ARATH; Scarecrow-like protein 1
TrEMBLA0A1U8IIM10.0A0A1U8IIM1_GOSHI; scarecrow-like protein 1
STRINGGorai.009G251800.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM69532744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21450.10.0SCARECROW-like 1
Publications ? help Back to Top
  1. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]