PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_02423_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 625aa    MW: 67468.9 Da    PI: 6.3922
Description GRAS family protein
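The length and molecular weight listed under Protein Properties can be re-derived from the protein sequence. The sketch below shows one common approach: summing average residue masses and adding one water molecule. This is an assumption about the method; the record does not say how the listed 67468.9 Da was computed, so small differences are possible. The name `average_mw` and the mass table are illustrative and not part of the database.

```python
# Average residue masses in daltons (free amino acid minus one water).
# Assumption: standard average-isotope values; the database's own table is not stated.
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528  # one water for the free N- and C-termini


def average_mw(sequence: str) -> float:
    """Estimate the average molecular weight of an unmodified peptide chain."""
    seq = sequence.upper()
    unknown = set(seq) - set(RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER
```

Calling `average_mw` on the full 625-residue sequence from the Sequence section gives an estimate that can be compared with the listed MW.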
Gene Model
Gene Model ID               Type    Source  Coding Sequence
Cotton_A_02423_BGI-A2_v1.0  genome  BGI     View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    359.7  4.2e-110  257    625  2          374
                        GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklf 84 
                                 ++++l++A a+s+g++  ++++L+rl + a+++g++ qRl  +++ AL++r+++    + +  p+ e     s+e++ +++l+
  Cotton_A_02423_BGI-A2_v1.0 257 KQTILDAAAAISEGKNDVVNEILTRLAQAANSKGSSEQRLMECMLSALKSRVNS----VENPPPVAEL---FSKEHAVSTQLL 332
                                 57899************************************************9....4443344433...49999******* PP

                        GRAS  85 sevsPilkfshltaNqaIleavege...ervHiiDfdisqGlQWpaLlqaLasRpegpp.slRiTgvgspesgskeeleetge 163
                                 +++sP++k+++l+aNqaIl+a+  +   ++ H++Dfd + G+Q+++Ll+aL++R +g+p +++iT++++  +g  e+l+++g+
  Cotton_A_02423_BGI-A2_v1.0 333 YDLSPCFKLGFLAANQAILDATLDKpscNKFHVVDFDFGSGGQYMNLLHALSERGNGKPaTVKITAIAD--NGGDERLKTVGD 413
                                 *********************9988778999***********************977666*********..77********** PP

                        GRAS 164 rLakfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqea 246
                                 rL++fAe+ gv+++fnv+   +l+dl+ ++L ++++E laVn++++l+r++desvs+e++rde+L+ vk+l P+vv++veqe+
  Cotton_A_02423_BGI-A2_v1.0 414 RLSQFAESYGVSLKFNVISGLKLSDLSRDSLGIEQDEPLAVNFAFKLYRMPDESVSVENPRDELLRRVKGLAPRVVTLVEQEM 496
                                 *****************9888************************************************************** PP

                        GRAS 247 dhnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvpl 329
                                 ++n+++F+ r+ ea+ yy alfds+e+++ r++ er+k+E+  l r+i n vaceg++r+er+e ++kWr+r+++aGF+  pl
  Cotton_A_02423_BGI-A2_v1.0 497 NTNTAPFALRVGEACGYYGALFDSVESTVLRDNPERAKLEEG-LLRKIANSVACEGRDRVERCEVFGKWRARMSMAGFELKPL 578
                                 ******************************************.889************************************* PP

                        GRAS 330 sekaakqaklllrkvk..sdgyrveeesgslvlgWkdrpLvsvSaWr 374
                                 s+++a+++++ l+  +  + g++v+ee+g + +gW +r+L+++SaWr
  Cotton_A_02423_BGI-A2_v1.0 579 SQTVAETMRAKLNSGNrvNPGFTVKEENGGVSFGWLGRTLTVASAWR 625
                                 **********9998777789**************************8 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   49.007    230    603  IPR005202    Transcription factor GRAS
Pfam             PF03514   1.4E-107  257    625  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0005737  Cellular Component  cytoplasm
Sequence
Protein Sequence    Length: 625 aa
MASGFSGGGP DFYGGLAGRS VGNTGTINNN QTTAPYRTQV PGMFMDTTSQ IVNRATPGFI  60
GKRTLADFQT QQALNNNPGR LFLRSVKPRT YQHTSPISPL SPIDLPSNLS PDVMSNFSSP  120
SSCMSQRYGL PLLQQLRSQQ VPLGINTGAT IQAVNTGLSG GPYMNPVSTR VVQPHDPEKK  180
MMNQLQDLEK QLLDDDNDEG DAVSVITNTN SEWSETIQNL IGSTGSPNNP IAPSPTSTTS  240
SCSSTSSTAS PASPCSKQTI LDAAAAISEG KNDVVNEILT RLAQAANSKG SSEQRLMECM  300
LSALKSRVNS VENPPPVAEL FSKEHAVSTQ LLYDLSPCFK LGFLAANQAI LDATLDKPSC  360
NKFHVVDFDF GSGGQYMNLL HALSERGNGK PATVKITAIA DNGGDERLKT VGDRLSQFAE  420
SYGVSLKFNV ISGLKLSDLS RDSLGIEQDE PLAVNFAFKL YRMPDESVSV ENPRDELLRR  480
VKGLAPRVVT LVEQEMNTNT APFALRVGEA CGYYGALFDS VESTVLRDNP ERAKLEEGLL  540
RKIANSVACE GRDRVERCEV FGKWRARMSM AGFELKPLSQ TVAETMRAKL NSGNRVNPGF  600
TVKEENGGVS FGWLGRTLTV ASAWR
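The listing above groups residues in blocks of ten and appends a running position count to each full line. A minimal sketch for turning such a listing back into a plain sequence, and for cutting out a region by the 1-based, inclusive coordinates used in the domain tables, might look like the following. The function names `parse_numbered_sequence` and `slice_1based` are hypothetical helpers, not part of any PlantTFDB tool.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Join a numbered, space-grouped sequence listing into one string."""
    # Keep letters only: this drops spaces, newlines and position counters.
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_1based(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# Last two lines of the listing (residues 541-625), used as a small sample.
tail = """RKIANSVACE GRDRVERCEV FGKWRARMSM AGFELKPLSQ TVAETMRAKL NSGNRVNPGF  600
TVKEENGGVS FGWLGRTLTV ASAWR"""
tail_seq = parse_numbered_sequence(tail)
```

Applied to the complete listing, `parse_numbered_sequence` should return a string whose length matches the stated 625 aa, and `slice_1based(full_seq, 257, 625)` would then extract the region reported for the GRAS domain.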
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5hyz_A  3e-42   255          625        11         375      GRAS family transcription factor containing protein, expressed
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_017611282.1      0.0      PREDICTED: scarecrow-like protein 8
Swissprot  Q9FYR7              0.0      SCL8_ARATH; Scarecrow-like protein 8
TrEMBL     A0A2P5W533          0.0      A0A2P5W533_GOSBA; Uncharacterized protein
STRING     Gorai.007G012300.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM4350              28           56
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G52510.1  0.0      SCARECROW-like 8