PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_23600_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 641aa    MW: 71166 Da    PI: 6.1543
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_23600_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS408.27.7e-1252576261374
                        GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalkl 83 
                                 lv+lL +c+ea+ s++ +++++++++l elasp+g+ + Rl+ay+teALa r++r +++++++ +p+e + +  +++ +al+l
  Cotton_A_23600_BGI-A2_v1.0 257 LVRLLAACVEAIGSKNIAAINHFISQLGELASPRGTVISRLTAYYTEALALRVTRVWPHIFHITTPRELD-RLDDDNGTALRL 338
                                 6899**************************************************************9998.568888999*** PP

                        GRAS  84 fsevsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLa 166
                                 +++v+Pi+kf h+t N+ +l+a+eg++rvHi+Dfdi+qGlQWp+L+q+La+R+++p ++R+Tg+g+    sk+el+etg rLa
  Cotton_A_23600_BGI-A2_v1.0 339 LNQVTPIPKFVHFTSNEILLRAFEGKDRVHIVDFDIKQGLQWPSLFQSLAARTNPPSHVRVTGIGE----SKQELNETGGRLA 417
                                 ******************************************************************....************* PP

                        GRAS 167 kfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhn 249
                                  fAe+l++pfef++ v++rled++l++L+ k++E++aVn+v+qlh++l +        +++L l++s++P  v+++eqe++ n
  Cotton_A_23600_BGI-A2_v1.0 418 GFAEALNLPFEFHP-VVDRLEDVRLWMLHAKENESIAVNCVFQLHKTLYDGNGGV--LRDFLGLIRSINPIAVIIAEQETENN 497
                                 **************.7*******************************96655444..489*********************** PP

                        GRAS 250 sesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsek 332
                                   ++ +r++++l+yysa+fds++++lp es  rik+E++ ++rei+n++aceg++r+erh+++ekWr+ +e+ GFk + ++++
  Cotton_A_23600_BGI-A2_v1.0 498 ILNLEARVANSLRYYSAIFDSIDSTLPLESPVRIKIEEM-FAREIRNLIACEGSDRFERHTSFEKWRKLMEQGGFKCMGITDR 579
                                 ***************************************.******************************************* PP

                        GRAS 333 aakqaklllrkvksdgyrveee.....sgslvlgWkdrpLvsvSaWr 374
                                 +  q ++ll+++ s+ y+v+++     +g+l+l W d+pL++vSaW+
  Cotton_A_23600_BGI-A2_v1.0 580 ELVQSQMLLKMYTSENYSVKKQgpdgdDGALTLSWLDEPLYTVSAWT 626
                                 *************999****77777779******************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098557.026231601IPR005202Transcription factor GRAS
PfamPF035142.7E-122257626IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 641 aa     Download sequence    Send to blast
MLAGCSSSTL LSPRHRLRSE TSAVQFQACH FQTSMSTQRL DLPCNFSRKE TSKSQTIRPA  60
VESKAKTSSC SIKQNIRLPP LTTTSAQNPF EGRIEIKGKS LKRFAEQGLV DDETVINRAK  120
RKKGSSDDEK PDDHGGLSLG QLGAGNFWFQ PSLSGDEERG TPPLPLSNNP WIDSVITELT  180
DVGEKDVETT HRPGKEASGS GSTSTSSESH SLGPPLNVQA KEYERGNGSG NPYPHEGARL  240
GANEEEINHR EHEGFELVRL LAACVEAIGS KNIAAINHFI SQLGELASPR GTVISRLTAY  300
YTEALALRVT RVWPHIFHIT TPRELDRLDD DNGTALRLLN QVTPIPKFVH FTSNEILLRA  360
FEGKDRVHIV DFDIKQGLQW PSLFQSLAAR TNPPSHVRVT GIGESKQELN ETGGRLAGFA  420
EALNLPFEFH PVVDRLEDVR LWMLHAKENE SIAVNCVFQL HKTLYDGNGG VLRDFLGLIR  480
SINPIAVIIA EQETENNILN LEARVANSLR YYSAIFDSID STLPLESPVR IKIEEMFARE  540
IRNLIACEGS DRFERHTSFE KWRKLMEQGG FKCMGITDRE LVQSQMLLKM YTSENYSVKK  600
QGPDGDDGAL TLSWLDEPLY TVSAWTPIDV AGSSSSFSQP S
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A8e-732416271380Protein SCARECROW
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017622689.10.0PREDICTED: scarecrow-like protein 28
SwissprotQ9CAN30.0SCL28_ARATH; Scarecrow-like protein 28
TrEMBLA0A1U8N2580.0A0A1U8N258_GOSHI; scarecrow-like protein 28
STRINGGorai.003G162400.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM72962743
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G63100.10.0GRAS family protein