PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_14441_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 670 aa    MW: 74754.2 Da    pI: 6.3367
Description GRAS family protein
Gene Model
Gene Model ID                Type     Source   Coding Sequence
Cotton_A_14441_BGI-A2_v1.0   genome   BGI      View CDS
Signature Domain
No.   Domain   Score   E-value    Start   End   HMM Start   HMM End
1     GRAS     426.7   1.8e-130   288     655   1           374
                        GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalkl 83 
                                 l++lL++c+ea+ s++ +++++++a+l +lasp+g+++ Rl+ay+teAL  r++r +++++++ +p+e + +  +++ +al+l
  Cotton_A_14441_BGI-A2_v1.0 288 LIHLLTACVEAIGSKNIAAINHYMAKLGDLASPRGSAISRLTAYYTEALTLRVTRLWPHIFHITTPRELD-RVDDDNGTALRL 369
                                 689****************************************************************998.5678888999** PP

                        GRAS  84 fsevsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLa 166
                                 +++vsPi+kf h+t N+ +l+a+eg++rvHiiDfdi+qGlQWp+L+q+LasR+++p ++R+Tg+g+    sk+el+etg+rL+
  Cotton_A_14441_BGI-A2_v1.0 370 LNQVSPIPKFFHFTSNEILLRAFEGKDRVHIIDFDIKQGLQWPSLFQSLASRANPPSHVRVTGIGE----SKQELNETGDRLS 448
                                 ******************************************************************....************* PP

                        GRAS 167 kfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhn 249
                                  fAe+l++pfef++ v++rled++l++L+vk++E++aVn+v+qlh++l +    +   +++L l++s++P vvv++eqea+hn
  Cotton_A_14441_BGI-A2_v1.0 449 GFAEALNLPFEFHP-VVDRLEDVRLWMLHVKEKETVAVNCVFQLHKTLYDGNGGA--LRDLLGLIRSTNPAVVVMAEQEAEHN 528
                                 **************.7********************************6554444..588*********************** PP

                        GRAS 250 sesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsek 332
                                   s+ +r++++l+yysa+fds++++lp es  r+kvE++ ++rei+n++aceg++r+erhe++ekWr+ +e+  F+ + +se+
  Cotton_A_14441_BGI-A2_v1.0 529 VLSLEARVTNSLRYYSAIFDSIDSSLPMESPVRMKVEEM-FAREIRNIIACEGSDRFERHESFEKWRKLMEQGRFRCIGISER 610
                                 ***************************************.******************************************* PP

                        GRAS 333 aakqaklllrkvksdgyrve...eesgslvlgWkdrpLvsvSaWr 374
                                 +  q ++ll+++  + y+v+   e+ g+l+l W d+pL+svSaW+
  Cotton_A_14441_BGI-A2_v1.0 611 ELLQSQMLLKMYTCENYSVKkqgEDGGALTLSWLDQPLYSVSAWT 655
                                 *************999****6766799999**************6 PP

Protein Features
3D Structure
Database          Entry ID   E-value    Start   End   InterPro ID   Description
PROSITE profile   PS50985    57.905     262     632   IPR005202     Transcription factor GRAS
Pfam              PF03514    6.4E-128   288     655   IPR005202     Transcription factor GRAS
Sequence
Protein Sequence    Length: 670 aa
MLAGCSSSTL VSPRHRLRSE ASAQFQACHF PTSMSTQRLD LPCSFSRKDT SRSQPIRPVG  60
LSVEKPTESK TSGCSLKQNI RLPPLTTTAH EGRREIKDEF WEKGKCLKRF AAEGFIDESV  120
IDRRAKRKKG SCHNEISGDV HEGGGDNLSL GQLGAGEFWF QPSFAGHNAP QLPFSLTASG  180
DEERVCFVPG EVISPPLPLS NNPWTESVIT EITDVGEKDV ETIHRPGKEA SGSSTSSESH  240
SLGLRLNEQA TEHEVGNGSG NPYPHEGNGV GVYREEEINH REQQGFELIH LLTACVEAIG  300
SKNIAAINHY MAKLGDLASP RGSAISRLTA YYTEALTLRV TRLWPHIFHI TTPRELDRVD  360
DDNGTALRLL NQVSPIPKFF HFTSNEILLR AFEGKDRVHI IDFDIKQGLQ WPSLFQSLAS  420
RANPPSHVRV TGIGESKQEL NETGDRLSGF AEALNLPFEF HPVVDRLEDV RLWMLHVKEK  480
ETVAVNCVFQ LHKTLYDGNG GALRDLLGLI RSTNPAVVVM AEQEAEHNVL SLEARVTNSL  540
RYYSAIFDSI DSSLPMESPV RMKVEEMFAR EIRNIIACEG SDRFERHESF EKWRKLMEQG  600
RFRCIGISER ELLQSQMLLK MYTCENYSVK KQGEDGGALT LSWLDQPLYS VSAWTPIDVA  660
GSSSSFPQPK
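
The sequence block above is grouped in tens with a running position counter at the end of each full line, while the domain tables report residue coordinates. The following is a minimal sketch of how such a block could be joined into one string and sliced by coordinates. The function names `parse_numbered_sequence` and `slice_domain` are hypothetical helpers, not part of any PlantTFDB tool, and the sketch assumes coordinates are 1-based and inclusive, which matches the alignment in this record (residue 288 is the L that opens "LIHLLTACVEAIG").

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one string.

    Only alphabetic characters are kept, so the 10-residue spacing and the
    trailing position counters (60, 120, ...) are dropped.
    """
    return "".join(re.findall(r"[A-Za-z]+", block)).upper()


def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive
    (an assumption, checked here only against this record's alignment)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("domain coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First line of the record's sequence block, copied verbatim.
first_line = "MLAGCSSSTL VSPRHRLRSE ASAQFQACHF PTSMSTQRLD LPCSFSRKDT SRSQPIRPVG  60"
residues = parse_numbered_sequence(first_line)

print(len(residues))                   # 60
print(slice_domain(residues, 11, 20))  # VSPRHRLRSE
```

Applied to the full block, the parsed length should equal the stated 670 aa, and `slice_domain(sequence, 288, 655)` would return the span reported for the GRAS domain.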
3D Structure
Structure
PDB ID   Evalue   Query Start   Query End   Hit Start   Hit End   Description
5b3g_A   1e-75    274           656         3           380       Protein SCARECROW
Functional Description
Source    Description
UniProt   Probable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein
Source      Hit ID               E-value   Description
Refseq      XP_017638829.1       0.0       PREDICTED: scarecrow-like protein 28
Refseq      XP_017638830.1       0.0       PREDICTED: scarecrow-like protein 28
Swissprot   Q9CAN3               0.0       SCL28_ARATH; Scarecrow-like protein 28
TrEMBL      A0A1U8MGF8           0.0       A0A1U8MGF8_GOSHI; scarecrow-like protein 28
STRING      Gorai.008G243900.1   0.0       (Gossypium raimondii)
Orthologous Group
Lineage   Orthologous Group ID   Taxa Number   Gene Number
Malvids   OGEM72962743
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT1G63100.1   0.0       GRAS family protein