PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_07105_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 774aa    MW: 84805.4 Da    PI: 5.8467
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_07105_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS3342.8e-1024127732374
                        GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklf 84 
                                  ++L+++Ae v +g+++ aq +Larl++++sp g p+qR+a+yf+eAL+  l+   ++ ++  pp++ +  +++ ++ a+k+f
  Cotton_A_07105_BGI-A2_v1.0 412 LDQLYKAAELVGTGNFSHAQLILARLNHQVSPVGMPLQRAAFYFKEALQVLLLM--NNPVSPPPPRSPTAFDVIFKMGAYKVF 492
                                 679*************************************************98..5556677777777789*********** PP

                        GRAS  85 sevsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLak 167
                                 sevsP+++f ++t+NqaIlea++  +r+Hi+Dfdi+ G+QW++++q+L  R++g+pslRiT+++sp+++++ el  +++ L +
  Cotton_A_07105_BGI-A2_v1.0 493 SEVSPLIQFVNFTCNQAILEALDDTDRIHIVDFDIGFGAQWASFMQELPMRSRGAPSLRITAFASPSTHHSIELGLMRDILMQ 575
                                 *********************************************************************************** PP

                        GRAS 168 fAeelgvpfefnvlvakrledleleeLrvkp...gEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqead 247
                                 fA+e+g++fe +v+  ++l++ ++++L + +   +Ea+aVn+ +   +  +++ +     ++++++vk+lsPk+vv  +q +d
  Cotton_A_07105_BGI-A2_v1.0 576 FANEIGLSFELEVVNFDSLDQ-NPHSLPMFQsngNEAIAVNFPV--WSCSNRPSA----LPHLMRFVKQLSPKIVVSLDQGCD 651
                                 *************86666555.566665555556********99..444434444....456********************* PP

                        GRAS 248 hnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvpls 330
                                 +n+ +F +++++a++ y+ l++sl+a+  ++s++ +k+Er+l+ ++i++ v    +      e++  W+  +++aGF+pv++s
  Cotton_A_07105_BGI-A2_v1.0 652 RNDLTFPQHVIHAFQSYTNLLESLDAG-NATSDAVNKIERFLFVPRIESSVLGHLQT----PEKMPLWSTLFSSAGFTPVTFS 729
                                 ************************999.799********************998887....9********************* PP

                        GRAS 331 ekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                                 +++++qa++++++ +++g++ve++++slvl+W++r+L+s+SaWr
  Cotton_A_07105_BGI-A2_v1.0 730 NFTETQADCVVKRAQVRGFHVEKRQASLVLCWQQRNLISASAWR 773
                                 *******************************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098545.392385753IPR005202Transcription factor GRAS
PfamPF035149.6E-100412773IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 774 aa     Download sequence    Send to blast
MHFELQAKGA VELPGFASIC QQNKWIKQRE ANSFNFGNGF YYNNEPEPTS VLHLRRSQSP  60
PTSASTLSSF NGGATGSGGG VNSTGNTNTT AATIAPPETT APNNNSKEEW ASELQPIPSE  120
LDLVPGPGGA QRCNSGLEDW ETMLSESAVS LNQDHSLLRW IAGEVDDPSF GLKQLLQSGS  180
SGPNQGVDFE GNAGVGAVDQ GPVSDPIGSL ISSSSGNGIS NAALNLGGLP GSGFVPNANN  240
ENEPSCSSVE LVNNSKVLGA TVGSNCNIQN PLFSSPASNF GFPVSLPILY QQQHQEEKPR  300
IFIGPQQHPQ YPNFFLPLAQ EHHLFQPLPQ RLNSGNLELS SQIPKLPFAD PGHELFTRKQ  360
QQQHMGFSQG VQYVPQQQPL MVEKKRVLVP GEEMAQPQHQ YQLHQQQQAT LLDQLYKAAE  420
LVGTGNFSHA QLILARLNHQ VSPVGMPLQR AAFYFKEALQ VLLLMNNPVS PPPPRSPTAF  480
DVIFKMGAYK VFSEVSPLIQ FVNFTCNQAI LEALDDTDRI HIVDFDIGFG AQWASFMQEL  540
PMRSRGAPSL RITAFASPST HHSIELGLMR DILMQFANEI GLSFELEVVN FDSLDQNPHS  600
LPMFQSNGNE AIAVNFPVWS CSNRPSALPH LMRFVKQLSP KIVVSLDQGC DRNDLTFPQH  660
VIHAFQSYTN LLESLDAGNA TSDAVNKIER FLFVPRIESS VLGHLQTPEK MPLWSTLFSS  720
AGFTPVTFSN FTETQADCVV KRAQVRGFHV EKRQASLVLC WQQRNLISAS AWRC
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5hyz_A6e-3543477326375GRAS family transcription factor containing protein, expressed
Search in ModeBase
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017638390.10.0PREDICTED: scarecrow-like protein 22 isoform X1
TrEMBLA0A0D2RL860.0A0A0D2RL86_GOSRA; Uncharacterized protein
STRINGGorai.008G238300.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM16534813
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00150.11e-109GRAS family protein