PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_30598_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 627 aa    MW: 70459.3 Da    pI: 5.3474
Description GRAS family protein
Gene Model
Gene Model ID                Type     Source   Coding Sequence
Cotton_A_30598_BGI-A2_v1.0   genome   BGI      View CDS
Signature Domain
No.   Domain   Score   E-value   Start   End   HMM Start   HMM End
1     GRAS     318.6   1.3e-97   266     626   2           374
                        GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklf 84 
                                  ++L ++Ae+v++g+++ aq +Larl++l sp g+++qR+ +yf+eAL+  l+       ++  +++ ++ +++ ++ a+k+f
  Cotton_A_30598_BGI-A2_v1.0 266 LDQLCKVAEMVETGNFSHAQGILARLNHL-SPVGKTFQRIGFYFKEALQLLLFM----YNNTPVSKNPTPFDVIFKMGAYKVF 343
                                 679************************98.799********************8....33344555666667*********** PP

                        GRAS  85 sevsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpeg.ppslRiTgvgspesgskeeleetgerLa 166
                                 sevsP+++f ++t Nqa lea+e ++r+Hi+Dfdi+ G+QW++++q+L  R++g  pslRiT+++s ++ +  el+ ++e+L+
  Cotton_A_30598_BGI-A2_v1.0 344 SEVSPFVQFVNFTSNQAFLEALEDSDRIHIVDFDIGFGAQWASFMQELPMRSKGvVPSLRITAFVSLSTYHPIELSLIRENLE 426
                                 ****************************************************55389*********9999************* PP

                        GRAS 167 kfAeelgvpfefnvlvakrledl..eleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqead 247
                                 +fA+ +gv+fe +vl  ++l++   +l ++r++++EalaVn+ +   +    +  l    +++L++vk+l+Pk+vv  ++ +d
  Cotton_A_30598_BGI-A2_v1.0 427 QFANGIGVNFELEVLNFDSLDQNpySLPMFRMNENEALAVNFPVWSASYR--PSIL----PNLLRIVKQLRPKIVVSLDRGCD 503
                                 ***************888777654488899999**********9776665..4444....56********************* PP

                        GRAS 248 hnsesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvpls 330
                                 +n+ sF +++++a++ y +l++sl+a +  +s++ +k+Er+l+ ++i+  v  +  +     e++  W+  + +aGF+p+++s
  Cotton_A_30598_BGI-A2_v1.0 504 RNDLSFPQHIINAFYSYISLLESLDAAVNVNSDAINKIERFLIVPKIETTVLGRLHS----LEKMPPWKTLFASAGFTPLTFS 582
                                 ***************************************************998887....9********************* PP

                        GRAS 331 ekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                                 +++++qa++++++ +++g+++++ ++slvl+W++++L+s+SaWr
  Cotton_A_30598_BGI-A2_v1.0 583 NFTETQAECVVKRAQVRGFHIQKCHASLVLCWQQKELISASAWR 626
                                 *******************************************8 PP

Protein Features
Database          Entry ID   E-value   Start   End   InterPro ID   Description
PROSITE profile   PS50985    43.135    239     606   IPR005202     Transcription factor GRAS
Pfam              PF03514    4.6E-95   266     626   IPR005202     Transcription factor GRAS
Sequence
Protein Sequence    Length: 627 aa
MHFQFQPKVG VELAGFASIY QDKWAKQQED NSFNDQEPTS VLHMRSQSPP TSASTLSSSF  60
NGGSGGGGSG GNFTTTTIVP PESTQLEFQP IQSELDLVTT GPVGVQRCNN LGIEDWEAML  120
FESTVSPSQD QNSLLRWIAG DVDEHGLKQL LQTGPDFIPG LDPMDPGNLG YFPQNPIFTS  180
PPESIGLYQQ LENQEMKPQI LNPPNPNFFL PFPQEQQPLP KRLNHGQIPK LPFSDHELFI  240
KKQQLVGFQE QKPLMVSQQQ QAATLLDQLC KVAEMVETGN FSHAQGILAR LNHLSPVGKT  300
FQRIGFYFKE ALQLLLFMYN NTPVSKNPTP FDVIFKMGAY KVFSEVSPFV QFVNFTSNQA  360
FLEALEDSDR IHIVDFDIGF GAQWASFMQE LPMRSKGVVP SLRITAFVSL STYHPIELSL  420
IRENLEQFAN GIGVNFELEV LNFDSLDQNP YSLPMFRMNE NEALAVNFPV WSASYRPSIL  480
PNLLRIVKQL RPKIVVSLDR GCDRNDLSFP QHIINAFYSY ISLLESLDAA VNVNSDAINK  540
IERFLIVPKI ETTVLGRLHS LEKMPPWKTL FASAGFTPLT FSNFTETQAE CVVKRAQVRG  600
FHIQKCHASL VLCWQQKELI SASAWRC
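
The sequence block above is formatted in 10-residue groups with a running position counter at the end of each full line, so it cannot be pasted directly into most tools. The following is a minimal sketch, assuming plain Python and no external libraries, of how the block could be flattened into a single residue string and how the GRAS domain segment (positions 266 to 626, as reported in the Signature Domain alignment) could be sliced out. The names `RAW_BLOCK` and `clean_sequence` are illustrative and are not part of PlantTFDB.

```python
import re

# Sequence block copied from the record: 10-residue groups,
# with a running position counter at the end of each full line.
RAW_BLOCK = """\
MHFQFQPKVG VELAGFASIY QDKWAKQQED NSFNDQEPTS VLHMRSQSPP TSASTLSSSF  60
NGGSGGGGSG GNFTTTTIVP PESTQLEFQP IQSELDLVTT GPVGVQRCNN LGIEDWEAML  120
FESTVSPSQD QNSLLRWIAG DVDEHGLKQL LQTGPDFIPG LDPMDPGNLG YFPQNPIFTS  180
PPESIGLYQQ LENQEMKPQI LNPPNPNFFL PFPQEQQPLP KRLNHGQIPK LPFSDHELFI  240
KKQQLVGFQE QKPLMVSQQQ QAATLLDQLC KVAEMVETGN FSHAQGILAR LNHLSPVGKT  300
FQRIGFYFKE ALQLLLFMYN NTPVSKNPTP FDVIFKMGAY KVFSEVSPFV QFVNFTSNQA  360
FLEALEDSDR IHIVDFDIGF GAQWASFMQE LPMRSKGVVP SLRITAFVSL STYHPIELSL  420
IRENLEQFAN GIGVNFELEV LNFDSLDQNP YSLPMFRMNE NEALAVNFPV WSASYRPSIL  480
PNLLRIVKQL RPKIVVSLDR GCDRNDLSFP QHIINAFYSY ISLLESLDAA VNVNSDAINK  540
IERFLIVPKI ETTVLGRLHS LEKMPPWKTL FASAGFTPLT FSNFTETQAE CVVKRAQVRG  600
FHIQKCHASL VLCWQQKELI SASAWRC
"""


def clean_sequence(block: str) -> str:
    """Remove position counters and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


sequence = clean_sequence(RAW_BLOCK)

# The record uses 1-based, inclusive coordinates; Python slices are
# 0-based and end-exclusive, hence the "- 1" on the start only.
DOMAIN_START, DOMAIN_END = 266, 626
domain = sequence[DOMAIN_START - 1:DOMAIN_END]

print(len(sequence))   # full-length protein
print(len(domain))     # GRAS domain segment
print(domain[:10])     # first residues of the domain segment
```

The flattened length can be compared against the "Length: 627 aa" field as a quick check that no residues were lost when copying the block.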
3D Structure
Structure
PDB ID   Evalue   Query Start   Query End   Hit Start   Hit End   Description
5hyz_A   1e-34    297           626         36          375       GRAS family transcription factor containing protein, expressed
Annotation -- Nucleotide
Source    Hit ID     E-value   Description
GenBank   JX622821   4e-78     JX622821.1 Gossypium hirsutum clone NBRI_GE69712 microsatellite sequence.
Annotation -- Protein
Source   Hit ID               E-value   Description
Refseq   XP_017642892.1       0.0       PREDICTED: scarecrow-like protein 6
TrEMBL   A0A2P5WU95           0.0       A0A2P5WU95_GOSBA; Uncharacterized protein
STRING   Gorai.003G170700.1   0.0       (Gossypium raimondii)
Orthologous Group
Lineage   Orthologous Group ID   Taxa Number   Gene Number
Malvids   OGEM2493934
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT4G00150.1   1e-110    GRAS family protein