PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_D02G0141
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 1595 aa    MW: 184697 Da    pI: 6.1599
Description GRAS family protein
Gene Model
Gene Model ID    Type      Source
Gh_D02G0141      genome    NAU-NBI
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      GRAS      81.9     1.1e-25    14       364    3            374
         GRAS   3 elLlecAeavss.gdlelaqalLarlselaspdgd...pmqRlaayfteALaarlar.svselykalppsetseknsseelaalklfsevsPilkfs. 94 
                  +lLl+cAea+++ gdl+ a+a+L  +  la +  +   +   +++yf+ AL +r ++ +  ++y ++p+ +++                   ++    
  Gh_D02G0141  14 RLLLSCAEAIEEyGDLKSADAFLHDILILADQGTSwfpNEIGVVKYFADALVRRAYGlHPASCYFTFPVDPAP-------------------YYHYNs 92 
                  799*******9989**********99888766554554566899************93334444444444443...................333330 PP

         GRAS  95 ...hltaNqaIleavegeervHiiDfdisqGlQWp.aLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfe..fnvlvakrl 186
                     + +  + I +a+ g++r+H+iDf+i +   +  + l++L +  ++p  +R+  + +p  ++  e ++  e L+k Aee++v++e   +v++ ++l
  Gh_D02G0141  93 yhiNGVIEKVIDDAFMGNRRLHVIDFSIPYYYRFEnSVLRTLPNFFGDPLPVRVSYILPPFLKEYVEFSRQMEILTKDAEEVNVKLEneLKVVYGNSL 190
                  0003345678899********************96389******99*************9999999999999*************97225667888** PP

         GRAS 187 edleleeLrvkp...gEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpresee 281
                  ++++  e+++k+   +E+++V   ++l +l++e  ++e      L  +k+++P++v++ +  ++h++++Fl+ f ++++y  + +d +++       +
  Gh_D02G0141 191 AEVDECEIDFKRrrdDEMVVVYYKFKLDKLVREAKAMER----ELARLKEINPTIVIMLDFYSNHSDSNFLTCFKDSFQYSLKTLDYWAEL------D 278
                  **********99999****************99999988....9********************************************766......4 PP

         GRAS 282 rikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                      E+   g+e +n  a eg++ + rh tl +W++ +++aGF+ +pl+++  +   l ++  + +   + e++++l+lg ++ p++++SaW+
  Gh_D02G0141 279 YYLGEEY--GWE-CNREAGEGNNIIRRHPTLTEWQHLFSMAGFSRIPLNHRKDN---LSVEDES-WLEIMGEKEECLILGYNGCPMFFLSAWK 364
                  4445555..555.67779999999************************987654...3333333.43455699*******************8 PP

Protein Features
Database           Entry ID              E-value     Start    End     InterPro ID    Description
PROSITE profile    PS50985               14.618      1        348     IPR005202      Transcription factor GRAS
Pfam               PF03514               3.7E-23     14       364     IPR005202      Transcription factor GRAS
SMART              SM00614               1.4E-13     619      669     IPR003656      Zinc finger, BED-type
PROSITE profile    PS50808               10.049      619      673     IPR003656      Zinc finger, BED-type
Pfam               PF02892               2.0E-6      622      662     IPR003656      Zinc finger, BED-type
SuperFamily        SSF57667              2.62E-6     622      671     No hit         No description
SuperFamily        SSF53098              3.11E-43    763      1199    IPR012337      Ribonuclease H-like domain
Pfam               PF14372               1.2E-12     978      1069    IPR025525      hAT-like transposase, RNase-H fold
Pfam               PF05699               6.2E-18     1120     1198    IPR008906      HAT, C-terminal dimerisation domain
PROSITE profile    PS50600               15.296      1384     1562    IPR003653      Ulp1 protease family, C-terminal catalytic domain
SuperFamily        SSF54001              2.94E-32    1407     1590    No hit         No description
Pfam               PF02902               1.4E-20     1408     1588    IPR003653      Ulp1 protease family, C-terminal catalytic domain
Gene3D             G3DSA:3.30.310.130    7.5E-12     1436     1557    No hit         No description
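The Pfam rows of the Protein Features table suggest a multi-domain architecture: a GRAS domain followed by BED zinc finger, hAT and Ulp1 protease regions. The sketch below orders those hits along the protein and estimates how much of the 1595 aa sequence they cover. The coordinates are as read from the table above, and the names `hits`, `ordered_architecture` and `covered_residues` are hypothetical helpers introduced for illustration, not part of PlantTFDB.

```python
# Pfam hits as (entry_id, start, end, description); coordinates are an
# assumption based on reading the Protein Features table (1-based, inclusive).
hits = [
    ("PF02902", 1408, 1588, "Ulp1 protease family, C-terminal catalytic domain"),
    ("PF03514", 14, 364, "Transcription factor GRAS"),
    ("PF05699", 1120, 1198, "HAT, C-terminal dimerisation domain"),
    ("PF02892", 622, 662, "Zinc finger, BED-type"),
    ("PF14372", 978, 1069, "hAT-like transposase, RNase-H fold"),
]

PROTEIN_LENGTH = 1595  # from the Protein Properties line


def ordered_architecture(hits):
    """Return entry IDs sorted by start position (N- to C-terminus)."""
    return [entry_id for entry_id, start, _end, _desc in sorted(hits, key=lambda h: h[1])]


def covered_residues(hits):
    """Count residues covered by at least one hit, merging overlapping intervals."""
    intervals = sorted((start, end) for _id, start, end, _desc in hits)
    total = 0
    cur_start, cur_end = intervals[0]
    for start, end in intervals[1:]:
        if start <= cur_end:
            # Overlapping hit: extend the current interval.
            cur_end = max(cur_end, end)
        else:
            # Gap: close the current interval and start a new one.
            total += cur_end - cur_start + 1
            cur_start, cur_end = start, end
    total += cur_end - cur_start + 1
    return total


if __name__ == "__main__":
    print(" > ".join(ordered_architecture(hits)))
    covered = covered_residues(hits)
    print(f"{covered} of {PROTEIN_LENGTH} residues fall inside a Pfam hit")
```

The same functions can be given the PROSITE or SuperFamily rows instead; overlapping hits from different databases are merged rather than counted twice.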
Gene Ontology
GO Term       GO Category           GO Description
GO:0006508    Biological Process    proteolysis
GO:0003677    Molecular Function    DNA binding
GO:0008234    Molecular Function    cysteine-type peptidase activity
GO:0046983    Molecular Function    protein dimerization activity
Sequence
Protein Sequence    Length: 1595 aa
MASFFDIDTD TALRLLLSCA EAIEEYGDLK SADAFLHDIL ILADQGTSWF PNEIGVVKYF  60
ADALVRRAYG LHPASCYFTF PVDPAPYYHY NSYHINGVIE KVIDDAFMGN RRLHVIDFSI  120
PYYYRFENSV LRTLPNFFGD PLPVRVSYIL PPFLKEYVEF SRQMEILTKD AEEVNVKLEN  180
ELKVVYGNSL AEVDECEIDF KRRRDDEMVV VYYKFKLDKL VREAKAMERE LARLKEINPT  240
IVIMLDFYSN HSDSNFLTCF KDSFQYSLKT LDYWAELDYY LGEEYGWECN REAGEGNNII  300
RRHPTLTEWQ HLFSMAGFSR IPLNHRKDNL SVEDESWLEI MGEKEECLIL GYNGCPMFFL  360
SAWKLKVKDG HFNSISTNHT FGQGFNPNPL PLQPLQPFLE GLILNRLATL AEIHDISKDL  420
CCKYKLSLAL TWAAEVNNMN ETISDPKKKH AFSIQSNSCY VKDGKSYQFM FASECLISGL  480
IIEKALEARD GYHFEPSVTK VLPKVEDFRY SMLKDCNIDV VVAICLQNRH TSDEVYIVEF  540
YWPPTESEIS KFLALRIFDD FKHMKTTFVT VKVRGPEIKF QEEAISSVPT SSNTAMPSKI  600
AENAHIEQIV ETKRNKQRKS WSKVWEDFDK FEEHGKQVAK CKHCLKVFTG SSKSGTTHLK  660
NHSKVCPGKK KQNQESQLIL PVDTNERSST FDQETSHLDL VKMVIRHQYP LDLAGQEAFK  720
NFVEGLQPMY EFQSRDKLLS DIHRIYNEER EKLQLYFDQL ACKLNLTVSL WKNNHGKTAY  780
CCLIAHFIDD GWELKMKILG LRKLEHVHDT KVVGGIIRSF VSEWKISKKV CSITVDNSFL  840
NDGMVHQIRE NCVSEQGSLS SAHWFISSTL LEDGFREMDS IHSKLWKSIE YVTETTHGKL  900
NFQEAVNQVK LQGGKSWDEL SFKLESDSDI LDSALRSREI FCKLEQIDDN FMLNLSKEEW  960
EKAVTLQSCF KCFDDIKGTQ SLTANLYFPK LCNMYEEFGQ LKKSNHPFVI LMKRKFDNYW  1020
SLCNVAFTIA AALDPRLKFR FSCNETYDPE SMMKLKRFRK VLMDVYFEYA NEAKNLSASS  1080
SVMDDSNSLT AETTKDCIVS YFSKFASASN VNEVASQKSE LDCYLEETLL PSDADILGWW  1140
RVNSQRFPTL AKMARDFLAI PVSVSSPCSN IRAMTINPAY SSLDPESMEA LVCSQNWLES  1200
TKENNGELHE PMQNMDKRKR KMEENDTSTV KVFKNRTHEK ASSNGDIASD FNKNDGSLSF  1260
DNWMEPQCSS SESVGEKAEI MEASVCNRDR LESSIGKTNH GRNIAAAIEI PNDEPSFNSN  1320
QLDQIQSSSS ESDDETTLRE QGSWCREDVR TYLVSSFTNK EKKRLNRWKR SELSGKKIGR  1380
DKEFQLMGEN LTPLLVAPHC DETLREYYID DSVVNTFFKL LKKRSDKFPN VYIKHYSFDS  1440
QIATFLIKGS KLEDEVLAWF KDEKLRGVHK LFLPMCLSAH WVLFCVDTKE KKISWLDPIP  1500
SSRIMSNSFE KQKIFQWFTL YLLPQFGYND AEKWPFEVRT DIPKQENSID CGVFVMKYGD  1560
CLMHDDFFPF TQKDMIHFRR RIFLDIYRGR LHGKR
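The sequence above is laid out in groups of ten residues with a running position counter at the end of each line, so it has to be cleaned before it can be sliced with the domain coordinates reported elsewhere in this entry. A minimal sketch, assuming the block is available as plain text; `clean_sequence_block` and `domain_slice` are hypothetical helper names, and only the first line of the sequence is used here to keep the example short.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Strip position counters and whitespace from a formatted sequence block."""
    # Keeping letters only removes the spaces between 10-residue groups
    # and the running position numbers at the end of each line.
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive coordinates."""
    return seq[start - 1:end]


# First line of the protein sequence, copied from the block above.
first_line = "MASFFDIDTD TALRLLLSCA EAIEEYGDLK SADAFLHDIL ILADQGTSWF PNEIGVVKYF  60"
seq = clean_sequence_block(first_line)

print(len(seq))                   # 60
print(domain_slice(seq, 14, 23))  # RLLLSCAEAI
```

Residue 14 is where the GRAS signature domain alignment begins, so the slice starts with the same `RLLLSCAEAI` stretch shown on the Gh_D02G0141 row of that alignment. Passing the full sequence block instead of `first_line` allows the complete 14 to 364 domain to be extracted the same way.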
3D Structure
Structure
PDB ID    E-value    Query Start    Query End    Hit Start    Hit End    Description
2ckg_A    5e-14      1408           1591         47           224        SENTRIN-SPECIFIC PROTEASE 1
2ckg_B    5e-14      1408           1591         47           224        SENTRIN-SPECIFIC PROTEASE 1
2ckh_A    5e-14      1408           1591         47           224        SENTRIN-SPECIFIC PROTEASE 1
6nnq_A    4e-14      1408           1591         46           223        Sentrin-specific protease 1
Expression -- UniGene
UniGene ID    E-value    Expressed in
Ghi.3103      0.0        ovule
Expression -- Microarray
Source    ID           E-value
GEO       109883537    0.0
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    JX589138    3e-53      JX589138.1 Gossypium hirsutum clone NBRI_GE23769 microsatellite sequence.
Annotation -- Protein
Source    Hit ID                E-value    Description
Refseq    XP_012481502.1        0.0        PREDICTED: uncharacterized protein LOC105796364
TrEMBL    A0A1U8LHD2            0.0        A0A1U8LHD2_GOSHI; uncharacterized protein LOC107927468 isoform X2
STRING    Gorai.005G015100.1    0.0        (Gossypium raimondii)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM8809420
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G01570.1    2e-30      GRAS family protein