PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_D02G0128
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 1619 aa    MW: 187440 Da    pI: 6.1996
Description GRAS family protein
Gene Model
Gene Model ID   Type     Source
Gh_D02G0128     genome   NAU-NBI
Signature Domain
No.   Domain   Score   E-value   Start   End   HMM Start   HMM End
1     GRAS     89.7    4.6e-28   7       352   3           374
         GRAS   3 elLlecAeavssgdlelaqalLarlselaspdg.dpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshltaN 99 
                  +lLl+cA+a+++gdl++a+a+L ++  la ++  +   ++++yf+ AL +r ++        l+p ++  +  ++ +    l+++   ++   + +  
  Gh_D02G0128   7 NLLLSCAKAIEDGDLKRADAFLHNILILADERPySYQSKVVKYFADALVRRAYG--------LHPASSYFSFPVDPA----LYYHYNSYH--INGVIK 90 
                  79**************************9999956677****************........333333211122222....233333333..344567 PP

         GRAS 100 qaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvl.vakrledleleeLrv 196
                  + I++a+ g++r+H iDf+i++ +   + l++L +  + p  +R+  + +p  ++  e ++  e L+k Aee++v++e ++  + ++l++++  e+++
  Gh_D02G0128  91 EVIYDALLGNRRLHLIDFSIQYYGFEGSVLRTLPNFFGYPLPVRVSYILPPFLKEYVEFSRQMEFLTKDAEEVDVKLEDELKvYGNSLAEVDECEIDF 188
                  99********************9999*********999999*********99999999999999*************8775327789*********99 PP

         GRAS 197 kp...gEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsl.eaklpreseerikvErell 290
                  k+   +E+++V   ++l +l+++  ++e      L  +k+++P++v++ +  ++h++++Fl+ f ++++y  + +d + e  l      + k+E e  
  Gh_D02G0128 189 KRrkdDEMVVVYYKFKLDKLVRDAKAMER----ELVRLKEINPTIVIMLDFYSNHSHSNFLTCFKDSFQYSLKTLDYWvELDL----YLNGKYEWE-- 276
                  99999****************99888888....8999*****************************************66665....666777777.. PP

         GRAS 291 greivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                      +n+ a eg++ + rh tl +W++ +++aGF+ +pls++  +   l ++  ++    + ee+++l+lg ++ p++++SaW+
  Gh_D02G0128 277 ----CNIEAGEGNNIIGRHPTLTEWQHLFSMAGFSRIPLSHRKDN---LRVEDESFL-EIMGEEEECLILGYEGCPMFFLSAWK 352
                  ....8999********************************98653...344444422.2455999******************8 PP
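The Start and End values of the GRAS hit (7 and 352) are consistent with 1-based, inclusive residue positions in the protein sequence of this record. A minimal Python sketch of pulling a region out by such coordinates follows; the helper name `slice_domain` is hypothetical, and only the first 60 residues of Gh_D02G0128 are included to keep the example short.

```python
def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"coordinates {start}-{end} fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# First 60 residues of Gh_D02G0128, copied from this record.
first_60 = "MASTALNLLLSCAKAIEDGDLKRADAFLHNILILADERPYSYQSKVVKYFADALVRRAYG"

# The GRAS hit begins at residue 7. Only residues 7-30 are extracted here
# because this excerpt stops at residue 60.
gras_start = slice_domain(first_60, 7, 30)
print(gras_start)
```

The extracted fragment should match the beginning of the Gh_D02G0128 row in the alignment above, which is a quick way to confirm that the coordinate convention has been read correctly.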

Protein Features
Database          Entry ID             E-value   Start  End    InterPro ID   Description
PROSITE profile   PS50985              17.009    1      336    IPR005202     Transcription factor GRAS
Pfam              PF03514              1.6E-25   7      352    IPR005202     Transcription factor GRAS
PROSITE profile   PS50808              9.749     617    671    IPR003656     Zinc finger, BED-type
SMART             SM00614              6.1E-14   617    667    IPR003656     Zinc finger, BED-type
Pfam              PF02892              1.6E-5    620    660    IPR003656     Zinc finger, BED-type
SuperFamily       SSF57667             7.97E-6   620    669    No hit        No description
SuperFamily       SSF53098             1.14E-40  787    1223   IPR012337     Ribonuclease H-like domain
Pfam              PF14372              8.7E-12   1002   1093   IPR025525     hAT-like transposase, RNase-H fold
Pfam              PF05699              4.6E-19   1144   1222   IPR008906     HAT, C-terminal dimerisation domain
PROSITE profile   PS50600              15.759    1408   1586   IPR003653     Ulp1 protease family, C-terminal catalytic domain
Pfam              PF02902              1.0E-20   1432   1612   IPR003653     Ulp1 protease family, C-terminal catalytic domain
SuperFamily       SSF54001             1.16E-33  1432   1614   No hit        No description
Gene3D            G3DSA:3.30.310.130   3.0E-12   1460   1581   No hit        No description
Gene Ontology
GO Term      GO Category          GO Description
GO:0006508   Biological Process   proteolysis
GO:0003677   Molecular Function   DNA binding
GO:0008234   Molecular Function   cysteine-type peptidase activity
GO:0046983   Molecular Function   protein dimerization activity
Sequence
Protein Sequence    Length: 1619 aa
MASTALNLLL SCAKAIEDGD LKRADAFLHN ILILADERPY SYQSKVVKYF ADALVRRAYG  60
LHPASSYFSF PVDPALYYHY NSYHINGVIK EVIYDALLGN RRLHLIDFSI QYYGFEGSVL  120
RTLPNFFGYP LPVRVSYILP PFLKEYVEFS RQMEFLTKDA EEVDVKLEDE LKVYGNSLAE  180
VDECEIDFKR RKDDEMVVVY YKFKLDKLVR DAKAMERELV RLKEINPTIV IMLDFYSNHS  240
HSNFLTCFKD SFQYSLKTLD YWVELDLYLN GKYEWECNIE AGEGNNIIGR HPTLTEWQHL  300
FSMAGFSRIP LSHRKDNLRV EDESFLEIMG EEEECLILGY EGCPMFFLSA WKPKVEDVHY  360
NSNSTNHKFG QGFNPNPPPL QPLQPFQEGL ILNQLAALAE IHDISKDLCC KYKLSLALTW  420
AAKVNNMNET ISDPNKKHAF SIQSNYCYVK NRKYYDFMLL SECLISGPII EKAFESRDGY  480
HFEPSITKVV TKVEDLRNFM LENFNIDVAV AICLQNRHTS DEVYIVEFYW PPTESEISNS  540
LALRIFDDLK HTKTMFVTVK VQGPEIKFQE EAISSTPTSS NTAMHLKIGE EARDIHAIEI  600
NAHIEQIVET KRNKQRKSWS KVWMDFDKFE EDGKQVAKCK HCPKVLTGSS KSGTTHLNNH  660
SKVCPGKKKQ NQESQLIISV DTNERSSTFD QERKKQNQES QLILPVDTNE RSSKFDQERS  720
HLDLVKMVIR HQYPLDLAGQ EVFKNFVKGL QPMYEFQSRD KLLSDIHRIY NEEREKLQLY  780
FDQLACKLNL TVSLWKNNHG KTAYCCLIAH FIDDGWELKM KILGLRKLEH VYDTKVVGGI  840
IRSFVSEWNI SKKVCSITVD NSFLNDGMVH QIRENCVSEQ GSFSSARWFI SFTLLEDGFR  900
EMDSILSKLW KSIEYVTETT HGKLNFQEAV NQVKLQGGKS WHELSFKLES DSDILDSALR  960
SREIFCKLEK IDDNFMLNLS KEEWEKAVTL QSCFKCFDDI KGTQSLTANL YFPKLCNMYE  1020
EFGQLKKSNH PFVILMKRKF DNYWSLCNVA FTIAAALDPR LKFRSSCNET YDLESMMKLI  1080
RFRKVLMDVY SEYANEAKNL SASSSVLDDS NSLTAETTKD CIVSYFSKFA SASNVKEVAS  1140
QKSELDCYLE ETLLPSDADI LGWWRVNSQR FPTLAKMARD FLAIPVSVSS PCSNISAMTI  1200
NPAYSSLDPE SMEALVCSEN WLESTKENNG EYHEPMQNMD IRKRKMEEND TSTVKVFKHR  1260
TREKASSNGD IASDFNKNDG SLSFDNWMEP QCSSSESVGE KAEIMEASVC NGDSLESSIG  1320
KTNHGRNIAA AIEIPNDEPS FNSNQLDRFQ SSSSESDDET TLRAQGSWCR EDVRTYLVSS  1380
FTNKEKKRLN RWKRSELSRK KIGRDKEFQL MGENLTPLLM VPHCDETLIE YYIDDSVVNT  1440
YFKLLKKRSD KFPNVYIKHY SFDSLIATCL IEGSKSEDEV LAWFKDEKLR GVHKLFLPMC  1500
LSAHWVLFCV DTKEKKISWL DPIPSSRIMS NSVEKEKIFQ WFTLYLLPQF GYNDAEKWPF  1560
EVRTDIPKQE NSIDCGVFVM KYGDCLMHGD FFPFTQKDMI HFRRRIFLDI YRGRLHGKR
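The listing above is laid out in blocks of 10 residues with a running position at the end of each full line, so it has to be flattened before it can be used in other tools. The following Python sketch shows one way to do that; the function name `parse_blocked_sequence` is hypothetical, and the excerpt holds only the first two lines of the listing.

```python
import re


def parse_blocked_sequence(text: str) -> str:
    """Join a blocked, position-numbered protein listing into one string."""
    residues = []
    for line in text.splitlines():
        # Drop the trailing running position (for example "  60"), if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces that separate the 10-residue blocks.
        residues.append(re.sub(r"\s+", "", line))
    return "".join(residues)


# First two lines of the Gh_D02G0128 listing, copied from this record.
excerpt = """
MASTALNLLL SCAKAIEDGD LKRADAFLHN ILILADERPY SYQSKVVKYF ADALVRRAYG  60
LHPASSYFSF PVDPALYYHY NSYHINGVIK EVIYDALLGN RRLHLIDFSI QYYGFEGSVL  120
"""

sequence = parse_blocked_sequence(excerpt)
print(len(sequence))  # 120, matching the running position on the second line
```

Applied to the full listing, the resulting length can be compared with the stated 1619 aa as a sanity check on the copy.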
3D Structure
PDB ID   E-value   Query Start   Query End   Hit Start   Hit End   Description
6nnq_A   1e-14     1432          1615        46          223       Sentrin-specific protease 1
Expression -- UniGene
UniGene ID   E-value   Expressed in
Ghi.3103     0.0       ovule
Expression -- Microarray
Source   ID          E-value
GEO      109883537   0.0
Annotation -- Nucleotide
Source    Hit ID     E-value   Description
GenBank   JX589138   1e-55     JX589138.1 Gossypium hirsutum clone NBRI_GE23769 microsatellite sequence.
Annotation -- Protein
Source   Hit ID               E-value   Description
Refseq   XP_012481502.1       0.0       PREDICTED: uncharacterized protein LOC105796364
TrEMBL   A0A1U8LHD2           0.0       A0A1U8LHD2_GOSHI; uncharacterized protein LOC107927468 isoform X2
STRING   Gorai.005G031000.1   0.0       (Gossypium raimondii)
Orthologous Group
Lineage   Orthologous Group ID   Taxa Number   Gene Number
Malvids   OGEM8809420
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT2G01570.1   2e-30     GRAS family protein