PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cucsa.148430.2
Common NameCsa_7G044240, LOC101219415
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family HD-ZIP
Protein Properties Length: 759aa    MW: 82491.1 Da    PI: 6.1856
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cucsa.148430.2genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox61.61.2e-1982137156
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     +++ +++t++q++++e++F+++++p+ ++r+eL+++l+L+  qVk+WFqN+R+++k
  Cucsa.148430.2  82 KKRYHRHTQHQIQQMEAFFKECPHPDDKQRKELSRELNLEPLQVKFWFQNKRTQMK 137
                     688999***********************************************999 PP

2START223.28e-702754951206
                     HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEE CS
           START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetl 82 
                     ela +a++elv++a+ +ep+W++ +     ++n++e++++f+++ +     +s ea+ra++vv+m++  lve l+d++ qW++t++    +a+tl
  Cucsa.148430.2 275 ELAVAAMEELVRMAQMGEPLWMTGVdgstNELNEEEYVRSFPRGIGpkpsgFSCEASRATAVVIMNHISLVEMLMDVN-QWSTTFTgivsRAMTL 368
                     57899********************999999************999********************************.**************** PP

                     EEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-E CS
           START  83 evissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehv 170
                     ev+s+g      galq+m++elq++splvp R+++fvRy++q+g+g+w++vdvS+d  ++ p      R++++pSg+li++++ng+skvtwvehv
  Cucsa.148430.2 369 EVLSTGvagnynGALQVMTSELQVPSPLVPtRESYFVRYCKQHGEGTWAVVDVSLDTLRPAPA----LRCRRRPSGCLIQEMPNGYSKVTWVEHV 459
                     ************************************************************995....**************************** PP

                     E--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 171 dlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                     ++++r +h+l+++lv+sg+a+gak+w atl+rqce+
  Cucsa.148430.2 460 EVDDRGVHSLYNQLVSSGHAFGAKRWIATLDRQCER 495
                     **********************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.7E-2160137IPR009057Homeodomain-like
SuperFamilySSF466895.43E-1965138IPR009057Homeodomain-like
PROSITE profilePS5007116.53679139IPR001356Homeobox domain
SMARTSM003892.2E-1880143IPR001356Homeobox domain
CDDcd000867.88E-1982140No hitNo description
PfamPF000462.9E-1782137IPR001356Homeobox domain
PROSITE patternPS000270114137IPR017970Homeobox, conserved site
PROSITE profilePS5084845.725266498IPR002913START domain
SuperFamilySSF559615.36E-34268497No hitNo description
CDDcd088751.27E-128270494No hitNo description
SMARTSM002341.1E-65275495IPR002913START domain
PfamPF018521.7E-59276495IPR002913START domain
Gene3DG3DSA:3.30.530.208.4E-6326477IPR023393START-like domain
SuperFamilySSF559611.24E-23515750No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 759 aa     Download sequence    Send to blast
MPAGMMIPAR NTASSMIGRN GNVGIFGSPA SLVLGQGIEM GENEYGRMRE TEEFESGTKS  60
SSENHEVGSG DDQLNNQRPN KKKRYHRHTQ HQIQQMEAFF KECPHPDDKQ RKELSRELNL  120
EPLQVKFWFQ NKRTQMKTHH ERHENTQLRT ENEKLRADNM RYREALSNAT CPNCGGPTAI  180
GEMSFDEHHL RLENARLREE IDRISAIAAK YVGKPVSNYP LLSTPIPSRP LELGMGSYGG  240
HDLGLGPGGG DMFGAADLLR TISAPSEADK PVIIELAVAA MEELVRMAQM GEPLWMTGVD  300
GSTNELNEEE YVRSFPRGIG PKPSGFSCEA SRATAVVIMN HISLVEMLMD VNQWSTTFTG  360
IVSRAMTLEV LSTGVAGNYN GALQVMTSEL QVPSPLVPTR ESYFVRYCKQ HGEGTWAVVD  420
VSLDTLRPAP ALRCRRRPSG CLIQEMPNGY SKVTWVEHVE VDDRGVHSLY NQLVSSGHAF  480
GAKRWIATLD RQCERLASAM ATSIIPNGDA GVITNQEGRK SMLKLAERMV MSFCGGVSAS  540
TTHTWTTLSG TGADDVRVMT RKSVDDPGRP SGIVLSAATS FWLPLPPNRV FHFLRDENSR  600
NEWDILSNGG VVQEMAHIAN GRDTGNCVSL LRVNSANSSQ SNMLILQESS TDQTASFVIY  660
APVDIVSINV VLNGGDPDYV ALLPSGFAIL PDGSTASSGG ANGVGEHGSG GSLLTVAFQI  720
LVDSVPTAKL SLGSVATVNN LIACTVERIK ASLSCDNP*
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor. {ECO:0000250}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN6817920.0LN681792.1 Cucumis melo genomic scaffold, anchoredscaffold00034.
GenBankLN7132550.0LN713255.1 Cucumis melo genomic chromosome, chr_1.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_004144488.10.0PREDICTED: homeobox-leucine zipper protein HDG2 isoform X1
SwissprotQ94C370.0HDG2_ARATH; Homeobox-leucine zipper protein HDG2
TrEMBLA0A0A0K2660.0A0A0A0K266_CUCSA; Uncharacterized protein
STRINGXP_004144488.10.0(Cucumis sativus)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G05230.40.0homeodomain GLABROUS 2
Publications ? help Back to Top
  1. Ren Y, et al.
    An integrated genetic and cytogenetic map of the cucumber genome.
    PLoS ONE, 2009. 4(6): p. e5795
    [PMID:19495411]
  2. Guo S, et al.
    Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types.
    BMC Genomics, 2010. 11: p. 384
    [PMID:20565788]
  3. Li Z, et al.
    RNA-Seq improves annotation of protein-coding genes in the cucumber genome.
    BMC Genomics, 2011. 12: p. 540
    [PMID:22047402]