PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cucsa.204000.1
Common Name Csa_6G043490, LOC101205810
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family Trihelix
Protein Properties Length: 654 aa    MW: 73286.7 Da    pI: 6.192
Description Trihelix family protein
Gene Model
Gene Model ID   Type    Source
Cucsa.204000.1  genome  JGI
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  93.4   2.3e-29  66     150  1          87
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW++qe+laL+++r+em+ ++r+++ k+plWe++s+k+ e g++rs+k+Ckek+en+ k++k++ke +++  +++s+t+++f+qlea
  Cucsa.204000.1  66 RWPRQETLALLKIRSEMDVAFRDASVKGPLWEQISRKLGELGYHRSAKKCKEKFENVYKYHKRTKEVRSG--KPDSKTYKFFEQLEA 150
                     8********************************************************************9..58999********85 PP

2    trihelix  101.5  6.6e-32  474    559  1          87
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW+k ev+aLi++r+++e++++++  k+plWee+s++m++ g++r++k+Ckekwen+nk++kk+ke++k r +e+s+tcpyf+ql+a
  Cucsa.204000.1 474 RWPKVEVQALIKLRTNLETKYQENGPKGPLWEEISSAMKKLGYNRNAKRCKEKWENINKYFKKVKESRKTR-PEDSKTCPYFHQLDA 559
                     8********************************************************************97.99***********85 PP
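The Start and End columns above are 1-based, inclusive positions on the protein. As a minimal sketch of how a hit can be cut out of the sequence and checked against its alignment row, the helper below (`domain_slice`, a hypothetical name, not part of PlantTFDB) slices a fragment copied from the Protein Sequence section and compares it with the degapped query row of alignment 1.

```python
def domain_slice(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the sequence position of seq[0], so a fragment of the
    protein can be used instead of the full-length sequence.
    """
    last_position = offset + len(seq) - 1
    if start < offset or end < start or end > last_position:
        raise ValueError("coordinates fall outside the supplied sequence")
    return seq[start - offset : end - offset + 1]


# Residues 61-150 of Cucsa.204000.1, copied from the Protein Sequence section.
fragment_61_150 = (
    "GFGGNRWPRQETLALLKIRSEMDVAFRDASVKGPLWEQISRKLGELGYHRSAKKCKEKFE"
    "NVYKYHKRTKEVRSGKPDSKTYKFFEQLEA"
)

# Query row of alignment 1; '-' marks gap columns relative to the HMM.
aligned_row = (
    "RWPRQETLALLKIRSEMDVAFRDASVKGPLWEQISRKLGELGYHRSAKKCKEKFENVYKYHKRTKEVRSG"
    "--KPDSKTYKFFEQLEA"
)

hit1 = domain_slice(fragment_61_150, 66, 150, offset=61)
matches = hit1 == aligned_row.replace("-", "")
print(len(hit1), matches)
```

With the full-length sequence the default `offset=1` applies, and the second hit would be `domain_slice(seq, 474, 559)`.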

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SMART            SM00717           1.6E-5    63     125  IPR001005    SANT/Myb domain
Pfam             PF13837           5.2E-19   65     151  No hit       No description
PROSITE profile  PS50090           7.12      65     123  IPR017877    Myb-like domain
CDD              cd12203           9.67E-24  65     130  No hit       No description
PROSITE profile  PS50090           7.573     467    531  IPR017877    Myb-like domain
Gene3D           G3DSA:1.10.10.60  6.5E-5    467    530  IPR009057    Homeodomain-like
SMART            SM00717           0.0015    471    533  IPR001005    SANT/Myb domain
Pfam             PF13837           5.8E-22   473    560  No hit       No description
CDD              cd12203           5.18E-27  474    538  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 654 aa
MLGDSTTSVL GGGAGGDSAV PATTTHRQDG LIVDMDENNN NSGEDERGRS SGGGGDDGDR  60
GFGGNRWPRQ ETLALLKIRS EMDVAFRDAS VKGPLWEQIS RKLGELGYHR SAKKCKEKFE  120
NVYKYHKRTK EVRSGKPDSK TYKFFEQLEA LENHPPLNFH SHLSKPTPPP PLPPPPTTVI  180
SHIPSTTVPS TTTTTLPHLL NISFSQPNPT IHLPSPPPPP APLPLNNPTS LPTTVPPAVP  240
FQINVSSTGV GMGFQSIEAD LISNSTSDDV NSSTSSDEAS RRRRRKRKWK DFFERLMKEV  300
IDKQEEMQKR FLEAIEKREQ ERVVREEAWR MQEMAKINRE REILAQERSM AAAKDAAITS  360
FLQKITESQH NNNNNNPSQL SPPPPPPPPP SQQQQIPTSN PSPVVHPQQQ PQLQPQLQPP  420
PPPAPQASTL QVVVPNSTPQ KVGNNNELLQ MEIMKMDHNG GENYSISPAS SSSRWPKVEV  480
QALIKLRTNL ETKYQENGPK GPLWEEISSA MKKLGYNRNA KRCKEKWENI NKYFKKVKES  540
RKTRPEDSKT CPYFHQLDAL YREKSNNNNN MITSSTPIMQ HQQQPLMVRP EQQWPPQQEM  600
ARPDSGNEEM ESEPMDRDDK DDDDEDEEEE EEDEGGGNYE IVASKPATVS AAE*
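The sequence above is printed in blocks of ten residues with a running position counter at the end of each line and a trailing `*` for the stop. A minimal sketch of turning that layout into a plain residue string follows; `parse_numbered_sequence` is a hypothetical helper name, and the demo uses only the first two lines of the record.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Join a block-formatted protein sequence into one string.

    Everything that is not a letter is dropped: spaces between blocks,
    the trailing position counters, and the '*' stop marker.
    """
    parts = [re.sub(r"[^A-Za-z]", "", line) for line in block.splitlines()]
    return "".join(parts).upper()


# First two lines of the Cucsa.204000.1 protein sequence, as displayed.
block = """MLGDSTTSVL GGGAGGDSAV PATTTHRQDG LIVDMDENNN NSGEDERGRS SGGGGDDGDR  60
GFGGNRWPRQ ETLALLKIRS EMDVAFRDAS VKGPLWEQIS RKLGELGYHR SAKKCKEKFE  120"""

seq = parse_numbered_sequence(block)
print(len(seq), seq[65:70])  # residues 66-70 in 1-based coordinates
```

Python slices are 0-based and end-exclusive, so 1-based residues 66 to 70 correspond to `seq[65:70]`.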
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    281    287  RRRRKRK
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  LN681817  0.0      LN681817.1 Cucumis melo genomic scaffold, anchoredscaffold00015.
GenBank  LN713257  0.0      LN713257.1 Cucumis melo genomic chromosome, chr_3.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_004140891.1  0.0      PREDICTED: trihelix transcription factor GT-2-like
TrEMBL  A0A0A0KCE1      0.0      A0A0A0KCE1_CUCSA; Uncharacterized protein
STRING  XP_004140891.1  0.0      (Cucumis sativus)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF3733              34           181
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76880.1  7e-56    Trihelix family protein
Publications
  1. Ren Y, et al.
    An integrated genetic and cytogenetic map of the cucumber genome.
    PLoS ONE, 2009. 4(6): p. e5795
    [PMID:19495411]
  2. Guo S, et al.
    Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types.
    BMC Genomics, 2010. 11: p. 384
    [PMID:20565788]
  3. Li Z, et al.
    RNA-Seq improves annotation of protein-coding genes in the cucumber genome.
    BMC Genomics, 2011. 12: p. 540
    [PMID:22047402]