PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cucsa.193210.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family Trihelix
Protein Properties Length: 715aa    MW: 78897.5 Da    PI: 8.9916
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cucsa.193210.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix963.4e-3097181187
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW++qe+laL+++r+em++ +r++ lk+plW+evs+k+ e g++r++k+Ckek+en++k+yk++keg+++r  ++ +t+++f+qlea
  Cucsa.193210.1  97 RWPRQETLALLKIRSEMDSVFRDATLKGPLWDEVSRKLGEMGYKRNAKKCKEKFENVQKYYKRTKEGRGGR--QDGKTYKFFTQLEA 181
                     8********************************************************************98..56667*******85 PP

2trihelix104.67.4e-33491576187
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW+kqevlaLi++r  +e++++++  k+plWee+s+ m + g++rs+k+Ckekwen+nk++kk+ke++kkr  e+s+tcpyf++l+a
  Cucsa.193210.1 491 RWPKQEVLALIKLRGGLESKYQETGPKGPLWEEISAGMIKMGYKRSSKRCKEKWENINKYFKKVKESNKKR-REDSKTCPYFNELDA 576
                     8*********************************************************************8.78999*******985 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.03994156IPR001005SANT/Myb domain
PROSITE profilePS500906.65596154IPR017877Myb-like domain
PfamPF138374.5E-2096182No hitNo description
CDDcd122032.50E-2896161No hitNo description
PROSITE profilePS500906.992484548IPR017877Myb-like domain
SMARTSM007170.09488550IPR001005SANT/Myb domain
PfamPF138371.7E-22490577No hitNo description
CDDcd122032.29E-30490555No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 715 aa     Download sequence    Send to blast
MEPPAPGSGS GLPDPSQLFR VVSTPPPPPS LTVDTTISDS QQVEAASPIS SRPPAVPSSY  60
EELIRLSGGG GGGQMVVDDD EADGRSGGGS GGSGGNRWPR QETLALLKIR SEMDSVFRDA  120
TLKGPLWDEV SRKLGEMGYK RNAKKCKEKF ENVQKYYKRT KEGRGGRQDG KTYKFFTQLE  180
ALHNANVAPS SSNSSFTLPH PLPAAAAATT TVGFGISNPT PISSVKISSS SSQTQMGIFS  240
TPSDHFTIRP PPAVAAPMGV SFSSNTSSAS TEDDDDDNED EEMGFDVDLE GEPENVAGSS  300
RKRRRGVIKG NNDEWRSSSS SGDHKMMMEF FEGLMKQVME KQEVMQQKFL EAIEKREQDR  360
MVREENWKKE EMFRLSQEQE RMAQERTISA SRDAAIIAFL QKFTGQTIQF SAPAPAPQVP  420
LPVPMAVSVP MPTPVPAPLS PVSSHQPMQP QTLPHLQNQP PSNTIPLEQS KPKFQENSQG  480
GDGSSEPISS RWPKQEVLAL IKLRGGLESK YQETGPKGPL WEEISAGMIK MGYKRSSKRC  540
KEKWENINKY FKKVKESNKK RREDSKTCPY FNELDALYRK KILSTTAAAT ASDHSGSFEQ  600
NPIQNMEIIP PSTTTTTDHH LQSQPHSSSI PQGLSATLFG EGTEEQQQQP TSTKVKLVNP  660
LNKTIILITT LFINSDYCFV FEARRHCERV DGAAGRRRLS TSSQPRRWQR RKQR*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1139147KRNAKKCKE
2300304RKRRR
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN6818950.0LN681895.1 Cucumis melo genomic scaffold, anchoredscaffold00005.
GenBankLN7132630.0LN713263.1 Cucumis melo genomic chromosome, chr_9.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_011654663.10.0PREDICTED: trihelix transcription factor GTL1 isoform X2
TrEMBLA0A1S3AU590.0A0A1S3AU59_CUCME; trihelix transcription factor GTL1-like isoform X1
STRINGXP_004145967.10.0(Cucumis sativus)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF37334181
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.22e-61Trihelix family protein