PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cla004965
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Citrullus
Family Trihelix
Protein Properties Length: 696aa    MW: 77000.7 Da    PI: 4.7838
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cla004965genomeICuGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix97.71e-3091175187
   trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                rW++qe+laL+++r+em++++r++ lk+plW+evs+k+ e g++r++k+Ckek+en++k+yk++keg+++r  ++ +t+++f+qlea
  Cla004965  91 RWPRQETLALLKIRSEMDSAFRDATLKGPLWDEVSRKLGEMGYKRNAKKCKEKFENVQKYYKRTKEGRGGR--QDGKTYKFFTQLEA 175
                8********************************************************************98..56667*******85 PP

2trihelix108.54.5e-34468553187
   trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                rW+kqevlaLi++r  +e+r++++  k+plWee+s+ m++ g++rs+k+Ckekwen+nk++kk+ke++kkr  e+s+tcpyf++l+a
  Cla004965 468 RWPKQEVLALIKLRGGLESRYQETGPKGPLWEEISAGMARMGYKRSAKRCKEKWENINKYFKKVKESNKKR-REDSKTCPYFNELDA 553
                8*********************************************************************8.78999*******985 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.002788150IPR001005SANT/Myb domain
PfamPF138371.8E-2090176No hitNo description
CDDcd122039.40E-2990155No hitNo description
PROSITE profilePS500906.65590148IPR017877Myb-like domain
PROSITE profilePS500906.922461525IPR017877Myb-like domain
SMARTSM007170.0084465527IPR001005SANT/Myb domain
CDDcd122032.60E-32467532No hitNo description
PfamPF138376.1E-24467554No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 696 aa     Download sequence    Send to blast
MEEAAGGSGS GLPDPSQLFG VSTPPPPLTV DAAMSDSQQV EAASPISSRP PAVPSSLNYE  60
ELIRLGGGGG GQMVVDDEET DRSGGGSGGN RWPRQETLAL LKIRSEMDSA FRDATLKGPL  120
WDEVSRKLGE MGYKRNAKKC KEKFENVQKY YKRTKEGRGG RQDGKTYKFF TQLEALHNAN  180
VAPSSSSSFT LPQPTTVGFG ISNPTPISSV KISTSSSQNP MGIFSPPPDH FTVRPPPTVA  240
APMGVSFSSN TSSASTEDDD NDEDEEMGFD VDLEGLPENV AGSSRKRRRG VVKGNESRTH  300
KMMMEFFEGL MKQVMEKQEV MQQKFLEAIE KREQDRMVRE ENWKRQEMAR LSQEQERMAQ  360
ERTISASRDA AIIAFLQKFT GQTIQFSAPQ QQQQFDVAPV PVAVPVSVAV SVPMPMPVPA  420
PLSPVPSQPL QPQTLPQPPS NTIPLDQPMF QEISQGGGDG SSEPISSRWP KQEVLALIKL  480
RGGLESRYQE TGPKGPLWEE ISAGMARMGY KRSAKRCKEK WENINKYFKK VKESNKKRRE  540
DSKTCPYFNE LDALYRKKIL SASASDSGSF SDTNKFEQNP TTTTTTTTDL QPQPHSSIPQ  600
GLSATLFGEG TEEQPSSTKP EDIVNELMEL QDDIYQRHLD QDDDDENDDY CSDDDDDLPE  660
GKRNSNIDYK IEFQRRNNVG NSNGVASEFQ SMAVVQ
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1133141KRNAKKCKE
2284288RKRRR
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00244DAPTransfer from AT1G76890Download
Motif logo
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN6818951e-137LN681895.1 Cucumis melo genomic scaffold, anchoredscaffold00005.
GenBankLN7132631e-137LN713263.1 Cucumis melo genomic chromosome, chr_9.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_011654662.10.0PREDICTED: trihelix transcription factor GTL1 isoform X1
TrEMBLA0A1S3AU590.0A0A1S3AU59_CUCME; trihelix transcription factor GTL1-like isoform X1
STRINGXP_004145967.10.0(Cucumis sativus)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF37334181
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.23e-63Trihelix family protein