PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cla003748
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Citrullus
Family Trihelix
Protein Properties Length: 623aa    MW: 72128.6 Da    PI: 6.4223
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cla003748genomeICuGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix43.58.1e-14129190167
   trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikeg 67 
                +W+++e laL+++r+++++ + ++      We+vs+k+ e gf+r++++Ckek+e+ ++++  i+ +
  Cla003748 129 EWSNDELLALLRIRSNIDNCFLES-----TWEHVSRKLGEVGFRRTAEKCKEKFEEESRYFNHINYN 190
                5*******************9999.....9*******************************988755 PP

2trihelix941.4e-29498587186
   trihelix   1 rWtkqevlaLiearremeerlrr.....gklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                rW+++evlaL+++r ++ ++        g+lk+plWe++s+ m + g++rs+k+Ckekwen+nk+++k+k+ +kkr s +s+tcpyf+ql 
  Cla003748 498 RWPRDEVLALVNVRCNLYNNGDGsgeqgGSLKAPLWERISQGMLQLGYKRSAKRCKEKWENINKYFRKTKDANKKR-SLDSRTCPYFHQLS 587
                8**************887777642223358*********************************************8.9999********95 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500907.015122181IPR017877Myb-like domain
SMARTSM007174.3126183IPR001005SANT/Myb domain
PfamPF138372.8E-8129189No hitNo description
SMARTSM007171.2335562IPR001005SANT/Myb domain
PfamPF138371.5E-18497587No hitNo description
CDDcd122036.31E-23497567No hitNo description
PROSITE profilePS500906.342498560IPR017877Myb-like domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0045893Biological Processpositive regulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0001158Molecular Functionenhancer sequence-specific DNA binding
GO:0005516Molecular Functioncalmodulin binding
Sequence ? help Back to Top
Protein Sequence    Length: 623 aa     Download sequence    Send to blast
MFEGSVSEQL HQFLTPRTTT PPPNSNSLPL IPLNFALHSP NFNFHPFDSY NATSTTHHHH  60
HQIHLHHPHH LLHHQSPNPH ENGDEKDDDE TTTTTGNNLQ VAMDLEVGRE NNNNNNRSIL  120
MEDHIHHGEW SNDELLALLR IRSNIDNCFL ESTWEHVSRK LGEVGFRRTA EKCKEKFEEE  180
SRYFNHINYN KTCRFLTHEL NYHHHQHDQD HLLLIHDGNG KPDDGGVTVV VVPEEGKEEN  240
EANFKDRDGE LQEEEEEEED LRKEETRAAT NEEQDESSRS RSCKKKKRKM MRQKEFELLK  300
GYCEEIVKKM MIQQEEIHSK LLQDMLKREE EKVAKEECWK KQQMEKLHKE LEVMAHEQAI  360
ASDRQATIIE ILNQITNSTT LFSSFQSQKE LQNLLQSLNN YNNNNVPNSP SSSSLIQTQT  420
SSPNKEQQQQ QQEGALPPHE NSSSFTSQND PIKNPKNPCL STQILAPQDP NSFINHSKLP  480
PNPKSRDHKD ELDELGKRWP RDEVLALVNV RCNLYNNGDG SGEQGGSLKA PLWERISQGM  540
LQLGYKRSAK RCKEKWENIN KYFRKTKDAN KKRSLDSRTC PYFHQLSTLY NQGRAANKHP  600
ENCPIVSPEN HSDQSENHLA TSS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1283288KKKKRK
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00011PBMTransfer from AT5G28300Download
Motif logo
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN6818110.0LN681811.1 Cucumis melo genomic scaffold, anchoredscaffold00069.
GenBankLN7132560.0LN713256.1 Cucumis melo genomic chromosome, chr_2.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_008462720.10.0PREDICTED: trihelix transcription factor GTL2 isoform X1
TrEMBLA0A1S3CHL00.0A0A1S3CHL0_CUCME; trihelix transcription factor GTL2 isoform X1
STRINGXP_008462720.10.0(Cucumis melo)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF50433355
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G28300.12e-43Trihelix family protein