PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopim08g005900.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family Trihelix
Protein Properties Length: 327aa    MW: 37790 Da    PI: 9.0479
Description Trihelix family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Sopim08g005900.0.1  genome  CSHL    View CDS
Signature Domain
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  64.8   1.8e-20  53     135  2          86
            trihelix   2 WtkqevlaLiearremeerlrrgk..lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                         W+ qe+++ i++r e+e +++++k    k lWe v++++ erg++rs++qCk+kw+ l +ryk  ++++ ++      ++pyfd+l+
  Sopim08g005900.0.1  53 WSTQETRDFIAIRGELEMEFSSAKrsNLKSLWEIVAARINERGYRRSAEQCKSKWKTLINRYKGKETSDPDN----GGQFPYFDELH 135
                         ********************986522669************************************9999984....4469*****98 PP

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50090   7.631     45     111  IPR017877    Myb-like domain
SMART            SM00717   0.0034    49     113  IPR001005    SANT/Myb domain
Pfam             PF13837   9.9E-21   52     137  No hit       No description
CDD              cd12203   6.68E-19  53     118  No hit       No description
Sequence
Protein Sequence    Length: 327 aa
MFGGSNEKLG PFTQRLMLPQ HPLHLLPGAG GSSGSGLGGD DEFPKRDERV PPWSTQETRD  60
FIAIRGELEM EFSSAKRSNL KSLWEIVAAR INERGYRRSA EQCKSKWKTL INRYKGKETS  120
DPDNGGQFPY FDELHALFSS NMNNVHRVPI ESEAGSQQAR KRPRGTSRDR SSEEISEDEE  180
GYACESDEVK LGRSNVAPKK KPEKEKRPRT SNAEKASRQA SFGSSINNNT GRAVDNIQEM  240
LKEFFQHQLR IEMQWRETTE KRAREREAFE QEWRSSMEKL ERDRMMIEQS WREREEQRRT  300
REETRAEKRD ALLTTLLNKL IGENHP*
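
The sequence block above uses a common flat-file layout: ten-residue groups, a running position count at the end of each full line, and a trailing `*` marking the stop codon. The following is a minimal Python sketch, assuming that layout, for turning the block into a plain string. The function name `parse_sequence_block` is hypothetical and not part of any PlantTFDB tool. Under this parsing the block yields 326 residue letters plus the stop marker, so the reported length of 327 appears to count the `*` as well; that reading is an inference from this record, not documented database behavior.

```python
import re

# Sequence block copied verbatim from this record (groups of ten,
# running position count at line end, '*' as the stop marker).
RAW_BLOCK = """\
MFGGSNEKLG PFTQRLMLPQ HPLHLLPGAG GSSGSGLGGD DEFPKRDERV PPWSTQETRD  60
FIAIRGELEM EFSSAKRSNL KSLWEIVAAR INERGYRRSA EQCKSKWKTL INRYKGKETS  120
DPDNGGQFPY FDELHALFSS NMNNVHRVPI ESEAGSQQAR KRPRGTSRDR SSEEISEDEE  180
GYACESDEVK LGRSNVAPKK KPEKEKRPRT SNAEKASRQA SFGSSINNNT GRAVDNIQEM  240
LKEFFQHQLR IEMQWRETTE KRAREREAFE QEWRSSMEKL ERDRMMIEQS WREREEQRRT  300
REETRAEKRD ALLTTLLNKL IGENHP*
"""


def parse_sequence_block(block):
    """Return (residues, has_stop) for a grouped, position-numbered block.

    Hypothetical helper: drops spaces, digits and the stop marker, keeping
    only the amino-acid letters.
    """
    has_stop = "*" in block
    residues = re.sub(r"[^A-Za-z]", "", block).upper()
    return residues, has_stop


protein, has_stop = parse_sequence_block(RAW_BLOCK)
print(len(protein), has_stop)
```

Keeping the stop marker as a separate flag, rather than as part of the string, avoids passing `*` to downstream tools that expect only residue letters.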
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    158    164  ARKRPRG
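
When reusing the coordinates in this record, it helps to check which indexing convention each table follows. The sketch below, in Python, locates the NLS motif in the first 180 residues of the protein sequence and compares the result with the listed Start and End values. In this record the motif sits at 0-based offset 158 (residues 159 to 165 when counting from 1), so the NLS table appears to use 0-based positions, while the signature-domain coordinates 53 to 135 match the alignment when read as 1-based. This is an observation from this single entry, not documented database behavior, and the variable names are illustrative only.

```python
# First three lines (180 residues) of the protein sequence in this record,
# copied with their ten-residue grouping and then joined.
PREFIX_LINES = [
    "MFGGSNEKLG PFTQRLMLPQ HPLHLLPGAG GSSGSGLGGD DEFPKRDERV PPWSTQETRD",
    "FIAIRGELEM EFSSAKRSNL KSLWEIVAAR INERGYRRSA EQCKSKWKTL INRYKGKETS",
    "DPDNGGQFPY FDELHALFSS NMNNVHRVPI ESEAGSQQAR KRPRGTSRDR SSEEISEDEE",
]
protein_prefix = "".join(PREFIX_LINES).replace(" ", "")

NLS_MOTIF = "ARKRPRG"  # sequence listed in the NLS table

# str.find returns the 0-based index of the first match, or -1 if absent.
offset = protein_prefix.find(NLS_MOTIF)
start_1based = offset + 1
end_1based = offset + len(NLS_MOTIF)
print(offset, start_1based, end_1based)

# Signature domain: Start=53, End=135, read here as 1-based inclusive,
# which maps to the Python slice [52:135].
domain = protein_prefix[53 - 1:135]
print(domain[:8], domain[-8:], len(domain))
```

If the NLS coordinates are needed alongside the domain coordinates, converting both to one convention first avoids off-by-one errors when slicing.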
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975520  0.0      HG975520.1 Solanum lycopersicum chromosome ch08, complete genome.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_004244448.1      0.0      trihelix transcription factor GT-3b
TrEMBL  A0A3Q7HJV1          0.0      A0A3Q7HJV1_SOLLC; Uncharacterized protein
STRING  Solyc08g005900.2.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA10133             19           23
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G01380.1  1e-28    Trihelix family protein