PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopen12g030120.2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family bHLH
Protein Properties Length: 1331aa    MW: 150749 Da    PI: 8.7751
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopen12g030120.2genomespennView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH28.23.4e-0910941136754
                        HHHHHHHHHHHHHHHHHHCTSCC.C...TTS-STCHHHHHHHHHHHHHH CS
               HLH    7 erErrRRdriNsafeeLrellPk.askapskKlsKaeiLekAveYIksL 54  
                        ++ErrRR+r+N +++ Lr+++Pk +      K++  +iL  +++Y k+L
  Sopen12g030120.2 1094 MAERRRRKRLNGRLSMLRSIVPKiS------KMDRTSILGDTIDYTKEL 1136
                        79*********************66......**************8876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF142446.2E-153279IPR029472Gag-polypeptide of LTR copia-type
PfamPF037321.2E-7115170IPR005162Retrotransposon gag domain
PfamPF077271.0E-63647837IPR013103Reverse transcriptase, RNA-dependent DNA polymerase
SuperFamilySSF566721.27E-29669816No hitNo description
SuperFamilySSF566721.27E-298511033No hitNo description
CDDcd092721.18E-579271037No hitNo description
PROSITE profilePS5088814.40510871136IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SuperFamilySSF474591.31E-1310921157IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SMARTSM003531.7E-1010931142IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
Gene3DG3DSA:4.10.280.103.6E-1310941149IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
PfamPF000107.5E-710941136IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SuperFamilySSF566727.12E-4911991331No hitNo description
Gene3DG3DSA:3.10.10.105.0E-3012161304No hitNo description
CDDcd016471.59E-4112481331No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0046983Molecular Functionprotein dimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 1331 aa     Download sequence    Send to blast
MVSEVIASSS SSVSADTLEG IEKNRNHPLY LHPSDTPGSV LTSIQLTGTE NYSLWSRSLM  60
INLRAKSKVG FVLGSCRKSD YKPELGEQWE KCNAFVLAWI MNTVSKELLS GIVYASDAAM  120
VWADLNERFN KVDGSRSYQL HREICTLHQG NLTVSAYFTK LRLLWDEFDA LVPPPTCDCD  180
KSKMYIDHLQ FLRLFAFLVG LNEVYSHARS QILMMNPLPS VNKAYVMITS DESQRMTAVN  240
RVGGDIHESM ALYSKKIDDS VMQYNKGKNV SGSYDPMAIL IGYPPDWKFK KKFGPGQDVQ  300
NAGKANHAYV DNQIKDEFES VQGLNNSSGS VEQAAKMGQN GHENQLTTQP TFTPNQYNKI  360
LQMINKEDSQ DYMANTAVTY KLLSVYNTKD KRKDNEDWII DSGATRHMTS KETKLDYSVS  420
VKDLDKQDLS NGKVKGIGKE RNGLYYLPSQ IPKRDAHDEQ NTLISNMVFN SEGFKWHSRL  480
GHPSMKPDLS HLRIIGYLCY ATKLVKDDKF STRADACVFM GYSSTKKGYV VYSLRHQKML  540
VSRDVVFKED IFPFARKEMD KDVPLFNPQH VTTDSQSMED VITVQEPAIN DDTLEFTETN  600
NPNNSEEIEL EDDLVPNIPV VAPVATRHSE RSRWKQWNKR LKHLRIMETF APVVKMVTVR  660
SIIALASIEV WPIFQMDVFN AFLQGDLYEE VYMELPKGFK GAEQNCVCKL VKSLYSLKQA  720
SRQWNAKLTE ALCKSGYTQS LLDYSLFTKR SDAGMVIVLV YVDDLLITGS DPVLVKATKQ  780
VLHSHFKMKD LGELKYFLGI EFCRSEKGIV MNQRKYALEL ISEAGLAGAQ PVFTPLECNV  840
KLTSVAYNTS DADPLFLDIS RYQRMIGKLL YLTNTRPDVA FAVQNLSQFM QQPKHSHWNA  900
ALRVIKYIKG SPGLGLLMSS HKDTKLTGFC DADWAACLST RRSVTGYLLK FGDSLISWKS  960
KKQNTVSRSS AEAEYRSLAT LTAEVVWVTS LFKELCVKLE SSTIIHCDSK AALQIAANPV  1020
FHERTKHIEI DCHFIHDFSS IYFDVGNFGD CKLERTQSTA SSAAIAPANF NIGACLDKKS  1080
KKKKVNGEPS KNLMAERRRR KRLNGRLSML RSIVPKISKM DRTSILGDTI DYTKELLEKI  1140
NNLQQEMELG SNQLSLMSIF KNEKPIEMFV RNSPKEMEQR GVTNAEEIKQ ALFKNAGYVG  1200
KCLPPDRDID FCIDVEPGTR PISIPPYRMA PAELKELKEQ LQDLLSKGFI RPSVSPWGAP  1260
VLFVKKKDGS MRMCIDYRQL NKVTIRNKYP IPRIDDLFDQ LQGASVFSKI DLRSGYHQLK  1320
VRAEDIPKTA F
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4ol8_A2e-261213133155173Reverse transcriptase/ribonuclease H
4ol8_B2e-261213133155173Reverse transcriptase/ribonuclease H
4ol8_E2e-261213133155173Reverse transcriptase/ribonuclease H
4ol8_F2e-261213133155173Reverse transcriptase/ribonuclease H
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1747754TKRSDAGM
210811101KKKVNGEPSKNLMAERRRRKR
310951102ERRRRKRL
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754510.0HG975451.1 Solanum pennellii chromosome ch12, complete genome.
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G65640.11e-30beta HLH protein 93