PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopen00g003940.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family B3
Protein Properties Length: 1242aa    MW: 141765 Da    PI: 6.0018
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopen00g003940.1genomespennView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B338.12.8e-125643699
                      EEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                B3 36 ltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99
                      + le ++g+ Wev++  ++++g+++l+kGW++F+++  + ++  ++F+++ rs f  vv+++++
  Sopen00g003940.1  5 VFLEAPHGKAWEVEV--ENSQGQIWLAKGWSDFCDDYSISVKSLLMFTYNPRSHF--VVSIYDQ 64
                      67999**********..********************************988888..9999986 PP

2B342.41.2e-131872581086
                       HHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-S CS
                B3  10 vlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldg 86 
                       +++++ + +p +fa++ ++++ +   ++l +e+g +W+v++  +  ++++++++GW  F k+n++  g+++ Fkl++
  Sopen00g003940.1 187 SHATC-MAIPLRFAQQTDIINMK--NMRLVNEEGVEWKVEI--EYARSMVIIKEGWTAFRKDNKIANGETCRFKLIR 258
                       34445.99******999888554..7***************..888899*************************987 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5086312.337165IPR003340B3 DNA binding domain
PfamPF023623.1E-9564IPR003340B3 DNA binding domain
SuperFamilySSF1019365.3E-12572IPR015300DNA-binding pseudobarrel domain
CDDcd100171.20E-9563No hitNo description
Gene3DG3DSA:2.40.330.101.2E-11565IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019366.47E-15175265IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.106.7E-15175265IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086311.491177271IPR003340B3 DNA binding domain
CDDcd100174.57E-10178260No hitNo description
SMARTSM010191.5E-6180271IPR003340B3 DNA binding domain
PfamPF023623.1E-11190258IPR003340B3 DNA binding domain
SuperFamilySSF530983.03E-229721108IPR012337Ribonuclease H-like domain
Gene3DG3DSA:3.30.420.102.0E-219721106IPR012337Ribonuclease H-like domain
PROSITE profilePS5099410.9289961120IPR001584Integrase, catalytic core
PfamPF006651.6E-89971056IPR001584Integrase, catalytic core
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0015074Biological ProcessDNA integration
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1242 aa     Download sequence    Send to blast
MFNPVFLEAP HGKAWEVEVE NSQGQIWLAK GWSDFCDDYS ISVKSLLMFT YNPRSHFVVS  60
IYDQSTTEIE YPIDHDIESD EEEEDILVAQ ANANVIDEDI LILQSNANVI EEDIPILQSN  120
ANVIEDEEED IPVNFPQTSA NVIDQHKEVG EANSISEKVG PNNYSSRYSL VDLTGDNPFF  180
EMVIKKSHAT CMAIPLRFAQ QTDIINMKNM RLVNEEGVEW KVEIEYARSM VIIKEGWTAF  240
RKDNKIANGE TCRFKLIRGP IANVLQIPSP YRDVALNCAN VQSIYPAPLP VYQDQAPLYQ  300
NPHPNCEAPM PNYQKNSYSR NQIPRLNNRG YQQMPPPQGN YDSTRPRFEK KPSRNFTVLA  360
KSRTKLYERL AAAGYIHPVW PKSVDINSKF YRPDQRCAYH SNSVGHDTEE CINLKHKIQD  420
LIDQEVVSLK PAAPNVNTNP LPNNGGGNVS MIENDDDWCG TKVITPIIHD EVEKVVASFS  480
IKENKEFVIL KPAKVVALVP SKTLIKPKFM IETAAAQGMT RSGRCYTPEE TALGVNVRDF  540
GEVQRDTLGA VNLVIQLSPA EFNAQFQVVD IDTSYNLLLG RPFIHMAGVV PSTVHQMMKL  600
IWKHEELVIH GKGSHSGRQA PIIDEVSRVT DFYTVELEFD AVIEEKVELS GDDKVDDYEE  660
ESEEPEYVAE EFLQFKNQHK LNLEETEAIN LGDQECVKEV NISLHLNEAQ RKGLIHLLIE  720
YIDMFVWEKA IKSHALVNHL AENPIDEEYE PLKTYFHDEE VSFVGEDISE AYPGWRLFFD  780
GEANHQGKGI RVVLVSESGQ HYPMAAKLRF NFMNNMAESK ACILGLKMAI DMSVYELLGT  840
RRFALISRYK EVFRVRKLSK RCNTQQKEVD TPYGSQFLLS GEVLYRRTPD LGLLRCVDVV  900
EAVKLIEHIH AGVCGMQMNG LTLARKIIRA GYFWMTMEND CCKFVQKCHK CQVHGDLIRV  960
PPHKHNAMSS PWPLVAWDMD VIGPIEPAAS NGHRFILVSI DYFTKFRVPE LIITDNDANL  1020
NSYLMRDICE QFKITHRNSA AYCPQMNGVV EAANKNNKKI LRKMINNHRG WYEMLPYDLL  1080
GYRTTTKTSI RATLYLLVYR TKAVIPVEVE IPSLRIIQEA KLSNAEWVSK RIDQLTLIDE  1140
KRMAAVCHGQ LYRQRMILVF HKRVRARIFK VGQLVLKRIF PHQDEYKGKF APNWQGPYMV  1200
RKVLSGDALV ISEMDGIACQ STQMLSRDTM CEVSVCISVI YL
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754420.0HG975442.1 Solanum pennellii chromosome ch03, complete genome.
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA7499524
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.13e-10B3 family protein