PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopen09g021360.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 2093 aa    MW: 236148 Da    pI: 8.3402
Description HD-ZIP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Sopen09g021360.1  genome  spenn   View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End   HMM Start  HMM End
1    Homeobox  43.5   5.3e-14  1550   1588  18         56
                        HHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   18 lFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56  
                        +F+k+++p++++ ++LA++ gL+ +qVk+WFqNrRa+ k
  Sopen09g021360.1 1550 FFKKCPHPDEDQQKQLASEAGLDHKQVKFWFQNRRAQAK 1588
                        6889********************************998 PP

2    START     57.9   3.4e-19  1615   1729  89         206
                        EEEEEEEEXXTTXX-SSXEEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE-.-SSXX CS
             START   89 galqlmvaelqalsplvpRdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRaellpSgiliepksnghskvtwvehvdl.kgrlp 177 
                        ++l l+++++     + +R+f f+R +rql a +w+ vd+S d  ++  +  +s+  + ++pSg+ i++++ng skvtwvehv + +++++
  Sopen09g021360.1 1615 STLGLYSNSSDG---VEAREFFFIRGCRQLDATTWIMVDISYDIFNDIHSgVPSY--CWKFPSGCAIQDMGNGQSKVTWVEHVQVyEKYQV 1700
                        566666666666...899***********************99988877534444..67************************97256778 PP

                        HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
             START  178 hwllrslvksglaegaktwvatlqrqcek 206 
                          ++r l+  +   gak+w  tlqr ce+
  Sopen09g021360.1 1701 NHIFRDLLCDRESYGAKRWIVTLQRMCER 1729
                        ***************************97 PP

Protein Features
Database         Entry ID           E-value    Start  End   InterPro ID  Description
Pfam             PF03732            2.4E-15    204    300   IPR005162    Retrotransposon gag domain
Pfam             PF08284            2.3E-6     500    539   IPR013242    Retroviral aspartyl protease
SuperFamily      SSF56672           9.01E-167  593    1022  No hit       No description
Gene3D           G3DSA:3.10.10.10   2.4E-26    620    709   No hit       No description
PROSITE profile  PS50878            12.627     650    829   IPR000477    Reverse transcriptase domain
CDD              cd01647            3.52E-89   653    829   No hit       No description
Pfam             PF00078            2.9E-29    669    828   IPR000477    Reverse transcriptase domain
Gene3D           G3DSA:3.30.70.270  5.2E-8     732    832   No hit       No description
CDD              cd09274            5.49E-57   923    1038  No hit       No description
Gene3D           G3DSA:3.30.420.10  5.5E-31    1201   1358  IPR012337    Ribonuclease H-like domain
SuperFamily      SSF53098           3.04E-43   1202   1361  IPR012337    Ribonuclease H-like domain
PROSITE profile  PS50994            20.78      1202   1365  IPR001584    Integrase, catalytic core
Pfam             PF00665            6.0E-13    1211   1322  IPR001584    Integrase, catalytic core
Pfam             PF00385            7.6E-6     1479   1525  IPR023780    Chromo domain
SMART            SM00389            1.1E-6     1531   1594  IPR001356    Homeobox domain
Gene3D           G3DSA:1.10.10.60   4.8E-12    1543   1591  IPR009057    Homeodomain-like
CDD              cd00086            3.24E-10   1550   1588  No hit       No description
Pfam             PF00046            1.9E-11    1550   1588  IPR001356    Homeobox domain
SuperFamily      SSF46689           3.29E-11   1550   1591  IPR009057    Homeodomain-like
PROSITE profile  PS50071            13.394     1551   1590  IPR001356    Homeobox domain
PROSITE pattern  PS00027            0          1565   1588  IPR017970    Homeobox, conserved site
Pfam             PF01852            3.0E-14    1612   1729  IPR002913    START domain
SuperFamily      SSF55961           4.67E-10   1627   1729  No hit       No description
PROSITE profile  PS50848            18.698     1627   1732  IPR002913    START domain
SuperFamily      SSF55961           1.65E-10   1751   1924  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0015074  Biological Process  DNA integration
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 2093 aa
MVRTRATTVP TPTPARQGAC EPTIGAVTRG GAVARGRGRG RRRTPSRGRE QTPDSTSNRA  60
VTPPPTDEVV REGEEGENEQ VQDEELPPQP TPEMINQVLT YLSGLSDQGQ TPLVFSAPAP  120
QVQGVQHAAA VAPCMDASLE VGTFPRLTTG SIMTSDQHEL FTRFLKLKPT VFKGAESEDA  180
YDFLVDFHEL LHKMDIVERF CVEFVIYQFQ GDAKMWWQSY VECQPAQAPP MTWASFSSLF  240
MEKYIPRTLR DRRRDEFLSL EQGRMSVAAY EAKYRALSRY ATQLCFSTQE RIRRFVKGLR  300
SDLQIPALQV AAAAKSFQKV VDFVIEVEGV KQDDFTMAST SKKFRKGGEF SGSYSRAQSS  360
GGYPTRPIRS SLQALAGGPS QPSQPFSEFG GYPQTSSFSQ GPMLDSRNYY GCGEARHIRK  420
YCPKESYRPP IVRGRGGHGR GRHSGGRGGQ GNGGHQISRG GGQAGTSAAQ HGRGNGQTGD  480
RAHCYAFPGR SEAETSDAVI TDLIILEKVD FDVILGMTWL SPNFAILDCN AKTVTLAKPG  540
TGPLVWEGDY ISTPVHIISF LPAKRMVSKG CLALLAHLRD DTSQVPSIES VSIVREFLDV  600
FPTDLPGMPP DWDIDFCIDL EPGTRPISIF PYRMAQAELR DLKAQLQELL GKGFIRPSAS  660
PWGASVLFVK KKDGSFRMCI DYRHLNKVTI KNKYLIPRID DLFDQLQGAC VFSKIDLRSG  720
YHQLKIRAAD VPKTAFRTRY GHYEFLVMSF GLTNAPAAFM SLMNGIFKPY LDLFVIVFID  780
DILIYSKSRK EHEEHLRIVL ELLREKRLYA KFSKCEFWLD SVSFLGHVVS KDGVMVDPSK  840
IEAVKSWVRP TNVTEVRSFV GLASYYRRFV KGFSSIASQL TNLTKQNVPF VWSDECEESF  900
QKLKTLLTTA PILTLPVEGK NFIVYCDASY SGLGAVLMQE RNVIAYASRQ LKVHERNYPT  960
HDLELAAVVF ALKQWRHYLY GVKCEVYTDH RSLQYVFTQK DLNLRQRRWM ELLKDYDITI  1020
LYNPGKANVV ADALSRKAGS MGSLAHLQVS RRPLAREVQT LANDLMRLEV LEKGGLLACV  1080
EARSSFLDKI KGKQFADEKL SRIRDMVLRG EAKEAIIDEE GVLRIKGRVC VPRVDDLIHT  1140
ILTEAHSSRY SIHPGATKMY RDLKQHFWWS RMKRDIVDFV AQCPNCKQVK YEHQRPGGTL  1200
QRMPIPEWKW ERIAMDFVVG LPKTLGKFDS IWVIVDRLTK SAHFIPVKMT YNAEKLAKLY  1260
ISEIVRLHGV PLSIISDRGT QFTSKFWRTL HAELGTRLDL STAFHPQTDG QSERTIQVLE  1320
DMLRACMIEF GGHWDKFLPL AEFSYNNSYH SSIDMAPFEA LYRRRCRSPI GWFDAFEVRP  1380
WGTDLLRESL DKVKFIQEKI LAAQSRQKEY ADQKVRDLNF MEGEQVLLKV SPMKGVMRFG  1440
ERGKLSPRYI GPFEVLKLVG EVAYELALPP GLSRVHPEEP VAILDREVRK LRSKEIESIK  1500
VQWKNRPVEA STWESEVDMQ ERYPHPFTDS GTLSRPCPSS CDRSRMNDGF FKKCPHPDED  1560
QQKQLASEAG LDHKQVKFWF QNRRAQAKDE KVSNLISHVF GRPFVMDSIL SPQISTLGLY  1620
SNSSDGVEAR EFFFIRGCRQ LDATTWIMVD ISYDIFNDIH SGVPSYCWKF PSGCAIQDMG  1680
NGQSKVTWVE HVQVYEKYQV NHIFRDLLCD RESYGAKRWI VTLQRMCERF NFHMGSTYPK  1740
RHDSKGVFHY PEGQKNTIQV SQRMVKSFFE ILSMTDNHGD FSISPQLNRG DRISVRKNEE  1800
TIQPKGFIAI ATTSLWLPLS FQDVFSFFND AKTRNQWDIL TGGNNVIELD RVPTGTFPGN  1860
NITIIQPYNM HKEMLLLEET SIDEMGAFLV YAPIDLRAIN SIVNGGDATK VPILPSGIII  1920
SPDGRLSSNR DNTANAQNGS ILTVTFQIMI CAVSDLDSNP LLVVNDVSTM CFRFRGRGRT  1980
PGPDSNRAVT PPPTDEVVRE GEDGENEQVQ DEGLPPQPTP EMINQVLTYL SGLSDQGQTP  2040
PVFSAPAPQV PGVQHAAAVA LCMDGVITKL IAHKGDLQPT WLVVLGLIGY AEP
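
The Start and End coordinates in the domain tables refer to 1-based residue positions in the protein sequence above. As a minimal sketch of how a region could be pulled out of the numbered sequence block, the Python below strips the position counters and spacing and then slices by coordinates. The helper names clean_sequence and region are hypothetical and are not part of PlantTFDB; the coordinates are assumed to be inclusive.

```python
import re


def clean_sequence(block: str) -> str:
    """Join the letter groups of a numbered sequence block.

    Position counters (60, 120, ...) and all whitespace are dropped.
    Hypothetical helper, not a PlantTFDB function.
    """
    return "".join(re.findall(r"[A-Za-z]+", block)).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if start < 1 or end < start or end > len(seq):
        raise ValueError("coordinates outside the sequence")
    return seq[start - 1:end]


# First two lines of the sequence block in this record.
block = """
MVRTRATTVP TPTPARQGAC EPTIGAVTRG GAVARGRGRG RRRTPSRGRE QTPDSTSNRA  60
VTPPPTDEVV REGEEGENEQ VQDEELPPQP TPEMINQVLT YLSGLSDQGQ TPLVFSAPAP  120
"""

seq = clean_sequence(block)
print(len(seq))
print(region(seq, 61, 70))
```

Applied to the full 2093-residue block, region(seq, 1550, 1588) would return the stretch listed for the Homeobox domain, which can be compared against the alignment shown under Signature Domain.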
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4ol8_A  7e-88    593          1036       34         475      Reverse transcriptase/ribonuclease H
4ol8_B  7e-88    593          1036       34         475      Reverse transcriptase/ribonuclease H
4ol8_E  7e-88    593          1036       34         475      Reverse transcriptase/ribonuclease H
4ol8_F  7e-88    593          1036       34         475      Reverse transcriptase/ribonuclease H
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975448  0.0      HG975448.1 Solanum pennellii chromosome ch09, complete genome.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_027771432.1      0.0      LOW QUALITY PROTEIN: uncharacterized protein LOC114076513
TrEMBL  Q6F2D6              0.0      Q6F2D6_SOLDE; Putative polyprotein, identical
STRING  Solyc11g020560.1.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA127
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G73360.1  3e-71    homeodomain GLABROUS 11