PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PK15417.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Cannabaceae; Cannabis
Family Trihelix
Protein Properties Length: 493aa    MW: 55566.5 Da    PI: 6.3505
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PK15417.1genomeCCBRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix75.87e-2496196186
   trihelix   1 rWtkqevlaLiearremeerlrrgk..............lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfdql 85 
                +Wt+ +v++Li a+ +++++ ++++              +kk++W+ vs++m+e+gf +sp+qC++k+++lnkryk+++++ +k+ +++++++ +++d +
  PK15417.1  96 KWTDTMVRLLIMAVYYIGDENSSEAidptnkkkptggllQKKGKWKTVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGtACKVVENQSLLDTM 195
                7**************999888876556667788999999**********************************************559*********999 PP

   trihelix  86 e 86 
                +
  PK15417.1 196 D 196
                7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.9E-2294221No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 493 aa     Download sequence    Send to blast
MGAGMLGLEI PLQQQQQNPS NTQNPHLLSH SQMVAYGHHD SDHHPQAQPS VKHGYPYAPK  60
SKQIPPTLSD EDDLGFGADD NSGDGKRKMS PWQRMKWTDT MVRLLIMAVY YIGDENSSEA  120
IDPTNKKKPT GGLLQKKGKW KTVSRAMMEK GFYVSPQQCE DKFNDLNKRY KRVNDILGKG  180
TACKVVENQS LLDTMDLAPK IKEEVRKLLN SKHLFFREMC AYHNSCSHAG ATTTNGVASG  240
ANHSSELATS EPSSNVQPNQ QNQQQQQQQQ RCFHATDNSH VVSNLSRPGT EHGSKVVKAS  300
GSGGEDEEDD DEDDDDSDDY DDDEDDDEAE GGSRGPIGHG HEDIDDVSHV RLRKRPRKGV  360
LSESSQLIQQ LSCEILGVIQ DGSKSSWEKK QWLKSRMIQL EEQQVNFQCE AFELEKQRLK  420
WVKFSSKKER EMEKAKFENE KRRLENERMV LLIRQKELEV LELYHRSQHQ QQQHSSNKRS  480
GGGGGGDPSS ITG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1350356RLRKRPR
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_015898596.10.0uncharacterized protein LOC107432054
TrEMBLA0A2P5FVN40.0A0A2P5FVN4_TREOI; Sequence-specific DNA binding transcription factor
STRINGXP_004303047.10.0(Fragaria vesca)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF48203457
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G10040.15e-75sequence-specific DNA binding transcription factors