PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PCP016175.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family Trihelix
Protein Properties Length: 561aa    MW: 63902 Da    PI: 6.1145
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PCP016175.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix85.75.6e-27372455185
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdql 85 
                  rW+++ev aLi +r ++e+++ +  +k++ Weevs+ m++ g+++s+ +Ck+kwen+nk+++k+k+++kkr + +s+tc+yf+ql
  PCP016175.1 372 RWPQSEVEALILVRGSVESKFVELGSKGHVWEEVSALMASMGYQKSARRCKQKWENINKYFRKTKHTAKKR-PLQSKTCSYFNQL 455
                  8*********************************************************************8.9***********9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 561 aa     Download sequence    
MQSNFGAAEI GNQHFMDSNS SVLPVFNPQK QNQDFLHQQF HQQQPQPFHH LSQLAHPIPI  60
THDLFQRQNQ LQFQALLQQQ QQAQQEEGRL PWQPSPPSFK LGLSESSGPH NLGFKNWKTQ  120
EYYGIRRESM CWKPLDAVVA DTTNKRQRQE EGESCRDLEG KHRLYGELEA IYSLAKMGEA  180
NQNQTGSGSA LTGENCSKNV EFPVVFVDPN GLNAAPADNV RVDNVSEASI GEELSVGKIQ  240
KRKRKRKMKE QLSSMTRFFE NLVKQVVDHQ ENLHKKYLEV IEKMGKERRE REEAWRYQAE  300
ENHKRETMAK LHEQALASSR EALIVSYIEE ITGQRINLPP RQTPLLLQPD NVSEPQTAEE  360
LTPSQAVRTN SRWPQSEVEA LILVRGSVES KFVELGSKGH VWEEVSALMA SMGYQKSARR  420
CKQKWENINK YFRKTKHTAK KRPLQSKTCS YFNQLGQLYS KTPTTVPSSS SSVSMDVSIQ  480
RQGYSELLEA FVAGRETEVT HNLSNGNFEV SEMGFSRLDF NGISNGKVEN VQGDRGQEKE  540
NDEDGENMDE GEDSDGDDSD E
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1241247KRKRKRK
2241248KRKRKRKM
3242247RKRKRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.24e-40Trihelix family protein