PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PCP024436.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family Trihelix
Protein Properties Length: 1323 aa    MW: 148922 Da    pI: 5.9418
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
PCP024436.1    genome  GDR     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End   HMM Start  HMM End
1    trihelix  46.2   1.2e-14  801    858   2          64
     trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykki 64 
                  W+++e laL+++r+ me+++ +       We+vs+k++e gf+rs+++Ckek+e+ ++++ +i
  PCP024436.1 801 WSNDELLALLRIRSTMENWFPEF-----TWEHVSRKLAELGFKRSAEKCKEKFEEESRYFNNI 858
                  ********************998.....9******************************9875 PP

2    trihelix  93.5   2.1e-29  1170   1263  1          86
     trihelix    1 rWtkqevlaLiearremeerlrrgk.........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86  
                   rW+++evlaLi++r ++ ++ ++g           k+plWe++s+ m e g++rs+k+Ckekwen+nk+++k+k+ +kkr s +s+tcpyf+ql 
  PCP024436.1 1170 RWPRDEVLALINLRCSLFNNGGSGGdqdkdggvvMKAPLWERISQGMFEIGYKRSSKRCKEKWENINKYFRKTKDVNKKR-SLDSRTCPYFHQLS 1263
                   8*****************9999875566788888*********************************************8.9999********95 PP

Sequence
Protein Sequence    Length: 1323 aa
MWLKPAPSCF SMLKPYCSAT PPPPPPPTTA FKPLMATKTT EESDVGIFCY ISQLPGFRGV  60
LKQRYSDFMV NEVDKMGNVV HLTNLDAPVE AVKESGTTTS DGASKDYTAE IESFRALVDP  120
NDAERLETFI NQINSSSDDG VMPIVLSPDY DKSHRTAVHN FFKQHFKFLV TDTVDGPDAS  180
SKCIRVRINS GGQNSRGRNS RKRKERGDKP FDSRGSEDWS EHVGKFLRFH LYKENKDTQE  240
ALGVIGKMLG VQPRSFGFAG TKDKRAVTTQ RVTVFKKLAS KLAALNDRLI GIKVGDFCYV  300
TEGLVLGQLL GNRFTITLSD LEAVLCQLTI LELHYSEESG NLLNVITEAR EYYKETNDIE  360
GTLRKLPRHL VSERAILQCL KKCPGNYLQA LKAIPRTLRM MYVHSYQSYL WNHAASMRVK  420
KYGTDRVVLG DLVFCKGNET EKVTEVVTSE CIDENSDYML DPNDLGDIAE TNLPEEKLNL  480
VKAVTAEDIQ SGNYTIEDVV LPMPGSRVIF PENDIADVFH DLAKKDAISL TESVHNVNEF  540
SITSVTGNYR RVFQKPMDFE WEILKYVDGN VPLVDTDLDK ISKAKPAKVD KEDPSSVNGD  600
GSLHDSAKQS EYIDNDIGDK GEKVPEVGSL GDTNPQETQM ALNEQTKGMV KRNTHTLSHK  660
THTNMFDGVP AEQLHQLIAS SRTSLPLPLP LPLSSFPPPP PNNNINTFHV PAGAPFDPYN  720
NNNDPSHLHH QLLQIQPHLL HQNHQLLQLH RQSTATPENL EEEEEHSSTV SISINNNLEI  780
ERDRSSSVPA SSDVPISSDP WSNDELLALL RIRSTMENWF PEFTWEHVSR KLAELGFKRS  840
AEKCKEKFEE ESRYFNNIYF TKNYRFLSDL EQICQGGGDA HQRSPDDYDH RNPDDHQEAP  900
GDKNNKKEMV EKPSDEGDHP DSTRNETLVE DNIGVISNEP LNLQEHNKKE VVVETRPTNN  960
NNKRKRQRRF EMLKGFCEDI VNRMMAQQEE MHNKLLEEMV KRNEEKIAKE EAWKKQETDR  1020
MNKELELMAR EQAVAGDRQA TIIKFLKKFT SSSSNSSTST SSSPIQVQNP TLPTCRDVSG  1080
RKELDRHQEK ENNPTTPSSL TESTLAPQSP TSSTLAPTPL PVPTVTISLS TTTNTTVALA  1140
PPENPSSEDV IINAQNPSSI SHDKQDLGKR WPRDEVLALI NLRCSLFNNG GSGGDQDKDG  1200
GVVMKAPLWE RISQGMFEIG YKRSSKRCKE KWENINKYFR KTKDVNKKRS LDSRTCPYFH  1260
QLSTLYNQGM LVSPSDQAPE NRPASPENHS LGDGDSTKHD EGEKNNNMVK QAPTPTPAFD  1320
FEF
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    963    968  KRKRQR
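
The Start and End columns appear to be 1-based, inclusive residue positions in the protein sequence listed above. As a minimal sketch under that assumption, the Python snippet below recovers the NLS from the sequence line that ends at residue 1020 (so it begins at residue 961). The helper names `parse_block` and `subsequence` are hypothetical and are not part of PlantTFDB.

```python
def parse_block(line: str) -> str:
    """Join the 10-residue groups of one sequence line, dropping the trailing position counter."""
    parts = line.split()
    if parts and parts[-1].isdigit():
        parts = parts[:-1]  # the last field is the running residue count, not sequence
    return "".join(parts)


def subsequence(residues: str, first_pos: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) when residues[0] sits at first_pos."""
    return residues[start - first_pos : end - first_pos + 1]


# Sequence line covering residues 961-1020, copied from the record above.
line = "NNKRKRQRRF EMLKGFCEDI VNRMMAQQEE MHNKLLEEMV KRNEEKIAKE EAWKKQETDR  1020"
residues = parse_block(line)

# Assumption: the NLS table uses 1-based, inclusive coordinates.
nls = subsequence(residues, first_pos=961, start=963, end=968)
print(nls)  # KRKRQR
```

The extracted residues match the Sequence column of the NLS table, which supports the 1-based, inclusive reading of the coordinates.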
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G28300.1  4e-42    Trihelix family protein