PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sphfalx0007s0206.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Bryophyta; Sphagnophytina; Sphagnopsida; Sphagnales; Sphagnaceae; Sphagnum
Family CPP
Protein Properties Length: 1103 aa    MW: 118536 Da    pI: 6.9287
Description CPP family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sphfalx0007s0206.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     47.3   4e-15    640    678  3          41
                   TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                           +k+CnCkkskClk+YCeCfaag +C  +C C +C Nk e
  Sphfalx0007s0206.1.p 640 CKRCNCKKSKCLKLYCECFAAGIYCVGSCACRECLNKPE 678
                           89**********************************975 PP

2    TCR     51     2.8e-16  724    762  1          39
                   TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                           ++k+gCnCkks ClkkYCeC++ag+ Cse C+Ce+CkN 
  Sphfalx0007s0206.1.p 724 RHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKNM 762
                           589***********************************7 PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   3.6E-13  638    679  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   35.525   639    764  IPR005172    CRC domain
Pfam             PF03638   3.9E-11  641    676  IPR005172    CRC domain
SMART            SM01114   1.6E-18  724    765  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   8.2E-12  726    762  IPR005172    CRC domain
Sequence
Protein Sequence    Length: 1103 aa
MVTRGGGRRC GGEKSKEVDG AKDIGSPDRR DPIAASNHQE SPLFKYLCNL SPIKPLKAVH  60
VAHTYNELTF PPEPRIFASP RSTQRASSSS LKRVVVSERE IKSQFANIRQ VVKASPQDAA  120
VPKFEKVAEF NEDHSARLWM YEEQAEASLR AKFLQDFPPE MVDSVSVVIN DQVSESCDGG  180
LKRSSALDSG YCDHHLSVRH GGEAELDQHT TAMAYLLAGD QDFEPQDRED DWSEDSLEGS  240
RVSACRTSKY PTALKDDFCE DVNVDMNTQL ANEGSVLQDQ SMVLSHTVNP SGQHRGFRRR  300
CLDFDTSVAR GKNLDTGRRS VRRRQSIADV SPSVTSASAV VSCERASAAV DATMSGFSMS  360
MSDLLNAKEC SQAMFNSSAP ATGTVGLGCN GSAGAASGLL QVSGCCNTAG LEPSSVSGCS  420
GAPVCRTQDL SKCEGGNHTF NESSNQGCNP LVMPSGIGLH LNSLTSSISF KRDFHSTRVG  480
NSEGTLASIL GVEPLKSASL KGDQSDVCAQ VGMGNEGTPV IASQSFGSGV ASSSQRSQTG  540
KQFSESLLGN NSLKVDTLDF PSLDHESYGP PEGSLSVTGG DAMAGCRSII GQTGLQEQQT  600
DLFHQSEVET PEEFLDSPQS FKKRRRKSTV ASGDKSGEGC KRCNCKKSKC LKLYCECFAA  660
GIYCVGSCAC RECLNKPEFE ETVLNTRQQI ESRNPLAFAP KIVQAAETSP TPGEDSMDTP  720
ASARHKRGCN CKKSLCLKKY CECYQAGVGC SEGCRCEGCK NMYGRKEGSR EEEEKDGNQV  780
FTMQEEPQAD DPIELLNRMS GKSEQFRSSG NKNISPITPS FENDGMGLRS ASRKRAPHDE  840
HCSSPLLHQA GSRPSKFPTW FSNTLDGFQL AAYSQGAMEL SMSGGGESPM TTMNLSRIGH  900
LSPQWEGLAD ICTLTPLPMA PPRPTPTSVT TLDRSGFSPC FSSQLIDPSC NGGSSATRHN  960
HSLGKLLRRS PPRFRQPAAR SPLNFKTQQD RCNQNQSLAS THLSQEHSQG GKHNALSINS  1020
SGEDDDIHDS LRCPEVASPL QTTITKSGSP NQKRVNPPRQ GGSLEQGPSK AGGGMVMSSS  1080
PGLRSSCKFT LQKCNTTSYI SP*
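The Start and End columns in the domain tables appear to be 1-based, inclusive positions in the protein sequence above. The following is a minimal sketch, not part of the database record, of how one might join the numbered sequence block into a single string and cut out a region by those coordinates. The helper names `parse_sequence_block` and `slice_1based` are hypothetical, and only the two lines covering residues 601 to 720 are copied here to keep the example short.

```python
def parse_sequence_block(text):
    """Join a numbered, space-grouped sequence block into one string.

    Trailing position counters (e.g. 660) are skipped and a terminal
    stop symbol (*) is stripped.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # position counter at the end of the line
            residues.append(token.rstrip("*"))
    return "".join(residues)


def slice_1based(seq, start, end, offset=1):
    """Return residues start..end, 1-based and inclusive.

    offset is the sequence position of seq[0]; it is 1 for a full-length
    sequence and larger when seq is only a fragment (assumption of this sketch).
    """
    return seq[start - offset : end - offset + 1]


# Residues 601-720, copied from the sequence block above.
block = """\
DLFHQSEVET PEEFLDSPQS FKKRRRKSTV ASGDKSGEGC KRCNCKKSKC LKLYCECFAA  660
GIYCVGSCAC RECLNKPEFE ETVLNTRQQI ESRNPLAFAP KIVQAAETSP TPGEDSMDTP  720
"""

fragment = parse_sequence_block(block)

# First TCR signature domain, Start 640 and End 678 in the table.
domain1 = slice_1based(fragment, 640, 678, offset=601)
print(domain1)  # CKRCNCKKSKCLKLYCECFAAGIYCVGSCACRECLNKPE
```

The printed region matches the query line of the first TCR alignment in the Signature Domain section. With the full 1103-residue block pasted in, `offset` can be left at its default of 1.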
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  4e-17    641          762        12         121      Protein lin-54 homolog
5fd3_B  4e-17    641          762        12         121      Protein lin-54 homolog
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    621    626  KKRRRK
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G22780.1  6e-61    Tesmin/TSO1-like CXC domain-containing protein