PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Lus10002033
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Linaceae; Linum
Family CPP
Protein Properties Length: 863aa    MW: 93016.2 Da    PI: 6.4047
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Lus10002033genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR49.21e-15466504341
          TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                  +k+CnCkk+kClk+YC+Cfaag +Cse C+C++C+Nk e
  Lus10002033 466 CKRCNCKKTKCLKLYCDCFAAGIYCSEACSCQGCFNKPE 504
                  79**********************************976 PP

2TCR50.63.7e-16552590139
          TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                  ++k+gCnCkks ClkkYCeC++a++ Cs+ C+Ce+CkN 
  Lus10002033 552 RHKRGCNCKKSMCLKKYCECYQANVGCSSGCRCEGCKNV 590
                  589***********************************5 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011143.2E-16464505IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163435.376465592IPR005172CRC domain
PfamPF036381.6E-11467502IPR005172CRC domain
SMARTSM011143.1E-17552593IPR033467Tesmin/TSO1-like CXC domain
PfamPF036381.2E-11554589IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 863 aa     Download sequence    Send to blast
MDSPEVTKTT PAAASSATLT SPIVVQESPF SNFVSNLSPI GPVKSSRISH GYADLSSPPL  60
VFTSPHVVPQ REASLLRRVQ CPQTSSSQTV KVGGKAFADI PPELEKCDPY FGRRFIIGSQ  120
KDSDHISCTK ELSDSPSGCV DEYLSEAVDA DCTDSTSLIR ETSADPVESS ACGLVNLQEK  180
KLQSGTDKEV INFETRPTDS GHDRSTCVPP SDDCNNSTSQ MQLDLVSNEV QSENCFVKDA  240
GLECNRSLPE LSQDLQENEV YLEQVGGGFD NPECDVIQIN PEASQLHRGV SRRCLQFGEG  300
QGVGKGSANV MGSGLPFSAG MESSELSDSG IAARTKKSQM INLSRVLASK RPPNHNANTV  360
LSISKPSGIG LHLNSILNAS AMGCIGSESI NSQSVDVGDR ILENQAAPVS VSSTTDSFHT  420
TKSLTIIQSF EPNTTPQEKR PLDSEQNESS PKRKRKKLTP DGGDGCKRCN CKKTKCLKLY  480
CDCFAAGIYC SEACSCQGCF NKPEYEDTVL QTRQLIESRN PLAFAPKVVH HASRVPAAIV  540
ADVNGTTPSS ARHKRGCNCK KSMCLKKYCE CYQANVGCSS GCRCEGCKNV FGMKDDFAIA  600
EEITSSTTSK DLIENTADDK LQMVVYSDDR VHTEMYNAQS PAPFTPQLQF SEHNREGAPK  660
FRPFSTKYLP SPLSDIPTFQ SVEKSGDGPS VGSSGQEMEC SISEMIEQFS PRFDPIGDNV  720
CDRNASSSSA STTKELANVS SRPLQLLRWR GSPITPMPNL GEIGKSIKGP DSGKKMSEVL  780
MADDTPEILK ETCSPIQSVK ATSPNKKRVS PPHAHPKSIH GFGSSSSGSF KTRKFVLKSV  840
PSFPPLTPCL GSKTKTNEEQ EK*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A4e-1846959014121Protein lin-54 homolog
5fd3_B4e-1846959014121Protein lin-54 homolog
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1450457PKRKRKKL
2452456RKRKK
Cis-element ? help Back to Top
SourceLink
PlantRegMapLus10002033
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
STRINGLus100020330.0(Linum usitatissimum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF85652841
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.12e-72Tesmin/TSO1-like CXC domain-containing protein