Type: Package
Title: CRISPR-Based Salmonella Enterica Serotype Analyzer
Version: 1.2.0
Description: Salmonella enterica is a major cause of bacterial food-borne disease worldwide. Serotype identification is the most commonly used typing method to characterize Salmonella isolates. However, experimental serotyping needs great cost on manpower and resources. Recently, we found that the newly incorporated spacer in the clustered regularly interspaced short palindromic repeat (CRISPR) could serve as an effective marker for typing of Salmonella. It was further revealed by Li et. al (2014) <doi:10.1128/JCM.00696-14> that recognized types based on the combination of two newly incorporated spacer in both CRISPR loci showed high accordance with serotypes. Here, we developed an R package 'CSESA' to predict the serotype based on this finding. Considering it’s time saving and of high accuracy, we recommend to predict the serotypes of unknown Salmonella isolates using 'CSESA' before doing the traditional serotyping.
License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]
Encoding: UTF-8
LazyData: true
RoxygenNote: 6.0.1
Suggests: testthat
Imports: Biostrings
NeedsCompilation: no
Packaged: 2019-03-02 11:48:19 UTC; Xia
Author: Xia Zhang [aut, cre], Lang Yang [aut]
Maintainer: Xia Zhang <zhangxia9403@gmail.com>
Repository: CRAN
Date/Publication: 2019-03-03 16:40:02 UTC

CSESA (CRISPR-based Salmonella enterica Serotype Analyzer).

Description

The main function in CSESA package.

Usage

CSESA(in.file1 = NULL, in.file2 = NULL, out.file = NULL,
  method = c("PCR", "WGS"))

Arguments

in.file1

The first input file, the default value is NULL.

in.file2

The second input file (optional), the default value is NULL.

out.file

Into which results will be saved if this value is set. Otherwise results will be displayed on the screen.

method

The method to handle the input file(s), which can be set as "PCR" or "WGS". Choose "PCR" if the CRISPR sequence(s) from PCR amplification is entered, and choose "WGS" when entering the whole genome assembly of a Salmonella isolate.

Note

If you use the "WGS" method, please make sure you have installed the BLAST software and included it within the working path.

Examples

  CSESA(system.file("extdata", "sequence_CRIPSR1.fasta", package = "CSESA"), 
  system.file("extdata", "sequence_CRIPSR2.fasta", package = "CSESA"), method = "PCR")
  CSESA(system.file("extdata", "sequence_CRIPSR1.fasta", package = "CSESA"), method = "PCR")
  CSESA(system.file("extdata", "Salmonella_whole_genome_assembly.fasta", 
  package = "CSESA"), method = "WGS")

Find the serotype based on the analysis of the new spacers.

Description

Find the serotype based on the analysis of the new spacers.

Usage

FindSerotype(csesa1 = NA, csesa2 = NA)

Arguments

csesa1

The new spacer of the first sequence.

csesa2

The new spacer of the second sequence.

Value

The data frame which represents the serotype.


Get the new spacers from the molecular sequence and its reverse complement.

Description

Get the new spacers from the molecular sequence and its reverse complement.

Usage

GetAllNewSpacers(molecular.seq = NULL)

Arguments

molecular.seq

The molecular sequence.

Value

The vector of the new spacers, which is extracted from the molecular sequence and its reverse complement.

Note

If there doesn't exist any new spacer, the function would return NA.


Get the new spacer from the molecular sequence.

Description

Get the new spacer from the molecular sequence.

Usage

GetNewSpacer(molecular.seq = NULL)

Arguments

molecular.seq

The molecular sequence.

Value

The new spacer sequence as a string.

Examples

GetNewSpacer("AGAGGCGGACCGAAAAACCGTTTTCAGCCAACGTAT")


Get the new spacer from the molecular sequence and map it to the code.

Description

Get the new spacer from the molecular sequence and map it to the code.

Usage

GetNewSpacerCode(molecular.seq = NULL)

Arguments

molecular.seq

The molecular sequence.

Value

The new spacer code as a string.


Return the reverse complement of the sequence.

Description

Return the reverse complement of the sequence.

Usage

GetReverseComplement(x)

Arguments

x

The input sequence.

Value

The reverse complement sequence as a string.


Get the information string from the CSESA s3 object.

Description

Get the information string from the CSESA s3 object.

Usage

GetStr(csesa)

Arguments

csesa

The S3 object CSESA.

Value

The string record the newly spacers and serotype information.


Get the CSESA obeject through the two sequence.

Description

Get the CSESA obeject through the two sequence.

Usage

PCR(seq1, seq2, out.file)

Arguments

seq1

The first DNA sequence.

seq2

The second DNA sequence.

out.file

Into which results will be saved if this value is set. Otherwise results will be displayed on the screen.


Read the three types of input file.

Description

Read the three types of input file.

Usage

ReadInFile(file.name)

Arguments

file.name

The input file name.

Value

The molecular sequence as a string.


Find the serotype based on the analysis of the new spacers.

Description

Find the serotype based on the analysis of the new spacers.

Usage

WGS(file)

Arguments

file

The input fasta file.

Value

The two DNA molecular sequence.