aboutsummaryrefslogtreecommitdiff
diff options
context:
space:
mode:
-rw-r--r--doc/cli.markdown61
-rw-r--r--doc/index.markdown9
-rw-r--r--doc/install.markdown35
-rw-r--r--doc/python.markdown22
4 files changed, 77 insertions, 50 deletions
diff --git a/doc/cli.markdown b/doc/cli.markdown
index 843bae9..214c3b7 100644
--- a/doc/cli.markdown
+++ b/doc/cli.markdown
@@ -1,39 +1,36 @@
# Quikr Command Line Utilities #
-
Quikr has three command-line utilities that mirror the behavior of the python
module and the matlab implementation. The advantage of this is ease of scripting
-and job management. These utilities are written in python and wrap the quikr
-module.
+and job management, as well as faster processing and lower memory usage. These
+utilities are written in C and utilize OpenMP for multithreading.
## Quikr\_train ##
-
The quikr\_train is a tool to train a database for use with the quikr tool.
-Before running the quikr utility, you need to generate the sensing matrix or
+Before running the quikr utility, you need to generate the sensing matqrix or
download a pretrained matrix from our database\_download.html.
### Usage ###
-quikr\_train returns a custom trained matrix that can be used with the quikr
-function. You must supply a kmer.
+quikr\_train returns a custom sensing matrix that can be used with the quikr
+function.
quikr\_train's arguments:
-i, --input, the database of sequences (fasta format)
- -o, --output, the trained matrix (text file)
- -k, --kmer, the kmer size, the default is 6 (integer)
- -z, --compress compress the output matrix with gzip (flag)
+ -o, --output, the sensing matrix (text file)
+ -k, --kmer, specifiy wha size of kmer to use. (default value is 6)
+ -v, --verbose, verbose mode.
### Example ###
Here is an example on how to train a database. This uses the -z flag to compress
the output matrix since it can be very large. Because of the sparse nature of
the database, the matrix easily achieves a high compression ratio, even with
-gzip. It takes the gg94\_database.fasta as an input and outputs the trained
-matrix as gg94\_trained\_databse.npy.gz
+gzip. It takes the gg94\_database.fasta as an input and outputs the sensing
+matrix as gg94\_sensing\_databse.npy.gz
- quikr_train -i gg94_database.fasta -o gg94_trained_database.npy.gz -k 6 -z
+ quikr_train -i gg94_database.fasta -o gg94_sensing_database.matrix.gz -k 6
## Quikr ##
Quikr returns the estimated frequencies of batcteria present when given a
-input FASTA file. A default trained matrix will be used if none is supplied
-You must supply a kmer and default lambda if using a custom trained matrix.
+input FASTA file. You need to train a matrix or download a new matrix
### Usage ###
quikr returns the solution vector as a csv file.
@@ -42,8 +39,8 @@ quikr's arguments:
-f, --fasta, the sample's fasta file of NGS READS
-o, --output OTU\_FRACTION\_PRESENT, a vector representing the percentage of
database sequence's presence in sample (csv output)
- -t, --trained-matrix, the trained matrix
- -l, --lamb, the lambda size. (the default lambda value is 10,000)
+ -s, --sensing-matrix the sensing matrix. (generated by quikr\_train)
+ -l, --lambda, the lambda size. (the default lambda value is 10,000)
-k, --kmer, this specifies the size of the kmer to use (default is 6)
## Multifasta\_to\_otu ##
@@ -66,14 +63,14 @@ with aspecified number of jobs. Otherwise python with run one job per cpu core.
### Usage ###
multifasta\_to\_otu's arguments:
- -i, --input-directory, the directory containing the samples' fasta files of
+ -i, --input, the directory containing the samples' fasta files of
reads (note each fasta file should correspond to a separate sample)
-o, --otu-table, the OTU table, with OTU\_FRACTION\_PRESENT for each sample,
which is compatible with QIIME's convert\_biom.py (or sequence table if not
OTU's)
- -t, --trained-matrix, the trained matrix
- -f, --trained-fasta, the fasta file database of sequences
- -l, --lamb, specify what size of lambda to use (the default value is 10,000)
+ -s, --sensing-matrix, the sensing matrix
+ -f, --sensing-fasta, the fasta file database of sequences
+ -l, --lambda, specify what size of lambda to use (the default value is 10,000)
-k, --kmer, specify what size of kmer to use, (default value is 6)
-j, --jobs, specifies how many jobs to run at once, (default=number of CPUs)
@@ -98,12 +95,6 @@ The QIIME procedue:
principal_coordinates.py -i beta_div/weighted_unifrac_<quikr_otu>.txt -o <quikr_otu_project_name>_weighted.txt
make_3d_plots.py -i <quikr_otu_project_name>_weighted.txt -o <3d_pcoa_plotdirectory> -m <qiime_metadata_file>
-
-# Python Quikr Troubleshooting #
-
-If you are having trouble, and these solutions don't work. Please contact the
-developers with questions and issues.
-
#### Broken Pipe Errors ####
Make sure that you have the count-kmers and probablilties-by-read in your
$PATH, and that they are executable.
@@ -111,19 +102,5 @@ $PATH, and that they are executable.
If you have not installed quikr system-wide, you'll need to add the folder
location of these binaries in the terminal before running the command:
+ mv /path/to/quikr/src/nbc/count /path/to/quikr/src/nbc/count-kmers
PATH = $PATH:/path/to/quikr/src/nbc/
-
-Make sure that the binaries are executable by running:
-
- chmod +x probabilities-by-read
- chmod +x count-kmers
-
-#### Python Cannot Find XYZ ####
-
-Ensure that you have Python 2.7, Scipy, Numpy, and BIOpython installed
-and that python is setup correctly. You should be able to do this from a python
-prompt without any errors:
- >>> import numpy
- >>> import scipy
- >>> from Bio import SeqIO
-
diff --git a/doc/index.markdown b/doc/index.markdown
index 93688bc..03672cd 100644
--- a/doc/index.markdown
+++ b/doc/index.markdown
@@ -13,17 +13,16 @@ accurate down to the genus level.
## How Do I Install Quikr ##
Please read the directions on the [installation page](install.html).
-
## How Do I use Quikr ##
-We have several ways to use quikr. There is a python module, command line
-scripts, and matlab scripts.
+We have several ways to use quikr. Quikr is first and formost a command
+line utility, but we also provide python and matlab scripts.
++ [Command Line Utilities](cli.html)
+ [Matlab documentation](matlab.html)
+ [Python documentation](python.html)
-+ [Command Line Utilities](cli.html)
## Contact ##
-For issues with the python implementation, contact gailro@gmail.com
+For issues with the quikr software, contact gailro@gmail.com
## Contributors ##
+ David Koslicki
diff --git a/doc/install.markdown b/doc/install.markdown
index cbbbe21..82004aa 100644
--- a/doc/install.markdown
+++ b/doc/install.markdown
@@ -1,14 +1,32 @@
# How Can I Install The Quikr Utility? #
To use Quikr there are several prerequisites.
-Base Requirements:
-+ Mac OS X or GNU/Linux or Unix-based operating system+
-+ Python 2.7, Scipy, Numpy, BIOPython modules
+## Requirements ##
++ Mac OS X 10.6.8 or GNU/Linux
+ 4Gb of RAM minimum. Absolutely neccessary.
++ gcc that supports OpenMP
+
+### Python Requirements ###
++ Python 2.7
++ Scipy
++ Numpy
++ BioPython
+
+### Mac Requirements ###
++ Mac OS X 10.6.8 (what we have tested)
++ GCC 4.7 or newer. (gcc 4.2 did not work, and is the default installation)
++ OCaml compiler mlton
++ OpenMP libraries (libgomp, usually comes with gcc)
+
+### Linux Requirements ###
++ GCC 4.7 or newer
++ OCaml compiler mlton
++ OpenMP libraries (libgomp, usually comes with gcc)
We also have a Quikr implementation in Matlab so that you can easily integrate
Quikr into your custom programs and scripts.
+### Installation ###
Our Quikr code is available on our sourceforge download page:
[http://sourceforge.net/projects/quikr/](sourceforge project page)
@@ -16,3 +34,14 @@ Our Quikr code is available on our sourceforge download page:
Our development GIT repository is available here:
[http://rosalind.ece.drexel.edu/git/quikr/](rosalind.ece.drexel.edu/git/quikr/)
+
+To install quikr, download our project and in the folder run:
+
+ make
+ sudo make install
+
+This will install the quikr, quikr\_train and multifasta\_to\_otu utilities.
+To install the python scripts and module systemwide, run
+
+ make python
+ sudo make install_python
diff --git a/doc/python.markdown b/doc/python.markdown
new file mode 100644
index 0000000..df086d8
--- /dev/null
+++ b/doc/python.markdown
@@ -0,0 +1,22 @@
+# Python Documentation #
+The python version comes with scripts that can be used like the regular quikr
+program, and also a module called quikr so integration with python scripts
+is easier.
+
+If you are switching to use the python scripts instead of the regular, you
+will need to regenerate your trained databases with the python version of
+quikr\_train.
+
+## Function documentation ##
+If you want to use our quikr module, run help on the module:
+
+ >>> import quikr
+ >>> help(quikr)
+## Python Cannot Find XYZ ##
+
+Ensure that you have Python 2.7, Scipy, Numpy, and BIOpython installed
+and that python is setup correctly. You should be able to do this from a python
+prompt without any errors:
+ >>> import numpy
+ >>> import scipy
+ >>> from Bio import SeqIO