Paul Igor Costea
metaSNV

Repository

git clone git@git.embl.de:costea/metaSNV.git
sudo add-apt-repository "deb http://archive.ubuntu.com/ubuntu $(lsb_release -sc) universe"
sudo apt-get update
sudo apt-get install libhts-dev libboost-dev
conda create --name metaSNV boost htslib pkg-config numpy pandas
source activate metaSNV
export CFLAGS=-I$CONDA_ENV_PATH/include
export LD_LIBRARY_PATH=$CONDA_ENV_PATH/lib:$LD_LIBRARY_PATH
conda create --name metaSNV boost htslib pkg-config numpy pandas
source activate metaSNV
# Add this command:
conda install gcc
export CFLAGS=-I$CONDA_ENV_PATH/include
export LD_LIBRARY_PATH=$CONDA_ENV_PATH/lib:$LD_LIBRARY_PATH
make
./getRefDB.sh
metaSNV.py project_dir/ all_samples ref_db [options]
metaSNV_post.py project_dir [options]
./getRefDB.sh
$ cd EXAMPLE
$ ./getSamplesScript.sh
$ find `pwd`/EXAMPLE/samples -name “*.bam” > sample_list
$ python metaSNV.py tutorial sample_list db/freeze9.genomes.RepGenomesv9.fna --threads 8
$ python metaSNV_post.py tutorial

Voila! Your distances will be in the tutorial/distances folder. Enjoy!
$ python metaSNV.py tutorial sample_list db/freeze9.genomes.RepGenomesv9.fna --n_splits 8 --print-commands

Note the addition of the "--print-commnads". This will print out one-liners that you need to run. When done, run same again.
$ python metaSNV.py tutorial sample_list db/freeze9.genomes.RepGenomesv9.fna --n_splits 8 --print-commands

This will calculate the "load balancing" and give you the commands for running the SNV calling.
$ python metaSNV_post.py tutorial