Commit 40c2b294 authored by Bertjan Broeksema's avatar Bertjan Broeksema
Browse files

Reorganize website a bit.

* Move text from main page to installation page
* Add link to (outdated) video on main page
* Add proper content to about page
parent ca67a072
...@@ -4,12 +4,26 @@ title: About ...@@ -4,12 +4,26 @@ title: About
permalink: /about/ permalink: /about/
--- ---
This is the base Jekyll theme. You can find out more info about customizing your Jekyll theme, as well as basic Jekyll usage documentation at [jekyllrb.com](http://jekyllrb.com/) ICoVeR is the result of collaborative work of members from the e-Science research unit and the plants for biomass unit in the [environmental department (ERIN)](http://www.list.lu/en/erin/) of [Luxembourg Institute of Science and Technology (LIST)](http://www.list.lu/).
You can find the source code for the Jekyll new theme at: <div style="width: 230px; display:block; margin: 0 auto;">
{% include icon-github.html username="jglovier" %} / <a href="http://www.list.lu/">
[jekyll-new](https://github.com/jglovier/jekyll-new) <img src="{{ site.baseurl }}/img/list-logo.jpg" alt=" ">
</a>
</div>
You can find the source code for Jekyll at ### People
{% include icon-github.html username="jekyll" %} /
[jekyll](https://github.com/jekyll/jekyll) * [Bertjan Broeksema](https://www.linkedin.com/in/bertjanbroeksema)
* [Magdalena Calusinska](https://be.linkedin.com/in/magdalena-calusinska-21052642/en)
* [Fintan McGee](https://lu.linkedin.com/in/fintan-mcgee-3a60a22)
* [Klaas Winter](https://nl.linkedin.com/in/klaas-winter-27662252)
* [Mohammad Ghoniem](https://lu.linkedin.com/in/mohammadghoniem/en)
* [Philippe Delfosse](https://lu.linkedin.com/in/philippe-delfosse-23252a5)
### License
ICoVeR is released under GPLv3.
Copyright 2015 Luxembourg Institute of Science and technology <tto@list.lu>.
All rights reserved.
...@@ -3,133 +3,11 @@ layout: default ...@@ -3,133 +3,11 @@ layout: default
title: Overview title: Overview
--- ---
## Introduction ICoVeR is an interactive visualization tool for binning of metagenomics data.
It allows comparing results of multiple automated binning techniques and provides means to analyze bins in more detail.
Its interactive visualization features help to get deeper insight in the structure of bins and interaction between bins.
## Required Software
<iframe style="display:block; margin: 0 auto;"
This guide was tested with [R](http://www.r-project.org/) v3.2.0 and width="600" height="480" frameborder="0"
[RStudio](http://www.rstudio.com/) v0.98.1102. Newer versions should work, src="http://www.youtube.com/embed/59ZDtmP8ZjA?vq=hd720"
please report any problems at [TODO](http://no.link.yet). Further more, a modern allowfullscreen></iframe>
browser is required. We have had most convenient experience with Chrome, but
recent versions of Firefox worked as well.
To run our interactive interface, the OpenCPU R-package must be installed. To
this end run the following command in R or RStudio.
{% highlight r %}
# Bioconductor is required for the preprocessing of the fasta files
> source("http://bioconductor.org/biocLite.R")
> biocLite("Biostrings")
# OpenCPU is required to run the interactive binning tool
> install.packages("opencpu")
# devtools is required to install the interactive binning tool
> install.packages(devtools)
{% endhighlight %}
To get started with the guide you will have to download our
[prepared package](link.to.package) or clone the [github repository](github.com).
In the remainder of this guide we will assume that the package is extracted in
your home directory: `~/icobire-guide/`.
## Data
We provide the two data sets that have been analyzed using our interactive contig
binning system. The first data set is the one generated by
[Wrighton et al](http://www.sciencemag.org/content/337/6102/1661). The second
data set, is the CSTR data set, which is generated by us from an anaerobic
digester.
## Assembly
**TODO**: Currently the data contains a fasta file. I assume this file is
generated somehow from a fastq(?) file.
## Abundance extraction
**TODO**: Currently the data contains a csv fil with abundance levels. We'll
need to describe how we generated them.
## Preprocessing
In order to load the data into our interactive contig binning system, we need to
pre-process the fasta and abundance level files. This is done by using the
scripts provided in the R.preprocessing directory. In this example we will
prepare the cstr data set for interactive contig binning.
The `PrepareDataForInteractiveBinning` function performs the following steps:
1. Extract gc_content and contig length for each contig from the fasta file
1. Extract tetra nucleotide frequencies for each contig from the fasta file
1. Combine extracted information with sample abundance levels into the files
which are expected by the interactive contig binning tool.
**NOTE:** The ids in the fasta file must match the names in the abundance file.
Thus, if a contig in the fasta file starts with a line `>contig_123`, there must
be a line in the abundance file as where the name column has value `contig_123`.
Some error checking is done, but you should not rely on this and make sure that
the fasta file and the abundance file are prepared properly.
Start R studio and type the following commands.
{% highlight r %}
# Set the working directory
> setwd("~/icobire-guide")
# Source the required R file for preprocessing. This will load all the required
# dependencies as well.
> source("R.preprocessing/preprocessing.R")
> PrepareDataForInteractiveBinning(
dataset.name = "cstr",
file.fasta = "data//cstr_assembled.fasta",
file.abundance = "data//cstr_avg_coverage.csv",
dir.result = "data//prepared"
)
# Check if you find cstr.rda and cstr.schema.rda in: R.ICoBiRe/data. If so, we
# continue installing the interactive binning application.
> library(devtools)
# NOTE: Before installing you should check the file R.ICoBiRe/R/sqlite.R. At the
# top there is a variable declared named: p.db.dataset. The value for this
# variable must match the data set you want to analyze (i.e. "cstr" in
# this case, which is the default).
#
# If you want to bin a different contig set, you must change this value
# **before** installing the R.ICoBiRe package.
> install_local(file.path(getwd(), "R.ICoBiRe"))
{% endhighlight %}
## Interactive binning
At this point everything is set up to start the interactive binning process.
In order to start the application we need to start OpenCPU:
{% highlight r %}
> library(opencpu)
Initiating OpenCPU server...
Using config: ~/.opencpu.conf
OpenCPU started.
[httpuv] http://localhost:2110/ocpu
OpenCPU single-user server ready.
{% endhighlight %}
If OpenCPU started without errors, the interactive application can now be accessed
in your browser at the following url:
http://localhost:2110/ocpu/library/ICoBiRe/www/
**NOTE 1:** The port number (i.e. 2110) must match with the output of OpenCPU in your RStudio session.
This can differ everytime you start the application.
**NOTE 2:** The application stores the most important data resulting from your analysis (i.e. clustering results and tags).
An analysis can therefore be splitted into several sessions.
The initial preparations are not required after the first (assuming binning of the same data set is continued).
To continue a previously stopped session, just start R(Studio), load the OpenCPU library as shown above, and point your browser to the correct url.
## Re-assembly and verification
---
layout: page
title: Installation
permalink: /installation/
---
Table of Contents
=================
* [Prerequisites](#prerequisites )
* [Data preparation](#data-preparation )
## Prerequisites
[ICoVeR]({{ site.baseurl }}/) is designed and implemented as a regular [R](http://www.r-project.org/) package.
This guide was tested with [R](http://www.r-project.org/) v3.2.0 and [RStudio](http://www.rstudio.com/) v0.98.1102.
Newer versions should work, please report any problems at [our github page](https://github.com/bbroeksema/ICoVeR/issues).
Further more, a modern browser is required.
We have had most convenient experience with Chrome, but recent versions of Firefox worked as well.
To run our interactive interface, the OpenCPU R-package must be installed.
To this end run the following command in R or RStudio.
{% highlight r %}
# Bioconductor is required for the preprocessing of the fasta files
> source("http://bioconductor.org/biocLite.R")
> biocLite("Biostrings")
# OpenCPU is required to run the interactive binning tool
> install.packages("opencpu")
# devtools is required to install the interactive binning tool
> install.packages(devtools)
{% endhighlight %}
To get started with the guide you will have to download our [prepared package](link.to.package) or clone the [github repository](github.com).
In the remainder of this guide we will assume that the package is extracted in
your home directory: `~/icobire-guide/`.
## Data preparation
In order to get you started quickly with the tool, we [provide]() to prepared datasets.
The first data set is the one generated by [Wrighton et al](http://www.sciencemag.org/content/337/6102/1661).
The second data set, is the CSTR data set, which is generated by us from an anaerobic digester.
## Preprocessing
In order to load the data into our interactive contig binning system, we need to
pre-process the fasta and abundance level files. This is done by using the
scripts provided in the R.preprocessing directory. In this example we will
prepare the cstr data set for interactive contig binning.
The `PrepareDataForInteractiveBinning` function performs the following steps:
1. Extract gc_content and contig length for each contig from the fasta file
1. Extract tetra nucleotide frequencies for each contig from the fasta file
1. Combine extracted information with sample abundance levels into the files
which are expected by the interactive contig binning tool.
**NOTE:** The ids in the fasta file must match the names in the abundance file.
Thus, if a contig in the fasta file starts with a line `>contig_123`, there must
be a line in the abundance file as where the name column has value `contig_123`.
Some error checking is done, but you should not rely on this and make sure that
the fasta file and the abundance file are prepared properly.
Start R studio and type the following commands.
{% highlight r %}
# Set the working directory (change path to your local checkout location)
> setwd("~/ICoVeR")
# Source the required R file for preprocessing. This will load all the required
# dependencies as well.
> source("R.preprocessing/preprocessing.R")
> PrepareDataForInteractiveBinning(
dataset.name = "cstr",
file.fasta = "data//cstr_assembled.fasta",
file.abundance = "data//cstr_avg_coverage.csv",
dir.result = "data//prepared"
)
## Installation
# Check if you find cstr.rda and cstr.schema.rda in: R.ICoVeR/data. If so, we
# continue installing the interactive binning application.
> library(devtools)
# NOTE: Before installing you should check the file R.ICoBiRe/R/sqlite.R. At the
# top there is a variable declared named: p.db.dataset. The value for this
# variable must match the data set you want to analyze (i.e. "cstr" in
# this case, which is the default).
#
# If you want to bin a different contig set, you must change this value
# **before** installing the R.ICoBiRe package.
> install_local(file.path(getwd(), "R.ICoVeR"))
{% endhighlight %}
## Interactive binning
At this point everything is set up to start the interactive binning process.
In order to start the application we need to start OpenCPU:
{% highlight r %}
> library(opencpu)
Initiating OpenCPU server...
Using config: ~/.opencpu.conf
OpenCPU started.
[httpuv] http://localhost:2110/ocpu
OpenCPU single-user server ready.
{% endhighlight %}
If OpenCPU started without errors, the interactive application can now be accessed
in your browser at the following url:
http://localhost:2110/ocpu/library/ICoVeR/www/
**NOTE 1:** The port number (i.e. 2110) must match with the output of OpenCPU in your RStudio session.
This can differ everytime you start the application.
**NOTE 2:** The application stores the most important data resulting from your analysis (i.e. clustering results and tags).
An analysis can therefore be splitted into several sessions.
The initial preparations are not required after the first (assuming binning of the same data set is continued).
To continue a previously stopped session, just start R(Studio), load the OpenCPU library as shown above, and point your browser to the correct url.
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment