Alignment-free simulation, mapping, and visualization of low-complexity regions in FASTA data.
This method enables fast sequence characterization through low-complexity regions and is well suited to scenarios involving new or unknown sequences. AlcoR offers important analysis advantages, such as high sensitivity and speed, and it does not produce false positives, which makes it a good fit for current Telomere-to-telomere (T2T) sequencing and assembly methodologies. AlcoR is implemented in C, uses multi-threading to increase computational speed, is flexible enough for multiple applications, and has no external dependencies. The tool accepts any sequence in FASTA format.
The AlcoR tool has one main menu (command: AlcoR) with four submenus, one for each feature it provides:
- extract: extracts a sequence of a FASTA file using positional coordinates (independent from the existing headers of the FASTA files);
- mapper: computes the low-complexity regions of a FASTA read while providing bidirectional complexity profiles and further structural similarity analysis;
- simulation: simulates FASTA sequences through file extraction, random generation, and sequence modeling. Additionally, it allows applying SNP mutations with a specified probability;
- visual: computes an SVG file with the respective map containing the low-complexity regions.
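The idea behind the coordinate-based extraction described for the extract submenu can be illustrated with standard Unix tools. This is only a sketch of the concept, not AlcoR's implementation: the function name `extract_region` and its 1-based, inclusive coordinates are assumptions made for this example. The actual coordinate convention and options of the tool should be checked with AlcoR extract -h.

```shell
# Hypothetical helper (not part of AlcoR): print the symbols between two
# 1-based, inclusive positions of a FASTA file, ignoring all header lines.
extract_region() {
  fasta="$1"
  start="$2"
  end="$3"
  # Drop header lines, join the sequence lines into a single line,
  # then select the requested range of characters.
  grep -v '^>' "$fasta" | tr -d '\r\n' | cut -c "${start}-${end}"
}

# Usage (demo.fa is an illustrative file name):
#   printf '>h1\nACGT\nTTGA\n' > demo.fa
#   extract_region demo.fa 3 6    # prints GTTT
```

Because headers are discarded before the positions are counted, the coordinates refer to the concatenated sequence content only, which mirrors the "independent from the existing headers" behavior described above.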
Install Miniconda, then run the following:
conda install -y -c bioconda alcor
Otherwise, CMake is needed for installation (http://www.cmake.org/). You can download it directly from http://www.cmake.org/cmake/resources/software.html or use an appropriate package manager. The following instructions show the procedure to install the requirements, download, and compile AlcoR:
sudo apt-get install cmake git
git clone https://github.com/cobilab/alcor.git
cd alcor/src/
cmake .
make
Run AlcoR FASTA simulation tool:
AlcoR simulation -rs 2000:0:1:0:0:0 -rs 2000:0:11:0:0:0 -rs 2000:0:1:0:0:0 -rs 2000:0:71:0:0:0 > sample.fasta;
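Before running the mapper, a quick sanity check of the simulated file can be done with standard Unix tools. The sketch below counts the sequence symbols outside header lines and tallies each symbol. The function names `fasta_length` and `fasta_composition` are hypothetical helpers written for this example; they are not AlcoR commands.

```shell
# Hypothetical helpers (not part of AlcoR).

# Count the sequence symbols, ignoring header lines and line breaks.
fasta_length() {
  grep -v '^>' "$1" | tr -d '\r\n' | wc -c | tr -d ' '
}

# Print one "symbol count" pair per line, sorted by symbol.
fasta_composition() {
  grep -v '^>' "$1" | tr -d '\r\n' | fold -w 1 | sort | uniq -c |
    awk '{ print $2, $1 }'
}

# Usage, assuming sample.fasta was produced by the simulation command:
#   fasta_length sample.fasta
#   fasta_composition sample.fasta
```

A strongly skewed composition or an unexpected length is an early hint that the simulation parameters did not do what was intended.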
Run AlcoR low-complexity mapper:
AlcoR mapper -v -n -m 13:50:0:1:10:0.9/5:10:0.9 --dna -w 3 -t 0.5 sample.fasta
To see the possible options type
AlcoR
or
AlcoR -h
To view the options of each sub-program, type
AlcoR extract -h
AlcoR mapper -h
AlcoR simulation -h
AlcoR visual -h
If you use this software/method, please cite:
- pending
For any issue, please let us know through the issues page of the AlcoR repository.
GPL v3.
For more information:
http://www.gnu.org/licenses/gpl-3.0.html