summaryrefslogtreecommitdiff
path: root/biology/racon
diff options
context:
space:
mode:
authorbrook <brook@pkgsrc.org>2021-05-26 18:53:39 +0000
committerbrook <brook@pkgsrc.org>2021-05-26 18:53:39 +0000
commit315d1a09752fb70777fcfa3b71ab0d4274f93179 (patch)
tree5c4d4d0a5e02976039db4cf867793f8d1851f508 /biology/racon
parentf9e63d7f42343c9b0be99e2b0eabe2577990ed9d (diff)
downloadpkgsrc-315d1a09752fb70777fcfa3b71ab0d4274f93179.tar.gz
biology/racon: add racon 1.4.3
## Description Racon is intended as a standalone consensus module to correct raw contigs generated by rapid assembly methods which do not include a consensus step. The goal of Racon is to generate genomic consensus which is of similar or better quality compared to the output generated by assembly methods which employ both error correction and consensus steps, while providing a speedup of several times compared to those methods. It supports data produced by both Pacific Biosciences and Oxford Nanopore Technologies. Racon can be used as a polishing tool after the assembly with **either Illumina data or data produced by third generation of sequencing**. The type of data inputed is automatically detected. Racon takes as input only three files: contigs in FASTA/FASTQ format, reads in FASTA/FASTQ format and overlaps/alignments between the reads and the contigs in MHAP/PAF/SAM format. Output is a set of polished contigs in FASTA format printed to stdout. All input files **can be compressed with gzip** (which will have impact on parsing time). Racon can also be used as a read error-correction tool. In this scenario, the MHAP/PAF/SAM file needs to contain pairwise overlaps between reads **including dual overlaps**. A **wrapper script** is also available to enable easier usage to the end-user for large datasets. It has the same interface as racon but adds two additional features from the outside. Sequences can be **subsampled** to decrease the total execution time (accuracy might be lower) while target sequences can be **split** into smaller chunks and run sequentially to decrease memory consumption. Both features can be run at the same time as well.
Diffstat (limited to 'biology/racon')
-rw-r--r--biology/racon/DESCR12
-rw-r--r--biology/racon/Makefile32
-rw-r--r--biology/racon/PLIST3
-rw-r--r--biology/racon/distinfo6
4 files changed, 53 insertions, 0 deletions
diff --git a/biology/racon/DESCR b/biology/racon/DESCR
new file mode 100644
index 00000000000..2e436862b90
--- /dev/null
+++ b/biology/racon/DESCR
@@ -0,0 +1,12 @@
+Racon is intended as a standalone consensus module to correct raw
+contigs generated by rapid assembly methods which do not include a
+consensus step. The goal of Racon is to generate genomic consensus
+which is of similar or better quality compared to the output generated
+by assembly methods which employ both error correction and consensus
+steps, while providing a speedup of several times compared to those
+methods. It supports data produced by both Pacific Biosciences and
+Oxford Nanopore Technologies.
+
+Racon can be used as a polishing tool after the assembly with either
+Illumina data or data produced by third generation of sequencing. The
+type of data inputed is automatically detected.
diff --git a/biology/racon/Makefile b/biology/racon/Makefile
new file mode 100644
index 00000000000..f7a7be58d25
--- /dev/null
+++ b/biology/racon/Makefile
@@ -0,0 +1,32 @@
+# $NetBSD: Makefile,v 1.1 2021/05/26 18:53:39 brook Exp $
+
+GITHUB_PROJECT= racon
+DISTNAME= racon-v1.4.3
+PKGNAME= ${DISTNAME:S,-v,-,}
+CATEGORIES= biology
+MASTER_SITES= ${MASTER_SITE_GITHUB:=isovic/}
+GITHUB_RELEASE= 1.4.3
+
+MAINTAINER= pkgsrc-users@NetBSD.org
+HOMEPAGE= https://github.com/isovic/racon/
+COMMENT= Genomic consensus builder
+LICENSE= mit
+
+USE_CMAKE= yes
+USE_TOOLS+= pkg-config
+USE_LANGUAGES= c c++
+
+PKGCONFIG_OVERRIDE+= vendor/bioparser/vendor/zlib/contrib/minizip/minizip.pc.in
+PKGCONFIG_OVERRIDE+= vendor/bioparser/vendor/zlib/zlib.pc.in
+PKGCONFIG_OVERRIDE+= vendor/googletest/googlemock/cmake/gmock.pc.in
+PKGCONFIG_OVERRIDE+= vendor/googletest/googlemock/cmake/gmock_main.pc.in
+PKGCONFIG_OVERRIDE+= vendor/googletest/googletest/cmake/gtest.pc.in
+PKGCONFIG_OVERRIDE+= vendor/googletest/googletest/cmake/gtest_main.pc.in
+PKGCONFIG_OVERRIDE+= vendor/spoa/spoa.pc.in
+
+INSTALLATION_DIRS+= share/doc/racon
+
+post-install:
+ ${INSTALL_DATA} ${WRKSRC}/README.md ${DESTDIR}${PREFIX}/share/doc/racon
+
+.include "../../mk/bsd.pkg.mk"
diff --git a/biology/racon/PLIST b/biology/racon/PLIST
new file mode 100644
index 00000000000..2341fde0401
--- /dev/null
+++ b/biology/racon/PLIST
@@ -0,0 +1,3 @@
+@comment $NetBSD: PLIST,v 1.1 2021/05/26 18:53:39 brook Exp $
+bin/racon
+share/doc/racon/README.md
diff --git a/biology/racon/distinfo b/biology/racon/distinfo
new file mode 100644
index 00000000000..1c49e39e0c1
--- /dev/null
+++ b/biology/racon/distinfo
@@ -0,0 +1,6 @@
+$NetBSD: distinfo,v 1.1 2021/05/26 18:53:39 brook Exp $
+
+SHA1 (racon-v1.4.3.tar.gz) = 472afb308a07f83689f32777f044ba1cc5cd812b
+RMD160 (racon-v1.4.3.tar.gz) = 40528de630d4063582e528b56ca0fdf815a002fe
+SHA512 (racon-v1.4.3.tar.gz) = 4ac1b62b194f237e3436d8384e6fadbf86ba3ea1ceef64000aca15d04a67ba38b189aad19966172c5ef3124a3b284488ec28f000f67b5b374402d64f208b0395
+Size (racon-v1.4.3.tar.gz) = 51468817 bytes