summaryrefslogtreecommitdiff
path: root/biology
AgeCommit message (Collapse)AuthorFilesLines
2018-11-06samtools: updated to 1.9adam2-18/+19
Release 1.9: * Samtools mpileup VCF and BCF output is now deprecated. It is still functional, but will warn. Please use bcftools mpileup instead. * Samtools mpileup now handles the '-d' max_depth option differently. There is no longer an enforced minimum, and '-d 0' is interpreted as limitless (no maximum - warning this may be slow). The default per-file depth is now 8000, which matches the value mpileup used to use when processing a single sample. To get the previous default behaviour use the higher of 8000 divided by the number of samples across all input files, or 250. * Samtools stats new features: - The '--remove-overlaps' option discounts overlapping portions of templates when computing coverage and mapped base counting. - When a target file is in use, the number of bases inside the target is printed and the percentage of target bases with coverage above a given threshold specified by the '--cov-threshold' option. - Split base composition and length statistics by first and last reads. * Samtools faidx new features: - Now takes long options. - Now warns about zero-length and truncated sequences due to the requested range being beyond the end of the sequence. - Gets a new option (--continue) that allows it to carry on when a requested sequence was not in the index. - It is now possible to supply the list of regions to output in a text file using the new '--region-file' option. - New '-i' option to make faidx return the reverse complement of the regions requested. - faidx now works on FASTQ (returning FASTA) and added a new fqidx command to index and return FASTQ. * Samtools collate now has a fast option '-f' that only operates on primary pairs, dropping secondary and supplementary. It tries to write pairs to the final output file as soon as both reads have been found. * Samtools bedcov gets a new '-j' option to make it ignore deletions (D) and reference skips (N) when computing coverage. * Small speed up to samtools coordinate sort, by converting it to use radix sort. * Samtools idxstats now works on SAM and CRAM files, however this isn't fast due to some information lacking from indices. * Compression levels may now be specified with the level=N output-fmt-option. E.g. with -O bam,level=3. * Various documentation improvements. * Bug-fixes: - Improved error reporting in several places. - Various test improvements. - Fixed failures in the multi-region iterator (view -M) when regions provided via BED files include overlaps - Samtools stats now counts '=' and 'X' CIGAR operators when counting mapped bases. - Samtools stats has fixes for insert size filtering (-m, -i). - Samtools stats -F now longer negates an earlier -d option. - Fix samtools stats crash when using a target region. - Samtools sort now keeps to a single thread when the -@ option is absent. Previously it would spawn a writer thread, which could cause the CPU usage to go slightly over 100%. - Fixed samtools phase '-A' option which was incorrectly defined to take a parameter. - Fixed compilation problems when using C_INCLUDE_PATH. - Fixed --version when built from a Git repository. - Use noenhanced mode for title in plot-bamstats. Prevents unwanted interpretation of characters like underscore in gnuplot version 5. - blast2sam.pl now reports perfect match hits (no indels or mismatches). - Fixed bug in fasta and fastq subcommands where stdout would not be flushed correctly if the -0 option was used. - Fixed invalid memory access in mpileup and depth on alignment records where the sequence is absent.
2018-11-06htslib: updated to 1.9adam2-22/+17
1.9: If ./configure fails, make will stop working until either configure is re-run successfully, or make distclean is used. This makes configuration failures more obvious. The default SAM version has been changed to 1.6. This is in line with the latest version specification and indicates that HTSlib supports the CG tag used to store long CIGAR data in BAM format. bgzip integrity check option '--test' Faidx can now index fastq files as well as fasta. The fastq index adds an extra column to the .fai index which gives the offset to the quality values. New interfaces have been added to htslib/faidx.h to read the fastq index and retrieve the quality values. It is possible to open a fastq index as if fasta (only sequences will be returned), but not the other way round. New API interfaces to add or update integer, float and array aux tags. Add level=<number> option to hts_set_opt() to allow the compression level to be set. Setting level=0 enables uncompressed output. Improved bgzip error reporting. Better error reporting when CRAM reference files can't be opened. Fixes to make tests work properly on Windows/MinGW - mainly to handle line ending differences. Efficiency improvements: Small speed-up for CRAM indexing. Reduce the number of unnecessary wake-ups in the thread pool. Avoid some memory copies when writing data, notably for uncompressed BGZF output. Bug fixes: Fix multi-region iterator bugs on CRAM files. Fixed multi-region iterator bug that caused some reads to be skipped incorrectly when reading BAM files. Fixed synced_bcf_reader() bug when reading contigs multiple times. Fixed bug where bcf_hdr_set_samples() did not update the sample dictionary when removing samples. Fixed bug where the VCF record ref length was calculated incorrectly if an INFO END tag was present. (71b00a) Fixed warnings found when compiling with gcc 8.1.0. sam_hdr_read() and sam_hdr_write() will now return an error code if passed a NULL file pointer, instead of crashing. Fixed possible negative array look-up in sam_parse1() that somehow escaped previous fuzz testing. Fixed bug where cram range queries could incorrectly report an error when using multiple threads. Fixed very rare rANS normalisation bug that could cause an assertion failure when writing CRAM files.
2018-10-31biology: +py-pydicomleot1-1/+2
2018-10-31py-pydicom: Import py-pydicom-1.2.0 as biology/py-pydicomleot4-0/+465
Pydicom is a pure Python package for working with DICOM files such as medical images, reports, and radiotherapy objects. Pydicom makes it easy to read these complex files into natural pythonic structures for easy manipulation. Modified datasets can be written again to DICOM format files. Packaged by Eric A. Borisch via NetBSD/pkgsrc#37, thank you Eric!
2018-09-29xylem: build fixtnn2-1/+16
2018-08-22Recursive bump for perl5-5.28.0wiz11-18/+22
2018-08-04biology/plinkseq: Requires c++11 to build with protobuf-3.6.0minskim1-3/+3
2018-07-25Add trimmomaticbacon1-1/+2
2018-07-25biology/trimmomatic: import Trimmomatic-0.38bacon4-0/+47
Trimmomatic performs a variety of useful trimming tasks for illumina paired-end and single ended data. The selection of trimming steps and their associated parameters are supplied on the command line. It works with FASTQ (using phred + 33 or phred + 64 quality scores, depending on the Illumina pipeline used), either uncompressed or gzipp'ed FASTQ.
2018-07-20Recursive revbump from textproc/icu-62.1ryoon1-2/+2
2018-07-18Mark packages that require C++03 (or the GNU variants) if they fail withjoerg2-4/+4
C++14 default language.
2018-07-04*: Move SUBST_STAGE from post-patch to pre-configurejperkin3-7/+7
Performing substitutions during post-patch breaks tools such as mkpatches, making it very difficult to regenerate correct patches after making changes, and often leading to substituted string replacements being committed.
2018-07-03extend PYTHON_VERSIONS_ for Python 3.7adam1-2/+2
2018-05-22biology/ncbi-blast+: Respect env to support PKGSRC_USE_RELRObacon3-8/+9
Fix a previous patch that hard-coded relro support by patching in pkgsrc CFLAGS, CXXFLAGS, and LDFLAGS instead. OK wiz@
2018-05-07Add samtoolsbacon1-1/+2
2018-05-07biology/samtools: import samtools-1.8bacon4-0/+58
Samtools implements various utilities for post-processing alignments in the SAM, BAM, and CRAM formats, including indexing, variant calling (in conjunction with bcftools), and a simple alignment viewer. OK wiz@
2018-05-01biology/htslib: Fix category in bl3bacon1-2/+2
2018-04-30Add htslibbacon1-1/+2
2018-04-30biology/htslib: import htslib-1.8bacon5-0/+94
HTSlib is an implementation of a unified C library for accessing common file formats, such as SAM, CRAM, VCF, and BCF, used for high-throughput sequencing data. It is the core library used by samtools and bcftools.
2018-04-29ncbi-blast+: removed references to wipadam1-2/+2
2018-04-27Add ncbi-blast+bacon1-1/+2
2018-04-27biology/ncbi-blast+: import ncbi-blast+-2.7.1bacon11-0/+4348
The Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families. OK wiz@
2018-04-16Recursive bump for new fribidi dependency in pango.wiz4-8/+8
2018-03-12Recursive bumps for fontconfig and libzip dependency changes.wiz4-8/+8
2018-02-11plinkseq: use httpswiz1-3/+3
2018-01-17Belated PKGREVISION bump for devel/protobuf update.jperkin1-2/+2
Fixes at least joyent/pkgsrc#60.
2018-01-07Fix indentation in buildlink3.mk files.rillig1-2/+2
The actual fix as been done by "pkglint -F */*/buildlink3.mk", and was reviewed manually. There are some .include lines that still are indented with zero spaces although the surrounding .if is indented. This is existing practice.
2018-01-01Sort PLIST files.rillig3-6/+6
Unsorted entries in PLIST files have generated a pkglint warning for at least 12 years. Somewhat more recently, pkglint has learned to sort PLIST files automatically. Since pkglint 5.4.23, the sorting is only done in obvious, simple cases. These have been applied by running: pkglint -Cnone,PLIST -Wnone,plist-sort -r -F
2018-01-01Replaced $(ROUND) with ${CURLY} variable references.rillig1-1/+1
This has been a pkglint warning for several years now, and pkglint can even fix it automatically. And it did for this commit. Only in lang/mercury, two passes of autofixing were necessary because there were nested variables.
2018-01-01Cleanup: replace curly braces with parentheses.rillig3-15/+15
2017-12-27Re-add patch to bring timeval definition and select prototype into scope.he3-2/+19
Bump PKGREVISION.
2017-12-24chemical-mime-data: comment out dead siteswiz1-4/+3
2017-12-17biology/bwa: Update to 0.7.17bacon4-34/+39
Numerous bug fixes, enhancements, and new command-line options ok wiz@
2017-10-03transfig: remove, replaced by print/fig2devwiz1-2/+2
2017-09-17p5-Bio-ASN1-EntrezGene: update to 1.72.wiz2-8/+10
1.72 2016-09-02 06:50:53-05:00 America/Chicago * Full release (no changes from 1.71 beyond version) 1.71 2016-09-01 22:57:25-05:00 America/Chicago (TRIAL RELEASE) * Minor bump for impending BioPerl 1.7 release. * #2 : 'Unescaped left brace in regex is deprecated' with newer versions of perl fixed [fjossandon]
2017-09-17bioperl: update to 1.007002.wiz2-7/+9
Add p5-Text-Diff as test dependency. 1.7.2 - "Entebbe" [Bugs] * #247 - Omit unnecessary parent_id attribute added by GFF3Loader [nathanweeks] * #245 - Code coverage fixes [zmughal,cjfields] * #237 - Fix warning in Bio::DB::IndexedBase [willmclaren,bosborne] * #238 - Use a Travis cron job for network tests [zmughal,cjfields] * #218 - Bio::DB::Flat::BinarySearch should use _fh() instead of fh() as fh() does not take arguments in [thibauthourlier,bosborne] * #227 - Bio::SeqIO Ignores first line of sequence [VAR121,bosborne] * #223 - Use Travis Perl helper script and enable coverage [zmughal,cjfields] * #222 - Fix test RemoteDB/Taxonomy.t: requires networking [zmughal,cjfields] * #216 - Apply carsonhh's patch (Inline::C fixes) [carsonh,bosborne] * #213 - Support FTS5 in Bio::DB::SeqFeature::Store::DBI::SQLite [nathanweeks,bosborne] * #210 - Sorting qualifiers while write embl files [hdevillers,cjfields] * #209 - Fixed bug in _toDsspKey() [jvolkening,hlapp] [Code changes] * PAML-related code from bioperl and bioperl-run are now in a separate distribution on CPAN [carandraug]
2017-09-04Comment out dead sites.wiz1-2/+2
2017-09-03Comment out dead MASTER_SITES/HOMEPAGEs.wiz1-3/+3
2017-08-01Comment out some dead HOMEPAGEs.wiz5-10/+10
2017-06-05Recursive revbump from lang/perl5 5.26.0ryoon1-2/+2
2017-05-10 Deleting p5-BioPerl. Duplicate with bioperl. Sorry.mef4-37/+1
2017-05-07Add following lines for make testmef1-1/+4
BUILD_DEPENDS+= p5-Test-Most-[0-9]*:../../devel/p5-Test-Most BUILD_DEPENDS+= p5-Text-Diff-[0-9]*:../../textproc/p5-Text-Diff
2017-05-07Added biology/p5-BioPerl version 1.007001mef1-1/+2
2017-05-07Import p5-BioPerl-1.007001 as biology/p5-BioPerl.mef3-0/+32
Easy first time access to BioPerl via functions.
2017-02-14Add missing include.joerg2-1/+14
2017-02-12Recursive revbump from fonts/harfbuzzryoon4-8/+8
2017-02-06Recursive bump for harfbuzz's new graphite2 dependency.wiz4-8/+8
2017-01-19Convert all occurrences (353 by my count) ofagc2-7/+7
MASTER_SITES= site1 \ site2 style continuation lines to be simple repeated MASTER_SITES+= site1 MASTER_SITES+= site2 lines. As previewed on tech-pkg. With thanks to rillig for fixing pkglint accordingly.
2017-01-01Add python-3.6 to incompatible versions.wiz1-2/+2
2016-12-17Fixed pkglint warnings.rillig3-10/+10