path: root/parallel
Age | Commit message | Author | Files | Lines
2022-09-22 | parallel/openmpi: fix dependent build with Fortran modules | thor | 1 | -1/+3
2022-09-22 | parallel/mpi-ch: fix dependent build with Fortran modules | thor | 1 | -1/+3
2022-09-06 | py-ipyparallel: fix PLIST | wiz | 2 | -4/+4
Bump PKGREVISION. Fix a pkglint warning while here.
2022-08-25 | parallel/openmpi: fix buildlink path from wip | thor | 1 | -2/+2
Stupid! Redundant! Sorry.
2022-08-25 | parallel/openmpi: update to 4.1.4 | thor | 15 | -698/+318
This is a long overdue update with some refinements, notably default-on f90 option. We skipped some major releases, so please excuse me not including the whole changelog. See https://open-mpi.org for more information.
2022-08-25 | parallel/mpi-ch: update to 3.4.3 | thor | 9 | -45/+498
This also removes the MPI_PREFIX business. No parallel installation of multiple MPIs intended right now. Also, the f90 option is default now to support upcoming parallel-computing packages in the default setup.

===============================================================================
Changes in 3.4.3
===============================================================================
# Fix bugs in GPFS ROMIO driver
# Fix bugs in DAOS ROMIO driver
# Fix nemesis shm bug on systems with weak memory consistency
# Fix progress issue in ch4:ucx finalize
# Fix nonblocking collective issue during finalize
# Fix UBSan warnings in 32-bit builds
# Fix predefined datatype handle leaks
# Fix bug in topology-aware communicator creation
# Fix Slurm nodelist parsing
# Fix testcase for out-of-tree configuration
# Fix truncation testcase for possible disconnection error
# Hydra help message improvements
# Add support for DAOS pool and container labels

===============================================================================
Changes in 3.4.2
===============================================================================
# Update configure options to support --disable-ze and --disable-cuda to explicitly disable GPU-awareness
# Update embedded hwloc to version 2.4.1. Fixes a build issue on systems with OpenCL.
# Fixes for Argobots thread package support.
# Fixes for the DAOS ROMIO driver (thanks to Intel for contributing)
# Fix additional bugs in GPU-aware nonblocking op collectives
# Fix 'make check' errors with embedded MPL and json-c libraries
# Fix support for GNI provider in ch4:ofi netmod
# Fix bug in dynamic window memory registration with ch4:ofi netmod
# Fix bug in persistent pt2pt when used with MPI_PROC_NULL
# Fix bug in hydra nameserver when freeing published entries
# Fix bugs in GPFS ROMIO driver (thanks to IBM for contributing)
# Fix bug in darray type creation (thanks to IBM for contributing)
# Fix bugs in XPMEM support
# Fix bug in MPI_Comm_spawn_multiple where non-root args were not ignored

===============================================================================
Changes in 3.4.1
===============================================================================
# Fix bug in GPU memory hooks that caused failures with NCCL
# Fix bug in GPU-aware nonblocking op collectives
# Hide symbols from embedded json-c module
# Removed anonymous struct/union usage in MPL
# Disable Java support in embedded UCX module

===============================================================================
Changes in 3.4
===============================================================================
# ch4 replaces ch3 as the default device configuration. If no network module is specified at configuration-time, MPICH will search the user environment in order to select one to build. The user will be prompted to choose if no preferred network library is detected.
# Add support for Yaksa datatype engine (default in ch4).
# Add support for GPU buffers (CUDA, Level Zero) in pt2pt, collectives, and one-sided communication.
# Add support for XPMEM.
# Add support for multiple virtual communication interfaces for more efficient MPI_THREAD_MULTIPLE (experimental).
# Add DAOS ADIO driver to ROMIO (contributed by Intel).
# Add Quobyte ADIO driver to ROMIO (contributed by Quobyte).
# Add support for Arm compiler toolchain
# Add support for NVIDIA HPC compilers
# Add support for flang/f18 Fortran compiler
# Add support for AddressSanitizer and UndefinedBehaviorSanitizer to debug configuration
# Remove mxm, llc, and portals4 netmods from ch3.
# Remove support for logical reduction operations on floating point types.
# Remove MPIX_Mutex interfaces.
# Further improvements to ch4 business card exchange: extra long address support and fixes for PMIx integration.
# Un-inline non-critical ch4 code for improved build times.
# Fix several test program bugs.
# Fix several static analysis and compiler warnings.
# Change the signature of MPID_Init to include requested and provided thread levels.

===============================================================================
Changes in 3.3.2
===============================================================================
# Add support for struct sockaddr in MPICH, Hydra, and PMI socket code. Works with both IPv4 and IPv6 addresses.
# Fix localhost detection on FreeBSD and macOS, avoiding long delay during startup.
# Fix thread-local storage detection.
# Fix several test program bugs.
# Fix several static analysis and compiler warnings.

===============================================================================
Changes in 3.3.1
===============================================================================
# Fix bug in MPI_Testany/MPI_Waitany that could cause deadlock
# Add missing functionality in Argobots library support
# Fix configure-time detection for thread local storage support
# Better support for reproducible builds. Thanks to Bernhard Wiedemann for the report and fixes
# Fix support for XL compiler toolchain
# Add support for -static-intel linking option
# Fix building on systems without weak symbols
# Fix several static analysis and compiler warnings

===============================================================================
Changes in 3.3
===============================================================================
# CH4 Device: A new device layer implementation designed for low software overheads. CH4 has experimental support for OFI and UCX network libraries, and POSIX shared memory. Thanks to Intel, Mellanox, and RIKEN AICS for participating in the CH4 coding effort.
# Fixed SLURM integration in Hydra for new node list format.
# Added support for PMIx (https://pmix.github.io/pmix/) client library in CH4 netmods. Note that you must use a compatible PMIx server in this configuration.
# Better organization of collectives in the MPI layer. The new scheme, which de-couples implementation from selection logic, enables easier integration of additional algorithms.
# TSP collectives framework: A C++-template style framework for collective algorithms is added to allow single collective implementation to move data over generic or device-specific transport functions.
# Improvements to derived datatype testing (DTPools - https://wiki.mpich.org/mpich/index.php/DTPools).
# Added new "non-catastrophic" error codes to expose internal resource exhaustion.
# Added info hints to MPI_Comm_split_type to support splitting communicators by machine topology. Both on-node (socket, core, etc.) and off-node (switch-level) hints are defined.
# Improvements to MPI_THREAD_MULTIPLE in CH4 through new thread safety models at the Virtual Network Interface (VNI) level. This introduces two new models that leverage work-queues to offload operations and improve scalability under contention.
# Message Driven Thread Activation (MDTA). An alternative locking model is defined for MPI_THREAD_MULTIPLE in CH4.
# Added PMI usage optimizations for business card exchange in CH4 netmods.
# Improvements on MPI_Abort. MPI_Abort invoked on subcommunicators will only abort the connected processes within that communicator.
# Cleanup of whitespace (ch3 excluded) using the maint/code-cleanup.sh script. For instructions on how to update PRs/branches based on MPICH before the cleanup, see https://github.com/pmodels/mpich/wiki/Code-Cleanup-Procedure.
# Removed the PAMI device and poe PMI client.
# C99 compiler support is now required to build MPICH.
# Several other minor bug fixes, memory leak fixes, and code cleanup.

A full list of changes is available at the following link:
http://git.mpich.org/mpich.git/shortlog/v3.2..v3.3

A list of bugs that have been fixed is available at the following link:
https://github.com/pmodels/mpich/milestone/25?closed=1
2022-08-22 | hwloc: explicitly disable opencl | markd | 1 | -1/+2
2022-08-17 | opencl-clang: explicitly set GITHUB_PROJECT | tnn | 1 | -4/+4
This works around a subtle bug in github.mk. Long story short is GITHUB_PROJECT is used to infer WRKSRC when github.mk is in use. In addition, GITHUB_PROJECT is inferred from DISTNAME when not explicitly set by the package. If one wants to reference WRKSRC in a BUILDLINK_TRANSFORM, that breaks because cwrappers.mk is evaluated before bsd.fetch.mk and GITHUB_PROJECT is not yet known at that point, leading to an invalid WRKSRC expansion.
2022-08-17 | opencl-clang: take out --version-script where not supported | tnn | 1 | -1/+5
2022-08-15 | opencl-clang: update to 14.0.0 to fix the build | tnn | 4 | -27/+9
Changes unknown + untested.
2022-08-15 | spirv-llvm-translator: update to 14.0.0 to fix the build | tnn | 25 | -346/+28
Changes unknown.
2022-08-15 | spirv-llvm-translator: mark as BROKEN | wiz | 1 | -1/+3
This version does not support llvm 14. Please update to 14.0.0 (available) which supports it.
2022-08-15 | pocl: mark as BROKEN | wiz | 1 | -1/+3
This needs to be updated to 3.0 (available) to support llvm 14.
2022-08-12 | parallel/openmp: llvm update | pin | 3 | -12/+10
2022-07-29 | parallel/openmpi: Update DESCR from upstream | gdt | 1 | -7/+12
The only change is MPI 3.1 from 2 (labeled as "implemented or in short-term development", so that's not wrong). Add a note that the pkgsrc version is old and is MPI 3.0 (by inspection from mpi.h).
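The "by inspection from mpi.h" step can be sketched in a few lines. The MPI standard requires mpi.h to define MPI_VERSION and MPI_SUBVERSION, so reading those two macros gives the implemented standard level. The helper name `mpi_version_from_header` and the header excerpt below are hypothetical and only serve as an illustration; this is not how the commit author necessarily did it.

```python
import re


def mpi_version_from_header(header_text):
    """Return (MPI_VERSION, MPI_SUBVERSION) from header text, or None if absent."""
    found = {}
    for name in ("MPI_VERSION", "MPI_SUBVERSION"):
        # Match lines such as "#define MPI_VERSION 3" (optional spaces after '#').
        match = re.search(
            r"^\s*#\s*define\s+" + name + r"\s+(\d+)\b",
            header_text,
            re.MULTILINE,
        )
        if match is None:
            return None
        found[name] = int(match.group(1))
    return found["MPI_VERSION"], found["MPI_SUBVERSION"]


# Made-up excerpt for illustration; not copied from any real mpi.h.
SAMPLE_HEADER = """\
/* hypothetical header excerpt */
#define MPI_VERSION 3
#define MPI_SUBVERSION 0
"""

print(mpi_version_from_header(SAMPLE_HEADER))
```

To check an installed package, the same function could be fed the contents of the real mpi.h from the installation prefix.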
2022-07-27 | openmpi: use https for master site | wiz | 1 | -2/+2
2022-07-27 | parallel/mpi-ch: Fix CHECK_WRKREF_SKIP paths | gdt | 1 | -4/+4
2022-07-27 | parallel/mpi-ch: Add CHECK_WRKREF_SKIP | gdt | 1 | -1/+7
The binaries have the WRKDIR in strings, printed via the -info argument to show how it was built. (I don't understand how this is building for anyone else.)
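To see which installed files would trip a work-directory reference check, a byte-level scan is enough. The following is a rough sketch of that kind of scan, not pkgsrc's actual check implementation; the function name `files_referencing`, the file names, and the work-directory path are all hypothetical.

```python
import os
import tempfile


def files_referencing(root, needle):
    """Return sorted paths (relative to root) of files whose bytes contain needle."""
    needle_bytes = needle.encode()
    hits = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            with open(path, "rb") as handle:
                if needle_bytes in handle.read():
                    hits.append(os.path.relpath(path, root))
    return sorted(hits)


def demo():
    """Build a throwaway tree with one 'binary' that embeds the work directory."""
    wrkdir = "/hypothetical/work/dir"  # placeholder, not a real pkgsrc path
    with tempfile.TemporaryDirectory() as root:
        with open(os.path.join(root, "mpicc"), "wb") as handle:
            handle.write(b"\x7fELF configured in " + wrkdir.encode() + b"\x00")
        with open(os.path.join(root, "clean.txt"), "wb") as handle:
            handle.write(b"no build paths here")
        return files_referencing(root, wrkdir)


print(demo())
```

Files reported by such a scan are the ones a package would either have to fix or list in CHECK_WRKREF_SKIP.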
2022-07-25 | *: remove pkg-config from tools where no buildlink3.mk file is included | wiz | 1 | -2/+1
Bulk build on NetBSD of these packages had the same result as before (build succeeds, no PLIST change).
2022-07-24 | openmp: Ensure proper alloca definition is found on NetBSD. | nia | 1 | -3/+5
Reported by Mayuresh.
2022-07-22 | parallel/pvm3: use -fallow-argument-mismatch for fortran sources. | he | 4 | -10/+32
Respect passed LDFLAGS. Add comments to a couple of patches.
2022-07-05 | py-ipyparallel: updated to 8.4.1 | adam | 3 | -9/+9
8.4.1 is a tiny release, adding support for Python 3.11
2022-07-01 | threadingbuildingblocks: fix build for GCC | adam | 2 | -1/+17
2022-07-01 | threadingbuildingblocks: updated to 2021.5.0 | adam | 5 | -46/+19
oneTBB 2021.5.0

What's New
- Reworked synchronization mechanism to reduce contention when multiple task_arena’s are used concurrently.
- Fixed sporadic memory corruption.
- Enabled Microsoft Visual Studio* 2022 and Python 3.9 support.

Preview features
- Extended task_group interface with a new run_and_wait overload to accept task_handle.

Fixed Issues
- Fixed possible correctness issue in queuing_rw_mutex on non-Intel platforms.
- Fixed GCC* 11 warnings.
- Fixed sporadic memory corruption.
2022-06-30 | *: Revbump packages that use Python at runtime without a PKGNAME prefix | nia | 1 | -2/+2
2022-06-28 | *: recursive bump for perl 5.36 | wiz | 13 | -21/+26
2022-06-17 | py-ipyparallel: updated to 8.4.0 | adam | 4 | -127/+34
8.4.0 is a small release, with some bugfixes and improvements to the release process.

Bugfixes:
- (`%px`) only skip redisplay of streamed errors if outputs are complete

Compatibility improvements:
- Avoid use of recently deprecated asyncio/tornado APIs around 'current' event loops that are not running.

Build improvements:
- Switch to hatch backend for packaging
2022-05-16 | threadingbuildingblocks: Portability fixes. | jperkin | 3 | -2/+21
2022-05-01 | py-case, py-billiard: not for python 2.x | wiz | 2 | -7/+4
2022-04-18 | revbump for textproc/icu update | adam | 10 | -20/+20
2022-04-06 | py-ipyparallel: updated to 8.2.1 | adam | 3 | -11/+11
8.2.1 Fixes some compatibility issues with latest dask, ipykernel, and setuptools, as well as some typos and improved documentation.
2022-04-02 | sge: get slightly further building, still BROKEN | nia | 3 | -5/+29
2022-03-25 | gridscheduler: remove | wiz | 16 | -1247/+1
Last release from 2011, no upstream activity since 2012, does not build in any bulk builds I could find.
2022-03-06 | lua-lanes: update to 3.16.0 | nia | 4 | -69/+6
Changes:
- fix require() wrapper to return all values returned by original require()
- Lanes no longer relies on malloc/free for internal allocations, but uses the primary alloc function from the master Lua state (except for LuaJIT flavors, because the allocator seems to assume that it won't be used outside the Lua state).
- __lanesclone is now called only once with 3 parameters dest, source, size -> BREAKS CUSTOM DEEP USERDATA API
2022-02-13 | py-ipyparallel: revert previous, but make py-packaging a build dependency | wiz | 1 | -4/+7
2022-02-13 | py-ipyparallel: add a missing dependency | wiz | 1 | -6/+4
Remove some dependencies that are not requested explicitly and pulled in anyway.
2022-02-07 | py-ipyparallel: updated to 8.2.0 | adam | 3 | -11/+11
8.2.0 is a small release, mostly of small bugfixes and improvements.

Changes:
- `len(AsyncMapResult)` and progress reports now use the number of items in the map, not the number of messages.

Enhancements:
- Show output prior to errors in `%%px`

Bugs fixed:
- Fix cases where engine id could be `-1` in tracebacks
- Add missing `pbs` to engine launcher entrypoints
2022-02-04 | py-ipyparallel: added version 8.1.0 | adam | 7 | -1/+368
IPython Parallel (ipyparallel) is a Python package and collection of CLI scripts for controlling clusters of IPython processes, built on the Jupyter protocol.
2022-02-03 | llvm: updated to 13.0.1 | adam | 1 | -4/+4
13.0.1: Bug-fix release.
2022-01-04 | *: bump PKGREVISION for egg.mk users | wiz | 1 | -1/+2
They now have a tool dependency on py-setuptools instead of a DEPENDS
2021-12-08 | revbump for icu and libffi | adam | 11 | -19/+22
2021-11-23 | llvm: update llvm and friends to 13.0.0 | wiz | 26 | -43/+370
Tested on NetBSD-current and 9.1/amd64. Also update packages tightly coupled to llvm, if new versions are available. Mark creduce, include-what-you-use, and zig broken (waiting for a new release).

Changes:

Non-comprehensive list of changes in this release
- Windows Control-flow Enforcement Technology: the -ehcontguard option now emits valid unwind entrypoints which are validated when the context is being set during exception handling.
- Flang is now included in the binary packages released by LLVM.
- The debuginfo-test project has been renamed cross-project-tests and is now intended for testing components from multiple projects, not just debug information. The new “cross-project-tests” name replaces “debuginfo-test” in LLVM_ENABLE_PROJECTS, and a new check-cross-project-tests target has been added for running all tests in the project. The pre-existing check-debuginfo-test target remains for running just the debug information tests. (D95339 and D96513)

Changes to the LLVM IR
- The inalloca attribute now has a mandatory type field, similar to byval and sret.
- The opaque pointer type ptr has been introduced. It is still in the process of being worked on and should not be used yet.
- Using the legacy pass manager for the optimization pipeline is deprecated and will be removed after LLVM 14. In the meantime, only minimal effort will be made to maintain the legacy pass manager for the optimization pipeline.

Changes to building LLVM
- The build system now supports building multiple distributions, so that you can e.g. have one distribution containing just tools and another for libraries (to enable development). See Multi-distribution configurations for details.

Changes to the AArch64 Backend
- Introduced assembly support for Armv9-A’s Realm Management Extension (RME) and Scalable Matrix Extension (SME).
- Produce proper cross-section relative relocations on COFF
- Fixed the calling convention on Windows for variadic functions involving floats in the fixed arguments

Changes to the ARM Backend
- Produce proper cross-section relative relocations on COFF

Changes to the Hexagon Target
- The Hexagon target now supports V68/HVX ISA.

Changes to the C API
- The C API functions LLVMGetAlignment and LLVMSetAlignment now allow changing alignment on atomicrmw and cmpxchg instructions
- A new entry LLVMDIArgListMetadataKind was added to the LLVMMetadataKind enum, representing DIArgLists (D88175)
- Type attributes have been added to LLVM-C, introducing LLVMCreateTypeAttribute, LLVMGetTypeAttributeValue and LLVMIsTypeAttribute. (D977763’)
- The LTO_API_VERSION was bumped to 28, introducing a new function lto_set_debug_options for parsing LTO debug options (D92611)
- LLVMJITTargetSymbolFlags was renamed to LLVMJITSymbolTargetFlags (rG8d718a0bff73af066675a6258c01307937c33cf9)
- The C API received support for creating custom ORCv2 MaterializationUnits and APIs to retrieve an LLJIT instance’s linking layers. An ABI breaking change for LLVMOrcAbsoluteSymbols was introduced to make it consistent with LLVMOrcCreateCustomMaterializationUnit. (rGc8fc5e3ba942057d6c4cdcd1faeae69a28e7b671)
- The C API received support for adding ORCv2 object buffers directly to an object layer. (rG7b73cd684a8d5fb44d34064200f10e2723085c33)
- A breaking change to LLVMGetInlineAsm was introduced, adding a ninth argument LLVMBool CanThrow (D95745)
- The C API received support for calling into the new pass manager. (D102136)
- The C API function LLVMIntrinsicCopyOverloadedName has been deprecated. Please migrate to LLVMIntrinsicCopyOverloadedName2 which takes an extra module argument and which also handles unnamed types. (D99173)
- The C API received support for dumping objects from ORCv2 (rGcec8e69f01c3374cb38c6683058381b96fab8f89)
- A breaking change to LLVMOrcObjectTransformLayerTransformFunction was introduced, changing the order of the function pointer’s arguments. (rG8962c68ad007a525f9daa987c99eda57e0d0069a)
- The C API received support for accessing utilities from the LLJIT IRTransformLayer and ThreadSafeModule classes. (D103855)
- The C API received support for creating lazy-export MaterializationUnits (D104672)
- The C API function LLVMPassBuilderOptionsSetCoroutines was removed because coroutine passes have been enabled by default. (D105877)
- comdat noduplicates was renamed to comdat nodeduplicate and as a result, LLVMNoDuplicatesComdatSelectionKind was renamed to LLVMNoDeduplicateComdatSelectionKind. (D106319)

Changes to the FastISel infrastructure
- FastISel no longer tracks killed registers, and instead leaves this to the register allocator. This means that hasTrivialKill() is removed, as well as the OpNIsKill parameters to the fastEmit_*() family of functions.

Changes to the LLVM tools
- The options --build-id-link-{dir,input,output} have been deleted. (D96310)
- Support for in-order processors has been added to llvm-mca. (D94928)
- llvm-objdump supports -M {att,intel} now. --x86-asm-syntax is a deprecated internal option which will be removed in LLVM 14.0.0. (D101695)
- The llvm-readobj short aliases -s (previously --sections) and -t (previously --syms) have been changed to --syms and --section-details respectively, to match llvm-readelf. (D105055)
- The llvm-nm short aliases -M (--print-armap), -U (--defined-only), and -W (--no-weak) are now deprecated. Use the long form versions instead. The alias --just-symbol-name is now deprecated in favor of --format=just-symbols and -j. (D105330)
- In lli the default JIT engine switched from MCJIT (-jit-kind=mcjit) to ORC (-jit-kind=orc). (D98931)
- llvm-rc got support for invoking Clang to preprocess its input. (D100755)
- llvm-rc got a GNU windres compatible frontend, llvm-windres. (D100756)
- llvm-ml has improved compatibility with MS ml.exe, managing to assemble more asm files.

Changes to LLDB
- LLDB executable is now included in pre-built LLVM binaries.
- LLDB now includes full featured support for AArch64 SVE register access.
- LLDB now supports AArch64 Pointer Authentication, allowing stack unwind with signed return address.
- LLDB now supports debugging programs on AArch64 Linux that use memory tagging (MTE).
- Added memory tag read and memory tag write commands.
- The memory region command will note when a region has memory tagging enabled.
- Synchronous and asynchronous tag faults are recognised. Synchronous tag faults have memory tag annotations in addition to the usual fault address.
2021-11-21 | openmp: Fix build on NetBSD/i386 | ryoon | 2 | -1/+18
2021-11-20 | opencl-clang: simplify | wiz | 2 | -9/+7
2021-11-20 | parallel/Makefile: + spirv-headers. | wiz | 1 | -1/+2
2021-11-20 | parallel/spirv-headers: import SPIRV-Headers-1.2.198.0 | wiz | 5 | -0/+118
This package contains machine-readable files for the SPIR-V Registry. This includes:
* Header files for various languages.
* JSON files describing the grammar for the SPIR-V core instruction set and the extended instruction sets.
* The XML registry file.
* A tool to build the headers from the JSON grammar.
2021-11-20 | pocl: update to 1.8. | wiz | 11 | -229/+10
Notable User Facing Changes
---------------------------
- support for LLVM 13
- CMake: Inter-Procedural Optimization is enabled on code of runtime library (libpocl.so is compiled with -flto on systems that support it).
- LTTng tracing improved - more command types are traced, and also some synchronous API calls (like clCreateBuffer) are traced.
- poclcc, tests and examples can be disabled with CMake options
- Valgrind support improved by making Valgrind aware of pocl's reference counting of cl_* objects
- kernels which are called by kernels are now force-inlined
- Support for NetBSD.
- Support for Unix systems without libdl.
- PoCL can now (optionally) respond to SIGUSR2 by printing some live debug information.
- improved SPIR support for CUDA devices

Notable Bug Fixes
-----------------
- Fixed a potential crash on Unix systems without sysfs mounted.
- Fixed compilation errors when building on macOS.
- Fixed POCL_FAST_INIT macro; POCL_INIT_LOCK must be invoked with only one argument.
- Fix bin/poclcc to not depend on OpenCL 2.0 symbols
- Fixed miscompilation in kernel loops with multiple conditionals with barriers in them.

Other
-----
- Add cmake options PARALLEL_COMPILE_JOBS, PARALLEL_LINK_JOBS to use ninja's separate compile and link job pools.
- Improve memory architecture, buffer migration and allocation. Buffers are now allocated on a device when first used (previously each buffer was allocated on every device in context).
- the single global LLVMContext was replaced with multiple LLVMContexts, one per OpenCL cl_context. OpenCL code can now be compiled in parallel when using separate cl_contexts. This feature is disabled by default since it significantly slowed down PyOpenCL. This should be resolved by separating LLVM compilation in their own threads in the future.
- a new OpenCL extension was added to PoCL: cl_pocl_content_size. The extension allows the user to give optimization hint to PoCL, which will be used internally by PoCL to optimize buffer transfers between multiple devices.
2021-10-26 | parallel: Replace RMD160 checksums with BLAKE2s checksums | nia | 29 | -58/+58
All checksums have been double-checked against existing RMD160 and SHA512 hashes
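For readers who want to reproduce such checksums by hand, here is a minimal sketch using Python's standard hashlib, which provides both BLAKE2s and SHA512. The `Algorithm (file) = digest` line layout is written from memory of how distinfo files look and should be treated as an assumption; the helper name `distinfo_lines` and the file name are hypothetical.

```python
import hashlib


def distinfo_lines(filename, data):
    """Format BLAKE2s, SHA512 and size lines for one distfile's bytes."""
    blake2s_hex = hashlib.blake2s(data).hexdigest()  # 32-byte digest, 64 hex chars
    sha512_hex = hashlib.sha512(data).hexdigest()    # 64-byte digest, 128 hex chars
    return [
        f"BLAKE2s ({filename}) = {blake2s_hex}",
        f"SHA512 ({filename}) = {sha512_hex}",
        f"Size ({filename}) = {len(data)} bytes",
    ]


# Hypothetical distfile contents; a real check would read the tarball in binary mode.
for line in distinfo_lines("example-1.0.tar.gz", b"hello"):
    print(line)
```

Comparing the SHA512 line against the existing entry before swapping RMD160 for BLAKE2s is one way to confirm that the file being hashed is the same distfile the old entries described.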
2021-10-07 | parallel: Remove SHA1 hashes for distfiles | nia | 29 | -58/+29
2021-09-29 | revbump for boost-libs | adam | 1 | -2/+2