Age | Commit message (Collapse) | Author | Files | Lines |
|
BUGS FIXED
* Make Intel Broadwell configurations work on Windows.
FEATURES
* Work around faulty cpuid on some recent Intel chips (this allows GMP to run on Skylake Pentiums).
* Support thumb-less ARM chips.
|
|
Changes between GMP version 6.0.* and 6.1.0
BUGS FIXED
* The public function mpn_com is now correctly declared in gmp.h.
* Healed possible failures of mpn_sec_sqr for non-cryptographic sizes for
some obsolete CPUs.
* The option --disable-assembly now disables all inlined asm.
* Fixed bug affecting mini-gmp's bitwise functions mpz_setbit, mpz_clrbit,
and mpz_combit.
* Various problems related to precision for mpf have been fixed.
* Fixed ABI incompatible stack alignment in calls from assembly code.
* Fixed PIC bug in popcount affecting Intel processors using the 32-bit ABI.
SPEEDUPS
* Speedup for Intel Broadwell and Skylake though assembly code making use of
new ADX instructions.
* Square root is now faster when the remainder is not needed. Also the speed
to compute the k-th root improved, for small sizes.
* Improved arm64 support.
FEATURES
* New C++ functions gcd and lcm for mpz_class.
* New public mpn functions mpn_divexact_1, mpn_zero_p, and mpn_cnd_swap.
* New public mpq_cmp_z function, to efficiently compare rationals with
integers.
* Support for Darwin in all x86 code, thereby enabling fat builds on Darwin.
* Support for more 32-bit arm processors.
* Support for compilation with clang/llvm on more platforms. Caution: GMP
triggers mis-compilation bugs in clang for many platforms, such as arm, x86
(32-bit and 64-bit), powerpc, mips.
* Support for AVX-less modern x86 CPUs. (Such support might be missing either
because the CPU vendor chose to disable AVX, or because the running kernel
lacks AVX context switch support.)
* Stack usage trimmed; we believe 512 KiB is now sufficient for any GMP
call, irrespective of operand size.
* Support for NetBSD under Xen; we switch off AVX unconditionally under
NetBSD since a bug in NetBSD makes AVX fail under Xen.
MISC
* We now use manufacturers' code names for x86 CPUs, e.g., "haswell" instead
of names derived from the commercial brands.
* Small improvements and better coverage for the test suite.
* The various FreeBSD problems listed for 6.0.0 affect this release too.
* Tuned values for FFT multiplications are provided for larger number on
many platforms.
|
|
date: 2015-04-03 09:41:26 +0200; author: mrg; state: Exp; commitid: HjchnsCySB9wh7gy;
port some changes from netbsd gmp and adapt them to the gmp 6.x.
this allows devel/gmp to build on armv4.
|
|
Issues found with existing distfiles:
distfiles/eclipse-sourceBuild-srcIncluded-3.0.1.zip
distfiles/fortran-utils-1.1.tar.gz
distfiles/ivykis-0.39.tar.gz
distfiles/enum-1.11.tar.gz
distfiles/pvs-3.2-libraries.tgz
distfiles/pvs-3.2-linux.tgz
distfiles/pvs-3.2-solaris.tgz
distfiles/pvs-3.2-system.tgz
No changes made to these distinfo files.
Otherwise, existing SHA1 digests verified and found to be the same on
the machine holding the existing distfiles (morden). All existing
SHA1 digests retained for now as an audit trail.
|
|
The work directories are almost empty, so it doesn't save much space.
It makes debugging annoying and it breaks local DISTDIR settings.
|
|
version string... use awk instead to construct directly from gmp.h
|
|
this allows devel/gmp to build on armv4.
|
|
|
|
transformed compiler flags when built through pkgsrc because
config.guess overrides are in effect. Fixes build with cwrappers on i386.
Also add a comment that we should enable the runtime CPU detection support
on amd64 some time in the future.
|
|
|
|
hgnested is a Mercurial extension to work with nested repositories.
It was inspired by the forest extension.
The extension allows to apply common Mercurial commands to all the nested
repositories at once like pull, push etc. It also allows to fetch a complete
tree of repositories through ssh or http.
To enable the "hgnested" extension, create an entry for it in your hgrc, like
this:
[extensions]
hgnested =
See `hg help hgnested` for the complete list of commands.
For more information please visit the following website:
http://code.google.com/p/hgnested/
|
|
|
|
|
|
Changes between 6.0.0 and 6.0.0a:
Not documented.
Changes between GMP version 5.1.* and 6.0.0
BUGS FIXED
* The function mpz_invert now considers any number invertible in Z/1Z.
* The mpn multiply code now handles operands of more than 2^31 limbs
correctly. (Note however that the mpz code is limited to 2^32 bits on
32-bit hosts and 2^37 bits on 64-bit hosts.)
* Contains all fixes from release 5.1.3.
SPEEDUPS
* Plain division of large operands is faster and more monotonous in operand
size.
* Major speedup for ARM, in particular ARM Cortex-A15, thanks to improved
assembly.
* Major speedup for SPARC T4/T5 and speedup also for T3, thanks to a lot of
new assembly.
* Speedup for Intel Sandy Bridge, Ivy Bridge, Haswell, thanks to rewritten
and vastly expanded assembly support. Speedup also for the older Core 2
and Nehalem.
* Faster mixed arithmetic between mpq_class and double.
* With g++, optimise more operations when one argument is a simple constant.
FEATURES
* Support for new Intel and AMD CPUs.
* Support for ARM64 alias Aarch64 alias ARMv8.
* New public functions mpn_sec_mul and mpn_sec_sqr, implementing side-channel
silent multiplication and squaring.
* New public functions mpn_sec_div_qr and mpn_sec_div_r, implementing
side-channel silent division.
* New public functions mpn_cnd_add_n and mpn_cnd_sub_n. Side-channel silent
conditional addition and subtraction.
* New public function mpn_sec_powm, implementing side-channel silent modexp.
* New public function mpn_sec_invert, implementing side-channel silent
modular inversion.
* Better support for applications which use the mpz_t type, but nevertheless
need to call some of the lower-level mpn functions. See the documentation
for mpz_limbs_read and related functions.
MISC
[FreeBSD bashing removed]
|
|
* gmp.h is a wrapper to include archtecture specific include file.
* builtin library is in lib${LIBABISUFFIX}.
|
|
|
|
|
|
|
|
BUGS FIXED
* The internal functions mpn_sbpi1_div_qr_sec mpn_sbpi1_div_r_sec could
compute garbage with a low probability. They are now rewritten, and the
test code has been improved.
* A bug in the ia64 implementation of mpn_divrem_2, clobbering some
callee-save registers, has been fixed. This is an internal
function, with the bug manifesting itself as miscomputation in,
e.g., mpn_sqrtrem.
* The documentation now correctly says 'const' for input arguments.
|
|
Changes between GMP version 5.1.1 and 5.1.2
BUGS FIXED
* A bug in mpz_powm_ui triggered by base arguments of at least 15000 decimal
digits or mod arguments of at least 7500 decimal digits has been fixed.
* A AMD Bulldozer specific bug affecting the 64-bit Windows ABI has been
fixed. This bug was in a key function (mpn_mul_1) and made both Bulldozer
specific builds and fat builds run on Bulldozer completely non-functional.
MISC
* Fixes and generalisations to the test suite.
* Minor portability enhancements.
|
|
ok agc
|
|
|
|
Use "checksum" instead.
|
|
BUGS FIXED
On Windows 64-bit, an error causing link errors about __gmp_binvert_limb_table has been fixed.
Aarch64 alias ARM64 support now works.
A possible buffer overrun in mpz_ior has been fixed.
A rare sign flip in mpz_remove has been fixed.
A bug causing problems with mpf numbers with absolute value ≥ 231 has been fixed.
Several bugs in mini-gmp have been fixed.
A bug caused by automake, related to the 'distcheck' target, has been fixed by upgrading the automake used for GMP release engineering.
FEATURES
Preliminary support for the x32 ABI under x86-64.
MISC
The mini-gmp testsuite now tests the entire set of functions.
Various improvements of the GMP testsuite.
|
|
|
|
needed when building lang/gcc* with option gcc-inplace-math.
|
|
BUGS FIXED
* When reading a C++ number (like mpz_class) in an istream reaches the end
of the stream, the eofbit is now set.
* The result sign of mpz_rootrem's remainder is now always correct.
* The mpz_remove function now handles negative divisors.
* Contains all fixes from release 5.0.5.
SPEEDUPS
* The n-factorial and n-over-k functions have been reimplemented for great
speedups for small and large operands.
* New subquadratic algorithm for the Kronecker/Jacobi/Legendre symbol.
* Major speedup for ARM, in particular ARM Cortex-A9 and A15, thanks to broad
assembly support.
* Significant speedup or POWER6 and POWER7 thanks to improved assembly.
* The performance under M$ Windows' 64-bit ABI has been greatly improved
thanks to complete assembly support.
* Minor speed improvements of many functions and for many platforms.
FEATURES
* Many new CPUs recognised.
* New functions for multi-factorials, and primorial: mpz_2fac_ui,
mpz_mfac_uiui and mpz_primorial_ui.
* The mpz_powm_sec function now uses side-channel silent division for
converting into Montgomery residues.
* The fat binary mechanism is now more robust in its CPU recognition.
MISC
* Inclusion of assembly code is now controlled by the configure options
--enable-assembly and --disable-assembly. The "none" CPU targets is gone.
* In C++, the conversions mpq_class->mpz_class, mpf_class->mpz_class and
mpf_class->mpq_class are now explicit.
* Includes "mini-gmp", a small, portable, but less efficient, implementation
of a subset of GMP's mpn and mpz interfaces. Used in GMP bootstrap, but it
can also be bundled with applications as a fallback when the real GMP
library is unavailable.
* The ABIs under AIX are no longer called aix32 and aix64, but mode64 and 32.
This is more consistent with other powerpc systems.
* The coverage of the testsuite has been improved, using the lcov tool. See
also http://gmplib.org/devel/lcov/.
* It is now possible to compile GMP using a C++ compiler.
* K&R C compilers are no longer supported.
* The BSD MP compatibility functions have been removed.
|
|
|
|
changes: bugfixes, in particular for glitches with newer CPUs
|
|
|
|
changes:
-bugfixes
-New CPUs recognised
|
|
|
|
|
|
|
|
NetBSD-current with gcc45)
|
|
Changes between GMP version 5.0.1 and 5.0.2
BUGS FIXED
* Many minor bugs related to portability fixed.
* The support for HPPA 2.0N now works, after an assembly bug fix.
* A test case type error has been fixed. The symptom of this bug
was spurious 'make check' failures.
SPEEDUPS
* None, except indirectly through recognition of new CPUs.
FEATURES
* Fat builds are now supported for 64-bit x86 processors also under Darwin.
MISC
* None.
|
|
Unfortunately, with recent gcc, the x86 version of mpn fails to build on
Solaris. Add it to Darwin on the list of platforms for which we build a
generic mpn instead.
|
|
so the shared library now works as well. Thanks to Matt Thomas for vax
asm help.
|
|
|
|
|
|
|
|
|
|
Changes in GMP 5.0.1
BUGS FIXED
- Fat builds fixed.
- Fixed crash for huge multiplies when old FFT_TABLE2 type of parameter selection tables' sentinel was smaller than multiplied
operands.
- The solib numbers now reflect the removal of the documented but preliminary mpn_bdivmod function; we correctly flag
incompatibility with GMP 4.3. GMP 5.0.0 has this wrong, and should perhaps be uninstalled to avoid confusion.
SPEEDUPS
- Multiplication of large numbers has indirectly been sped up through better FFT tuning and processor recognition. Since many
operations depend on multiplication, there will be a general speedup.
FEATURES
- More Core i3, i5 an Core i7 processor models are recognised.
- Fixes and workarounds for Mac OS quirks should make this GMP version build using many of the different versions of "Xcode".
MISC
- The amount of scratch memory needed for multiplication of huge numbers have been reduced substantially (but is still larger
than in GMP 4.3.)
- Likewise, the amount of scratch memory needed for division of large numbers have been reduced substantially.
- The FFT tuning code of tune/tuneup.c has been completely rewritten, and new, large FFT parameter selection tables are provided
for many machines.
- Upgraded to the latest autoconf, automake, libtool.
Changes in GMP 5.0.0
BUGS FIXED
- None (contains the same fixes as release 4.3.2).
SPEEDUPS
- Multiplication has been overhauled:
1. Multiplication of larger same size operands has been improved with the addition of two new Toom functions and a new
internal function mpn_mulmod_bnm1 (computing U * V mod (B^n-1), B being the word base. This latter function is used for the
largest products, waiting for a better Schoenhage-Strassen U * V mod (B^n+1) implementation.
2. Likewise for squaring.
3. Multiplication of different size operands has been improved with the addition of many new Toom function, and by selecting
underlying functions better from the main multiply functions.
- Division and mod have been overhauled:
1. Plain "schoolbook" division is reimplemented using faster quotient approximation.
2. Division Q = N/D, R = N mod D where both the quotient and remainder are needed now runs in time O(M(log(N))). This is an
improvement of a factor log(log(N))
3. Division where just the quotient is needed is now O(M(log(Q))) on average.
4. Modulo operations using Montgomery REDC form now take time O(M(n)).
5. Exact division Q = N/D by means of mpz_divexact has been improved for all sizes, and now runs in time O(M(log(N))).
- The function mpz_powm is now faster for all sizes. Its complexity has gone from O(M(n)log(n)m) to O(M(n)m) where n is the size
of the modulo argument and m is the size of the exponent. It is also radically faster for even modulus, since it now partially
factors such modulus and performs two smaller modexp operations, then uses CRT.
- The internal support for multiplication yielding just the lower n limbs has been improved by using Mulders' algorithm.
- Computation of inverses, both plain 1/N and 1/N mod B^n have been improved by using well-tuned Newton iterations, and
wrap-around multiplication using mpn_mulmod_bnm1.
- A new algorithm makes mpz_perfect_power_p asymptotically faster.
- The function mpz_remove uses a much faster algorithm, is better tuned, and also benefits from the division improvements.
- Intel Atom and VIA Nano specific optimisations.
- Plus hundreds of smaller improvements and tweaks!
FEATURES
- New mpz function: mpz_powm_sec for side-channel quiet modexp computations.
- New mpn functions: mpn_sqr, mpn_and_n, mpn_ior_n, mpn_xor_n, mpn_nand_n, mpn_nior_n, mpn_xnor_n, mpn_andn_n, mpn_iorn_n,
mpn_com, mpn_neg, mpn_copyi, mpn_copyd, mpn_zero.
- The function mpn_tdiv_qr now allows certain argument overlap.
- Support for fat binaries for 64-bit x86 processors has been added.
- A new type, mp_bitcnt_t for bignum bit counts, has been introduced.
- Support for Windows64 through mingw64 has been added.
- The cofactors of mpz_gcdext and mpn_gcdext are now more strictly normalised, returning to how GMP 4.2 worked. (Note that also
release 4.3.2 has this change.)
MISC
- The mpn_mul function should no longer be used for squaring, instead use the new mpn_sqr.
- The algorithm selection has been improved, the number of thresholds have more than doubled, and the tuning and use of existing
thresholds have been improved.
- The tune/speed program can measure many of new functions.
- The mpn_bdivmod function has been removed. We do not consider this an incompatible change, since the function was marked as
preliminary.
- The testsuite has been enhanced in various ways.
Changes in GMP 4.3.2
Bugs:
- Fixed bug in mpf_eq.
- Fixed overflow issues in mpz_set_str, mpz_inp_str, mpf_set_str, and mpf_get_str.
- Avoid unbounded stack allocation for unbalanced multiplication.
- Fixed bug in FFT multiplication.
Speedups:
- None, except that improved processor recognition helps affected processors.
Features:
- Recognise more "Core 2" processor variants.
- The cofactors of mpz_gcdext and mpn_gcdext are now more strictly normalised, returning to how GMP 4.2 worked.
|
|
to fix sunpro build.
|
|
also affects some files.
Changes between GMP version 4.3.0 and 4.3.1
Bugs:
* Fixed bug in mpn_gcdext, affecting also mpz_gcdext and mpz_invert.
The bug could cause a cofactor to have a leading zero limb, which
could lead to crashes or miscomputation later on.
* Fixed some minor documentation issues.
Features:
* Workarounds for various issues with Mac OS X's build tools.
* Recognise more IBM "POWER" processor variants.
Changes between GMP version 4.2.X and 4.3.0
Bugs:
* Fixed bug in mpz_perfect_power_p with recognition of negative perfect
powers that can be written both as an even and odd power.
* We might accidentally have added bugs since there is a large amount of
new code in this release.
Speedups:
* Vastly improved assembly code for x86-64 processors from AMD and Intel.
* Major improvements also for many other processor families, such as
Alpha, PowerPC, and Itanium.
* New sub-quadratic mpn_gcd and mpn_gcdext, as well as improved basecase
gcd code.
* The multiply FFT code has been slightly improved.
* Balanced multiplication now uses 4-way Toom in addition to schoolbook,
Karatsuba, 3-way Toom, and FFT.
* Unbalanced multiplication has been vastly improved.
* Improved schoolbook division by means of faster quotient approximation.
* Several new algorithms for division and mod by single limbs, giving
many-fold speedups.
* Improved nth root computations.
* The mpz_nextprime function uses sieving and is much faster.
* Countless minor tweaks.
Features:
* Updated support for fat binaries for x86_32 include current processors
* Lots of new mpn internal interfaces. Some of them will become public
in a future GMP release.
* Support for the 32-bit ABI under x86-apple-darwin.
* x86 CPU recognition code should now default better for future
processors.
* The experimental nails feature does not work in this release, but
it might be re-enabled in the future.
Misc:
* The gmp_version variable now always contains three parts. For this
release, it is "4.3.0".
|
|
block). Uncomment some commented out LICENSE lines while here.
|
|
This changes the buildlink3.mk files to use an include guard for the
recursive include. The use of BUILDLINK_DEPTH, BUILDLINK_DEPENDS,
BUILDLINK_PACKAGES and BUILDLINK_ORDER is handled by a single new
variable BUILDLINK_TREE. Each buildlink3.mk file adds a pair of
enter/exit marker, which can be used to reconstruct the tree and
to determine first level includes. Avoiding := for large variables
(BUILDLINK_ORDER) speeds up parse time as += has linear complexity.
The include guard reduces system time by avoiding reading files over and
over again. For complex packages this reduces both %user and %sys time to
half of the former time.
|
|
|
|
|
|
|