From 5dca13f0c29d4cf4e9150853d14d050c251b9f7d Mon Sep 17 00:00:00 2001 From: rillig Date: Tue, 28 Feb 2006 23:22:49 +0000 Subject: Added a rant on the Perl programming language. --- pkgtools/pkglint/files/doc/chap.code.xml | 72 +++++++++++++++++++++++++++++++- 1 file changed, 70 insertions(+), 2 deletions(-) (limited to 'pkgtools') diff --git a/pkgtools/pkglint/files/doc/chap.code.xml b/pkgtools/pkglint/files/doc/chap.code.xml index 6eb19490caf..0cc9517fac1 100644 --- a/pkgtools/pkglint/files/doc/chap.code.xml +++ b/pkgtools/pkglint/files/doc/chap.code.xml @@ -1,4 +1,4 @@ - + Code structure @@ -213,6 +213,74 @@ regular expression. The class Location is currently unused. - + + +Perl programming style + + The &pkglint; source code has evolved from FreeBSD's portlint, + which has been written in Perl, and up to now, &pkglint; is written + in Perl. Since one of the main ingredients to &pkglint; are regular + expressions, this choice seems natural, and indeed the Perl regular + expressions are a great help to keep the code short. But &pkglint; + is more than just throwing regular expressions at the package + files. + + In 2004, when the &pkglint; source code comprised about + 40 kilobytes, this was quite appropriate. Since then, the code + has become much more structured and various abstraction layers have + been inserted. It became more and more clear that the Perl + programming language has not been designed with nice-looking source + code in mind. + + The first example are subroutines and their parameters. In + most other languages, the names of the parameters are mentioned in + the subroutine definition. Not so in Perl. The parameters to each + subroutine are passed in the $@ array. The usual + way to get named parameters is to write assign the parameter array + to a list of local variables. This extra statement is a nuisance, + but it is merely syntactical. + + More serious is the way the arguments are passed to a + subroutine. Perl allows the programmer to define subroutines with a + weak form of prototypes, which helps to catch calls to subroutines + that provide a wrong number of arguments. This feature catches many + bugs that are easily overlooked. The downside is that anything + besides using scalars as parameter types is difficult to understand + and quickly leads to unexpected behavior. Therefore the subroutines + in &pkglint; only use this style for parameter passing. Oh, and by + the way, the subroutine prototypes are only checked for in certain + situations like direct calls. In method calls, nothing is checked at + all. Since almost all diagnostics are produced by calling + $line->log_warning() or + $line->log_error(), most of the subroutine calls in + &pkglint; go unchecked. + + Instead of using magic numbers, well written code defines + named constants for these numbers and then refers to them using + their names, giving the reader extra information that plain numbers + could not give. Although the constant definitions look quite good in + &pkglint; there is one big caveat. The Perl programming language + does not know constants. So these definitions are rather shortcuts + for defining functions that return the value of the constant. And as + functions in Perl have package-wide scope, so have these constants. + This is why the namespace prefixes like SWST_ are + necessary to avoid name clashes. + + Most of the constants would be written as an enumeration data + type if Perl had one. The same limitation applies for many of the + classes (implemented as packages in Perl) that are simply structs. + The typical Perl implementation of structs are classes, er, packages + which then use methods for accessing the fields. Again, the names of + these methods are only checked at runtime, so there is no language + support for detecting spelling mistakes in field names. + + Another area where Perl fails to detect many errors is the + loose type system. You can apply almost every operator to almost + every data type, and the Perl language will give you more or less + what you want. Especially it does not prevent you from matching + a regular expression against a reference. It will simply compute + a string representation of the reference and match the regular + expression against that. + -- cgit v1.2.3