PERLFAQ3(1)
NNAAMMEE
perlfaq3 - Programming Tools ($Revision: 1.22 $, $Date:
1997/04/24 22:43:42 $)
DDEESSCCRRIIPPTTIIOONN
This section of the FAQ answers questions related to
programmer tools and programming support.
HHooww ddoo II ddoo ((aannyytthhiinngg))??
Have you looked at CPAN (see the perlfaq2 manpage)? The
chances are that someone has already written a module that
can solve your problem. Have you read the appropriate man
pages? Here's a brief index:
Objects perlref, perlmod, perlobj, perltie
Data Structures perlref, perllol, perldsc
Modules perlmod, perlmodlib, perlsub
Regexps perlre, perlfunc, perlop
Moving to perl5 perltrap, perl
Linking w/C perlxstut, perlxs, perlcall, perlguts, perlembed
Various http://www.perl.com/CPAN/doc/FMTEYEWTK/index.html
(not a man-page but still useful)
the perltoc manpage provides a crude table of contents for
the perl man page set.
HHooww ccaann II uussee PPeerrll iinntteerraaccttiivveellyy??
The typical approach uses the Perl debugger, described in
the perldebug(1) man page, on an "empty" program, like
this:
perl -de 42
Now just type in any legal Perl code, and it will be
immediately evaluated. You can also examine the symbol
table, get stack backtraces, check variable values, set
breakpoints, and other operations typically found in
symbolic debuggers
IIss tthheerree aa PPeerrll sshheellll??
In general, no. The Shell.pm module (distributed with
perl) makes perl try commands which aren't part of the
Perl language as shell commands. perlsh from the source
distribution is simplistic and uninteresting, but may
still be what you want.
HHooww ddoo II ddeebbuugg mmyy PPeerrll pprrooggrraammss??
Have you used -w?
Have you tried use strict?
Did you check the returns of each and every system call?
Did you read the perltrap manpage?
Have you tried the Perl debugger, described in the
perldebug manpage?
HHooww ddoo II pprrooffiillee mmyy PPeerrll pprrooggrraammss??
You should get the Devel::DProf module from CPAN, and also
use Benchmark.pm from the standard distribution.
Benchmark lets you time specific portions of your code,
while Devel::DProf gives detailed breakdowns of where your
code spends its time.
HHooww ddoo II ccrroossss--rreeffeerreennccee mmyy PPeerrll pprrooggrraammss??
The B::Xref module, shipped with the new, alpha-release
Perl compiler (not the general distribution), can be used
to generate cross-reference reports for Perl programs.
perl -MO=Xref[,OPTIONS] foo.pl
IIss tthheerree aa pprreettttyy--pprriinntteerr ((ffoorrmmaatttteerr)) ffoorr PPeerrll??
There is no program that will reformat Perl as much as
indent(1) will do for C. The complex feedback between the
scanner and the parser (this feedback is what confuses the
vgrind and emacs programs) makes it challenging at best to
write a stand-alone Perl parser.
Of course, if you simply follow the guidelines in the
perlstyle manpage, you shouldn't need to reformat.
Your editor can and should help you with source
formatting. The perl-mode for emacs can provide a
remarkable amount of help with most (but not all) code,
and even less programmable editors can provide significant
assistance.
If you are using to using vgrind program for printing out
nice code to a laser printer, you can take a stab at this
using
http://www.perl.com/CPAN/doc/misc/tips/working.vgrind.entry,
but the results are not particularly satisfying for
sophisticated code.
IIss tthheerree aa ccttaaggss ffoorr PPeerrll??
There's a simple one at
http://www.perl.com/CPAN/authors/id/TOMC/scripts/ptags.gz
which may do the trick.
WWhheerree ccaann II ggeett PPeerrll mmaaccrrooss ffoorr vvii??
For a complete version of Tom Christiansen's vi
configuration file, see
ftp://ftp.perl.com/pub/vi/toms.exrc, the standard
benchmark file for vi emulators. This runs best with nvi,
the current version of vi out of Berkeley, which
incidentally can be built with an embedded Perl
interpreter -- see http://www.perl.com/CPAN/src/misc .
WWhheerree ccaann II ggeett ppeerrll--mmooddee ffoorr eemmaaccss??
Since Emacs version 19 patchlevel 22 or so, there have
been both a perl-mode.el and support for the perl debugger
built in. These should come with the standard Emacs 19
distribution.
In the perl source directory, you'll find a directory
called "emacs", which contains a cperl-mode that color-
codes keywords, provides context-sensitive help, and other
nifty things.
Note that the perl-mode of emacs will have fits with
"main'foo" (single quote), and mess up the indentation and
hilighting. You should be using "main::foo", anyway.
HHooww ccaann II uussee ccuurrsseess wwiitthh PPeerrll??
The Curses module from CPAN provides a dynamically
loadable object module interface to a curses library.
HHooww ccaann II uussee XX oorr TTkk wwiitthh PPeerrll??
Tk is a completely Perl-based, object-oriented interface
to the Tk toolkit that doesn't force you to use Tcl just
to get at Tk. Sx is an interface to the Athena Widget
set. Both are available from CPAN.
HHooww ccaann II ggeenneerraattee ssiimmppllee mmeennuuss wwiitthhoouutt uussiinngg CCGGII oorr TTkk??
The
http://www.perl.com/CPAN/authors/id/SKUNZ/perlmenu.v4.0.tar.gz
module, which is curses-based, can help with this.
CCaann II ddyynnaammiiccaallllyy llooaadd CC rroouuttiinneess iinnttoo PPeerrll??
If your system architecture supports it, then the standard
perl on your system should also provide you with this via
the DynaLoader module. Read the perlxstut manpage for
details.
WWhhaatt iiss uunndduummpp??
See the next questions.
HHooww ccaann II mmaakkee mmyy PPeerrll pprrooggrraamm rruunn ffaasstteerr??
The best way to do this is to come up with a better
algorithm. This can often make a dramatic difference.
Chapter 8 in the Camel has some efficiency tips in it you
might want to look at.
Other approaches include autoloading seldom-used Perl
code. See the AutoSplit and AutoLoader modules in the
standard distribution for that. Or you could locate the
bottleneck and think about writing just that part in C,
the way we used to take bottlenecks in C code and write
them in assembler. Similar to rewriting in C is the use
of modules that have critical sections written in C (for
instance, the PDL module from CPAN).
In some cases, it may be worth it to use the backend
compiler to produce byte code (saving compilation time) or
compile into C, which will certainly save compilation time
and sometimes a small amount (but not much) execution
time. See the question about compiling your Perl
programs.
If you're currently linking your perl executable to a
shared libc.so, you can often gain a 10-25% performance
benefit by rebuilding it to link with a static libc.a
instead. This will make a bigger perl executable, but
your Perl programs (and programmers) may thank you for it.
See the INSTALL file in the source distribution for more
information.
Unsubstantiated reports allege that Perl interpreters that
use sfio outperform those that don't (for IO intensive
applications). To try this, see the INSTALL file in the
source distribution, especially the "Selecting File IO
mechanisms" section.
The undump program was an old attempt to speed up your
Perl program by storing the already-compiled form to disk.
This is no longer a viable option, as it only worked on a
few architectures, and wasn't a good solution anyway.
HHooww ccaann II mmaakkee mmyy PPeerrll pprrooggrraamm ttaakkee lleessss mmeemmoorryy??
When it comes to time-space tradeoffs, Perl nearly always
prefers to throw memory at a problem. Scalars in Perl use
more memory than strings in C, arrays take more that, and
hashes use even more. While there's still a lot to be
done, recent releases have been addressing these issues.
For example, as of 5.004, duplicate hash keys are shared
amongst all hashes using them, so require no reallocation.
In some cases, using substr() or vec() to simulate arrays
can be highly beneficial. For example, an array of a
thousand booleans will take at least 20,000 bytes of
space, but it can be turned into one 125-byte bit vector
for a considerable memory savings. The standard
Tie::SubstrHash module can also help for certain types of
data structure. If you're working with specialist data
structures (matrices, for instance) modules that implement
these in C may use less memory than equivalent Perl
modules.
Another thing to try is learning whether your Perl was
compiled with the system malloc or with Perl's builtin
malloc. Whichever one it is, try using the other one and
see whether this makes a difference. Information about
malloc is in the INSTALL file in the source distribution.
You can find out whether you are using perl's malloc by
typing perl -V:usemymalloc.
IIss iitt uunnssaaffee ttoo rreettuurrnn aa ppooiinntteerr ttoo llooccaall ddaattaa??
No, Perl's garbage collection system takes care of this.
sub makeone {
my @a = ( 1 .. 10 );
return \@a;
}
for $i ( 1 .. 10 ) {
push @many, makeone();
}
print $many[4][5], "\n";
print "@many\n";
HHooww ccaann II ffrreeee aann aarrrraayy oorr hhaasshh ssoo mmyy pprrooggrraamm sshhrriinnkkss??
You can't. Memory the system allocates to a program will
never be returned to the system. That's why long-running
programs sometimes re-exec themselves.
However, judicious use of my() on your variables will help
make sure that they go out of scope so that Perl can free
up their storage for use in other parts of your program.
(NB: my() variables also execute about 10% faster than
globals.) A global variable, of course, never goes out of
scope, so you can't get its space automatically reclaimed,
although undef()ing and/or delete()ing it will achieve the
same effect. In general, memory allocation and de-
allocation isn't something you can or should be worrying
about much in Perl, but even this capability
(preallocation of data types) is in the works.
HHooww ccaann II mmaakkee mmyy CCGGII ssccrriipptt mmoorree eeffffiicciieenntt??
Beyond the normal measures described to make general Perl
programs faster or smaller, a CGI program has additional
issues. It may be run several times per second. Given
that each time it runs it will need to be re-compiled and
will often allocate a megabyte or more of system memory,
this can be a killer. Compiling into C iissnn''tt ggooiinngg ttoo
hheellpp yyoouu because the process start-up overhead is where
the bottleneck is.
There are at least two popular ways to avoid this
overhead. One solution involves running the Apache HTTP
server (available from http://www.apache.org/) with either
of the mod_perl or mod_fastcgi plugin modules. With
mod_perl and the Apache::* modules (from CPAN), httpd will
run with an embedded Perl interpreter which pre-compiles
your script and then executes it within the same address
space without forking. The Apache extension also gives
Perl access to the internal server API, so modules written
in Perl can do just about anything a module written in C
can. With the FCGI module (from CPAN), a Perl executable
compiled with sfio (see the INSTALL file in the
distribution) and the mod_fastcgi module (available from
http://www.fastcgi.com/) each of your perl scripts becomes
a permanent CGI daemon processes.
Both of these solutions can have far-reaching effects on
your system and on the way you write your CGI scripts, so
investigate them with care.
HHooww ccaann II hhiiddee tthhee ssoouurrccee ffoorr mmyy PPeerrll pprrooggrraamm??
Delete it. :-) Seriously, there are a number of (mostly
unsatisfactory) solutions with varying levels of
"security".
First of all, however, you can't take away read
permission, because the source code has to be readable in
order to be compiled and interpreted. (That doesn't mean
that a CGI script's source is readable by people on the
web, though.) So you have to leave the permissions at the
socially friendly 0755 level.
Some people regard this as a security problem. If your
program does insecure things, and relies on people not
knowing how to exploit those insecurities, it is not
secure. It is often possible for someone to determine the
insecure things and exploit them without viewing the
source. Security through obscurity, the name for hiding
your bugs instead of fixing them, is little security
indeed.
You can try using encryption via source filters (Filter::*
from CPAN). But crackers might be able to decrypt it.
You can try using the byte code compiler and interpreter
described below, but crackers might be able to de-compile
it. You can try using the native-code compiler described
below, but crackers might be able to disassemble it.
These pose varying degrees of difficulty to people wanting
to get at your code, but none can definitively conceal it
(this is true of every language, not just Perl).
If you're concerned about people profiting from your code,
then the bottom line is that nothing but a restrictive
licence will give you legal security. License your
software and pepper it with threatening statements like
"This is unpublished proprietary software of XYZ Corp.
Your access to it does not give you permission to use it
blah blah blah." We are not lawyers, of course, so you
should see a lawyer if you want to be sure your licence's
wording will stand up in court.
HHooww ccaann II ccoommppiillee mmyy PPeerrll pprrooggrraamm iinnttoo bbyyttee ccooddee oorr CC??
Malcolm Beattie has written a multifunction backend
compiler, available from CPAN, that can do both these
things. It is as of Feb-1997 in late alpha release, which
means it's fun to play with if you're a programmer but not
really for people looking for turn-key solutions.
Please understand that merely compiling into C does not in
and of itself guarantee that your code will run very much
faster. That's because except for lucky cases where a lot
of native type inferencing is possible, the normal Perl
run time system is still present and thus will still take
just as long to run and be just as big. Most programs
save little more than compilation time, leaving execution
no more than 10-30% faster. A few rare programs actually
benefit significantly (like several times faster), but
this takes some tweaking of your code.
Malcolm will be in charge of the 5.005 release of Perl
itself to try to unify and merge his compiler and
multithreading work into the main release.
You'll probably be astonished to learn that the current
version of the compiler generates a compiled form of your
script whose executable is just as big as the original
perl executable, and then some. That's because as
currently written, all programs are prepared for a full
eval() statement. You can tremendously reduce this cost
by building a shared libperl.so library and linking
against that. See the INSTALL podfile in the perl source
distribution for details. If you link your main perl
binary with this, it will make it miniscule. For example,
on one author's system, /usr/bin/perl is only 11k in size!
HHooww ccaann II ggeett ''##!!ppeerrll'' ttoo wwoorrkk oonn [[MMSS--DDOOSS,,NNTT,,......]]??
For OS/2 just use
extproc perl -S -your_switches
as the first line in *.cmd file (-S due to a bug in
cmd.exe's `extproc' handling). For DOS one should first
invent a corresponding batch file, and codify it in
ALTERNATIVE_SHEBANG (see the INSTALL file in the source
distribution for more information).
The Win95/NT installation, when using the Activeware port
of Perl, will modify the Registry to associate the .pl
extension with the perl interpreter. If you install
another port, or (eventually) build your own Win95/NT Perl
using WinGCC, then you'll have to modify the Registry
yourself.
Macintosh perl scripts will have the the appropriate
Creator and Type, so that double-clicking them will invoke
the perl application.
IMPORTANT!: Whatever you do, PLEASE don't get frustrated,
and just throw the perl interpreter into your cgi-bin
directory, in order to get your scripts working for a web
server. This is an EXTREMELY big security risk. Take the
time to figure out how to do it correctly.
CCaann II wwrriittee uusseeffuull ppeerrll pprrooggrraammss oonn tthhee ccoommmmaanndd lliinnee??
Yes. Read the perlrun manpage for more information. Some
examples follow. (These assume standard Unix shell
quoting rules.)
# sum first and last fields
perl -lane 'print $F[0] + $F[-1]'
# identify text files
perl -le 'for(@ARGV) {print if -f && -T _}' *
# remove comments from C program
perl -0777 -pe 's{/\*.*?\*/}{}gs' foo.c
# make file a month younger than today, defeating reaper daemons
perl -e '$X=24*60*60; utime(time(),time() + 30 * $X,@ARGV)' *
# find first unused uid
perl -le '$i++ while getpwuid($i); print $i'
# display reasonable manpath
echo $PATH | perl -nl -072 -e '
s![^/+]*$!man!&&-d&&!$s{$_}++&&push@m,$_;END{print"@m"}'
Ok, the last one was actually an obfuscated perl entry.
:-)
WWhhyy ddoonn''tt ppeerrll oonnee--lliinneerrss wwoorrkk oonn mmyy DDOOSS//MMaacc//VVMMSS ssyysstteemm??
The problem is usually that the command interpreters on
those systems have rather different ideas about quoting
than the Unix shells under which the one-liners were
created. On some systems, you may have to change single-
quotes to double ones, which you must NOT do on Unix or
Plan9 systems. You might also have to change a single %
to a %%.
For example:
# Unix
perl -e 'print "Hello world\n"'
# DOS, etc.
perl -e "print \"Hello world\n\""
# Mac
print "Hello world\n"
(then Run "Myscript" or Shift-Command-R)
# VMS
perl -e "print ""Hello world\n"""
The problem is that none of this is reliable: it depends
on the command interpreter. Under Unix, the first two
often work. Under DOS, it's entirely possible neither
works. If 4DOS was the command shell, I'd probably have
better luck like this:
perl -e "print <Ctrl-x>"Hello world\n<Ctrl-x>""
Under the Mac, it depends which environment you are using.
The MacPerl shell, or MPW, is much like Unix shells in its
support for several quoting variants, except that it makes
free use of the Mac's non-ASCII characters as control
characters.
I'm afraid that there is no general solution to all of
this. It is a mess, pure and simple.
[Some of this answer was contributed by Kenneth
Albanowski.]
WWhheerree ccaann II lleeaarrnn aabboouutt CCGGII oorr WWeebb pprrooggrraammmmiinngg iinn PPeerrll??
For modules, get the CGI or LWP modules from CPAN. For
textbooks, see the two especially dedicated to web stuff
in the question on books. For problems and questions
related to the web, like "Why do I get 500 Errors" or "Why
doesn't it run from the browser right when it runs fine on
the command line", see these sources:
The Idiot's Guide to Solving Perl/CGI Problems, by Tom Christiansen
http://www.perl.com/perl/faq/idiots-guide.html
Frequently Asked Questions about CGI Programming, by Nick Kew
ftp://rtfm.mit.edu/pub/usenet/news.answers/www/cgi-faq
http://www3.pair.com/webthing/docs/cgi/faqs/cgifaq.shtml
Perl/CGI programming FAQ, by Shishir Gundavaram and Tom Christiansen
http://www.perl.com/perl/faq/perl-cgi-faq.html
The WWW Security FAQ, by Lincoln Stein
http://www-genome.wi.mit.edu/WWW/faqs/www-security-faq.html
World Wide Web FAQ, by Thomas Boutell
http://www.boutell.com/faq/
WWhheerree ccaann II lleeaarrnn aabboouutt oobbjjeecctt--oorriieenntteedd PPeerrll pprrooggrraammmmiinngg??
the perltoot manpage is a good place to start, and you can
use the perlobj manpage and the perlbot manpage for
reference. Perltoot didn't come out until the 5.004
release, but you can get a copy (in pod, html, or
postscript) from http://www.perl.com/CPAN/doc/FMTEYEWTK/ .
WWhheerree ccaann II lleeaarrnn aabboouutt lliinnkkiinngg CC wwiitthh PPeerrll?? [[hh22xxss,,
xxssuubbpppp]]
If you want to call C from Perl, start with the perlxstut
manpage, moving on to the perlxs manpage, the xsubpp
manpage, and the perlguts manpage. If you want to call
Perl from C, then read the perlembed manpage, the perlcall
manpage, and the perlguts manpage. Don't forget that you
can learn a lot from looking at how the authors of
existing extension modules wrote their code and solved
their problems.
II''vvee rreeaadd ppeerrlleemmbbeedd,, ppeerrllgguuttss,, eettcc..,, bbuutt II ccaann''tt eemmbbeedd
ppeerrll iinn mmyy CC pprrooggrraamm,, wwhhaatt aamm II ddooiinngg wwrroonngg??
Download the ExtUtils::Embed kit from CPAN and run `make
test'. If the tests pass, read the pods again and again
and again. If they fail, see the perlbug manpage and send
a bugreport with the output of make test TEST_VERBOSE=1
along with perl -V.
WWhheenn II ttrriieedd ttoo rruunn mmyy ssccrriipptt,, II ggoott tthhiiss mmeessssaaggee.. WWhhaatt
ddooeess iitt mmeeaann??
the perldiag manpage has a complete list of perl's error
messages and warnings, with explanatory text. You can
also use the splain program (distributed with perl) to
explain the error messages:
perl program 2>diag.out
splain [-v] [-p] diag.out
or change your program to explain the messages for you:
use diagnostics;
or
use diagnostics -verbose;
WWhhaatt''ss MMaakkeeMMaakkeerr??
This module (part of the standard perl distribution) is
designed to write a Makefile for an extension module from
a Makefile.PL. For more information, see the
ExtUtils::MakeMaker manpage.
AAUUTTHHOORR AANNDD CCOOPPYYRRIIGGHHTT
Copyright (c) 1997 Tom Christiansen and Nathan Torkington.
All rights reserved. See the perlfaq manpage for
distribution information.