• Home
  • Line#
  • Scopes#
  • Navigate#
  • Raw
  • Download
1****************************
2  What's New in Python 2.4
3****************************
4
5:Author: A.M. Kuchling
6
7.. |release| replace:: 1.02
8
9.. $Id: whatsnew24.tex 54632 2007-03-31 11:59:54Z georg.brandl $
10.. Don't write extensive text for new sections; I'll do that.
11.. Feel free to add commented-out reminders of things that need
12.. to be covered.  --amk
13
14This article explains the new features in Python 2.4.1, released on March 30,
152005.
16
17Python 2.4 is a medium-sized release.  It doesn't introduce as many changes as
18the radical Python 2.2, but introduces more features than the conservative 2.3
19release.  The most significant new language features are function decorators and
20generator expressions; most other changes are to the standard library.
21
22According to the CVS change logs, there were 481 patches applied and 502 bugs
23fixed between Python 2.3 and 2.4.  Both figures are likely to be underestimates.
24
25This article doesn't attempt to provide a complete specification of every single
26new feature, but instead provides a brief introduction to each feature.  For
27full details, you should refer to the documentation for Python 2.4, such as the
28Python Library Reference and the Python Reference Manual.  Often you will be
29referred to the PEP for a particular new feature for explanations of the
30implementation and design rationale.
31
32.. ======================================================================
33
34
35PEP 218: Built-In Set Objects
36=============================
37
38Python 2.3 introduced the :mod:`sets` module.  C implementations of set data
39types have now been added to the Python core as two new built-in types,
40:func:`set(iterable)` and :func:`frozenset(iterable)`.  They provide high speed
41operations for membership testing, for eliminating duplicates from sequences,
42and for mathematical operations like unions, intersections, differences, and
43symmetric differences. ::
44
45   >>> a = set('abracadabra')              # form a set from a string
46   >>> 'z' in a                            # fast membership testing
47   False
48   >>> a                                   # unique letters in a
49   set(['a', 'r', 'b', 'c', 'd'])
50   >>> ''.join(a)                          # convert back into a string
51   'arbcd'
52
53   >>> b = set('alacazam')                 # form a second set
54   >>> a - b                               # letters in a but not in b
55   set(['r', 'd', 'b'])
56   >>> a | b                               # letters in either a or b
57   set(['a', 'c', 'r', 'd', 'b', 'm', 'z', 'l'])
58   >>> a & b                               # letters in both a and b
59   set(['a', 'c'])
60   >>> a ^ b                               # letters in a or b but not both
61   set(['r', 'd', 'b', 'm', 'z', 'l'])
62
63   >>> a.add('z')                          # add a new element
64   >>> a.update('wxy')                     # add multiple new elements
65   >>> a
66   set(['a', 'c', 'b', 'd', 'r', 'w', 'y', 'x', 'z'])
67   >>> a.remove('x')                       # take one element out
68   >>> a
69   set(['a', 'c', 'b', 'd', 'r', 'w', 'y', 'z'])
70
71The :func:`frozenset` type is an immutable version of :func:`set`. Since it is
72immutable and hashable, it may be used as a dictionary key or as a member of
73another set.
74
75The :mod:`sets` module remains in the standard library, and may be useful if you
76wish to subclass the :class:`Set` or :class:`ImmutableSet` classes.  There are
77currently no plans to deprecate the module.
78
79
80.. seealso::
81
82   :pep:`218` - Adding a Built-In Set Object Type
83      Originally proposed by Greg Wilson and ultimately implemented by Raymond
84      Hettinger.
85
86.. ======================================================================
87
88
89PEP 237: Unifying Long Integers and Integers
90============================================
91
92The lengthy transition process for this PEP, begun in Python 2.2, takes another
93step forward in Python 2.4.  In 2.3, certain integer operations that would
94behave differently after int/long unification triggered :exc:`FutureWarning`
95warnings and returned values limited to 32 or 64 bits (depending on your
96platform).  In 2.4, these expressions no longer produce a warning and instead
97produce a different result that's usually a long integer.
98
99The problematic expressions are primarily left shifts and lengthy hexadecimal
100and octal constants.  For example, ``2 << 32`` results in a warning in 2.3,
101evaluating to 0 on 32-bit platforms.  In Python 2.4, this expression now returns
102the correct answer, 8589934592.
103
104
105.. seealso::
106
107   :pep:`237` - Unifying Long Integers and Integers
108      Original PEP written by Moshe Zadka and GvR.  The changes for 2.4 were
109      implemented by  Kalle Svensson.
110
111.. ======================================================================
112
113
114PEP 289: Generator Expressions
115==============================
116
117The iterator feature introduced in Python 2.2 and the :mod:`itertools` module
118make it easier to write programs that loop through large data sets without
119having the entire data set in memory at one time.  List comprehensions don't fit
120into this picture very well because they produce a Python list object containing
121all of the items.  This unavoidably pulls all of the objects into memory, which
122can be a problem if your data set is very large.  When trying to write a
123functionally-styled program, it would be natural to write something like::
124
125   links = [link for link in get_all_links() if not link.followed]
126   for link in links:
127       ...
128
129instead of  ::
130
131   for link in get_all_links():
132       if link.followed:
133           continue
134       ...
135
136The first form is more concise and perhaps more readable, but if you're dealing
137with a large number of link objects you'd have to write the second form to avoid
138having all link objects in memory at the same time.
139
140Generator expressions work similarly to list comprehensions but don't
141materialize the entire list; instead they create a generator that will return
142elements one by one.  The above example could be written as::
143
144   links = (link for link in get_all_links() if not link.followed)
145   for link in links:
146       ...
147
148Generator expressions always have to be written inside parentheses, as in the
149above example.  The parentheses signalling a function call also count, so if you
150want to create an iterator that will be immediately passed to a function you
151could write::
152
153   print sum(obj.count for obj in list_all_objects())
154
155Generator expressions differ from list comprehensions in various small ways.
156Most notably, the loop variable (*obj* in the above example) is not accessible
157outside of the generator expression.  List comprehensions leave the variable
158assigned to its last value; future versions of Python will change this, making
159list comprehensions match generator expressions in this respect.
160
161
162.. seealso::
163
164   :pep:`289` - Generator Expressions
165      Proposed by Raymond Hettinger and implemented by Jiwon Seo with early efforts
166      steered by Hye-Shik Chang.
167
168.. ======================================================================
169
170
171PEP 292: Simpler String Substitutions
172=====================================
173
174Some new classes in the standard library provide an alternative mechanism for
175substituting variables into strings; this style of substitution may be better
176for applications where untrained users need to edit templates.
177
178The usual way of substituting variables by name is the ``%`` operator::
179
180   >>> '%(page)i: %(title)s' % {'page':2, 'title': 'The Best of Times'}
181   '2: The Best of Times'
182
183When writing the template string, it can be easy to forget the ``i`` or ``s``
184after the closing parenthesis.  This isn't a big problem if the template is in a
185Python module, because you run the code, get an "Unsupported format character"
186:exc:`ValueError`, and fix the problem.  However, consider an application such
187as Mailman where template strings or translations are being edited by users who
188aren't aware of the Python language.  The format string's syntax is complicated
189to explain to such users, and if they make a mistake, it's difficult to provide
190helpful feedback to them.
191
192PEP 292 adds a :class:`Template` class to the :mod:`string` module that uses
193``$`` to indicate a substitution::
194
195   >>> import string
196   >>> t = string.Template('$page: $title')
197   >>> t.substitute({'page':2, 'title': 'The Best of Times'})
198   '2: The Best of Times'
199
200If a key is missing from the dictionary, the :meth:`substitute` method will
201raise a :exc:`KeyError`.  There's also a :meth:`safe_substitute` method that
202ignores missing keys::
203
204   >>> t = string.Template('$page: $title')
205   >>> t.safe_substitute({'page':3})
206   '3: $title'
207
208
209.. seealso::
210
211   :pep:`292` - Simpler String Substitutions
212      Written and implemented  by Barry Warsaw.
213
214.. ======================================================================
215
216
217PEP 318: Decorators for Functions and Methods
218=============================================
219
220Python 2.2 extended Python's object model by adding static methods and class
221methods, but it didn't extend Python's syntax to provide any new way of defining
222static or class methods.  Instead, you had to write a :keyword:`def` statement
223in the usual way, and pass the resulting method to a :func:`staticmethod` or
224:func:`classmethod` function that would wrap up the function as a method of the
225new type. Your code would look like this::
226
227   class C:
228      def meth (cls):
229          ...
230
231      meth = classmethod(meth)   # Rebind name to wrapped-up class method
232
233If the method was very long, it would be easy to miss or forget the
234:func:`classmethod` invocation after the function body.
235
236The intention was always to add some syntax to make such definitions more
237readable, but at the time of 2.2's release a good syntax was not obvious.  Today
238a good syntax *still* isn't obvious but users are asking for easier access to
239the feature; a new syntactic feature has been added to meet this need.
240
241The new feature is called "function decorators".  The name comes from the idea
242that :func:`classmethod`, :func:`staticmethod`, and friends are storing
243additional information on a function object; they're *decorating* functions with
244more details.
245
246The notation borrows from Java and uses the ``'@'`` character as an indicator.
247Using the new syntax, the example above would be written::
248
249   class C:
250
251      @classmethod
252      def meth (cls):
253          ...
254
255
256The ``@classmethod`` is shorthand for the ``meth=classmethod(meth)`` assignment.
257More generally, if you have the following::
258
259   @A
260   @B
261   @C
262   def f ():
263       ...
264
265It's equivalent to the following pre-decorator code::
266
267   def f(): ...
268   f = A(B(C(f)))
269
270Decorators must come on the line before a function definition, one decorator per
271line, and can't be on the same line as the def statement, meaning that ``@A def
272f(): ...`` is illegal.  You can only decorate function definitions, either at
273the module level or inside a class; you can't decorate class definitions.
274
275A decorator is just a function that takes the function to be decorated as an
276argument and returns either the same function or some new object.  The return
277value of the decorator need not be callable (though it typically is), unless
278further decorators will be applied to the result.  It's easy to write your own
279decorators.  The following simple example just sets an attribute on the function
280object::
281
282   >>> def deco(func):
283   ...    func.attr = 'decorated'
284   ...    return func
285   ...
286   >>> @deco
287   ... def f(): pass
288   ...
289   >>> f
290   <function f at 0x402ef0d4>
291   >>> f.attr
292   'decorated'
293   >>>
294
295As a slightly more realistic example, the following decorator checks that the
296supplied argument is an integer::
297
298   def require_int (func):
299       def wrapper (arg):
300           assert isinstance(arg, int)
301           return func(arg)
302
303       return wrapper
304
305   @require_int
306   def p1 (arg):
307       print arg
308
309   @require_int
310   def p2(arg):
311       print arg*2
312
313An example in :pep:`318` contains a fancier version of this idea that lets you
314both specify the required type and check the returned type.
315
316Decorator functions can take arguments.  If arguments are supplied, your
317decorator function is called with only those arguments and must return a new
318decorator function; this function must take a single function and return a
319function, as previously described.  In other words, ``@A @B @C(args)`` becomes::
320
321   def f(): ...
322   _deco = C(args)
323   f = A(B(_deco(f)))
324
325Getting this right can be slightly brain-bending, but it's not too difficult.
326
327A small related change makes the :attr:`func_name` attribute of functions
328writable.  This attribute is used to display function names in tracebacks, so
329decorators should change the name of any new function that's constructed and
330returned.
331
332
333.. seealso::
334
335   :pep:`318` - Decorators for Functions, Methods and Classes
336      Written  by Kevin D. Smith, Jim Jewett, and Skip Montanaro.  Several people
337      wrote patches implementing function decorators, but the one that was actually
338      checked in was patch #979728, written by Mark Russell.
339
340   https://wiki.python.org/moin/PythonDecoratorLibrary
341      This Wiki page contains several examples of decorators.
342
343.. ======================================================================
344
345
346PEP 322: Reverse Iteration
347==========================
348
349A new built-in function, :func:`reversed(seq)`, takes a sequence and returns an
350iterator that loops over the elements of the sequence  in reverse order.   ::
351
352   >>> for i in reversed(xrange(1,4)):
353   ...    print i
354   ...
355   3
356   2
357   1
358
359Compared to extended slicing, such as ``range(1,4)[::-1]``, :func:`reversed` is
360easier to read, runs faster, and uses substantially less memory.
361
362Note that :func:`reversed` only accepts sequences, not arbitrary iterators.  If
363you want to reverse an iterator, first convert it to  a list with :func:`list`.
364::
365
366   >>> input = open('/etc/passwd', 'r')
367   >>> for line in reversed(list(input)):
368   ...   print line
369   ...
370   root:*:0:0:System Administrator:/var/root:/bin/tcsh
371     ...
372
373
374.. seealso::
375
376   :pep:`322` - Reverse Iteration
377      Written and implemented by Raymond Hettinger.
378
379.. ======================================================================
380
381
382PEP 324: New subprocess Module
383==============================
384
385The standard library provides a number of ways to execute a subprocess, offering
386different features and different levels of complexity.
387:func:`os.system(command)` is easy to use, but slow (it runs a shell process
388which executes the command) and dangerous (you have to be careful about escaping
389the shell's metacharacters).  The :mod:`popen2` module offers classes that can
390capture standard output and standard error from the subprocess, but the naming
391is confusing.  The :mod:`subprocess` module cleans  this up, providing a unified
392interface that offers all the features you might need.
393
394Instead of :mod:`popen2`'s collection of classes, :mod:`subprocess` contains a
395single class called :class:`Popen`  whose constructor supports a number of
396different keyword arguments. ::
397
398   class Popen(args, bufsize=0, executable=None,
399               stdin=None, stdout=None, stderr=None,
400               preexec_fn=None, close_fds=False, shell=False,
401               cwd=None, env=None, universal_newlines=False,
402               startupinfo=None, creationflags=0):
403
404*args* is commonly a sequence of strings that will be the arguments to the
405program executed as the subprocess.  (If the *shell* argument is true, *args*
406can be a string which will then be passed on to the shell for interpretation,
407just as :func:`os.system` does.)
408
409*stdin*, *stdout*, and *stderr* specify what the subprocess's input, output, and
410error streams will be.  You can provide a file object or a file descriptor, or
411you can use the constant ``subprocess.PIPE`` to create a pipe between the
412subprocess and the parent.
413
414.. index::
415   single: universal newlines; What's new
416
417The constructor has a number of handy options:
418
419* *close_fds* requests that all file descriptors be closed before running the
420  subprocess.
421
422* *cwd* specifies the working directory in which the subprocess will be executed
423  (defaulting to whatever the parent's working directory is).
424
425* *env* is a dictionary specifying environment variables.
426
427* *preexec_fn* is a function that gets called before the child is started.
428
429* *universal_newlines* opens the child's input and output using Python's
430  :term:`universal newlines` feature.
431
432Once you've created the :class:`Popen` instance,  you can call its :meth:`wait`
433method to pause until the subprocess has exited, :meth:`poll` to check if it's
434exited without pausing,  or :meth:`communicate(data)` to send the string *data*
435to the subprocess's standard input.   :meth:`communicate(data)`  then reads any
436data that the subprocess has sent to its standard output  or standard error,
437returning a tuple ``(stdout_data, stderr_data)``.
438
439:func:`call` is a shortcut that passes its arguments along to the :class:`Popen`
440constructor, waits for the command to complete, and returns the status code of
441the subprocess.  It can serve as a safer analog to :func:`os.system`::
442
443   sts = subprocess.call(['dpkg', '-i', '/tmp/new-package.deb'])
444   if sts == 0:
445       # Success
446       ...
447   else:
448       # dpkg returned an error
449       ...
450
451The command is invoked without use of the shell.  If you really do want to  use
452the shell, you can add ``shell=True`` as a keyword argument and provide a string
453instead of a sequence::
454
455   sts = subprocess.call('dpkg -i /tmp/new-package.deb', shell=True)
456
457The PEP takes various examples of shell and Python code and shows how they'd be
458translated into Python code that uses :mod:`subprocess`.  Reading this section
459of the PEP is highly recommended.
460
461
462.. seealso::
463
464   :pep:`324` - subprocess - New process module
465      Written and implemented by Peter Åstrand, with assistance from Fredrik Lundh and
466      others.
467
468.. ======================================================================
469
470
471PEP 327: Decimal Data Type
472==========================
473
474Python has always supported floating-point (FP) numbers, based on the underlying
475C :c:type:`double` type, as a data type.  However, while most programming
476languages provide a floating-point type, many people (even programmers) are
477unaware that floating-point numbers don't represent certain decimal fractions
478accurately.  The new :class:`Decimal` type can represent these fractions
479accurately, up to a user-specified precision limit.
480
481
482Why is Decimal needed?
483----------------------
484
485The limitations arise from the representation used for floating-point numbers.
486FP numbers are made up of three components:
487
488* The sign, which is positive or negative.
489
490* The mantissa, which is a single-digit binary number   followed by a fractional
491  part.  For example, ``1.01`` in base-2 notation is ``1 + 0/2 + 1/4``, or 1.25 in
492  decimal notation.
493
494* The exponent, which tells where the decimal point is located in the number
495  represented.
496
497For example, the number 1.25 has positive sign, a mantissa value of 1.01 (in
498binary), and an exponent of 0 (the decimal point doesn't need to be shifted).
499The number 5 has the same sign and mantissa, but the exponent is 2 because the
500mantissa is multiplied by 4 (2 to the power of the exponent 2); 1.25 \* 4 equals
5015.
502
503Modern systems usually provide floating-point support that conforms to a
504standard called IEEE 754.  C's :c:type:`double` type is usually implemented as a
50564-bit IEEE 754 number, which uses 52 bits of space for the mantissa.  This
506means that numbers can only be specified to 52 bits of precision.  If you're
507trying to represent numbers whose expansion repeats endlessly, the expansion is
508cut off after 52 bits. Unfortunately, most software needs to produce output in
509base 10, and common fractions in base 10 are often repeating decimals in binary.
510For example, 1.1 decimal is binary ``1.0001100110011 ...``; .1 = 1/16 + 1/32 +
5111/256 plus an infinite number of additional terms.  IEEE 754 has to chop off
512that infinitely repeated decimal after 52 digits, so the representation is
513slightly inaccurate.
514
515Sometimes you can see this inaccuracy when the number is printed::
516
517   >>> 1.1
518   1.1000000000000001
519
520The inaccuracy isn't always visible when you print the number because the FP-to-
521decimal-string conversion is provided by the C library, and most C libraries try
522to produce sensible output.  Even if it's not displayed, however, the inaccuracy
523is still there and subsequent operations can magnify the error.
524
525For many applications this doesn't matter.  If I'm plotting points and
526displaying them on my monitor, the difference between 1.1 and 1.1000000000000001
527is too small to be visible.  Reports often limit output to a certain number of
528decimal places, and if you round the number to two or three or even eight
529decimal places, the error is never apparent.  However, for applications where it
530does matter,  it's a lot of work to implement your own custom arithmetic
531routines.
532
533Hence, the :class:`Decimal` type was created.
534
535
536The :class:`Decimal` type
537-------------------------
538
539A new module, :mod:`decimal`, was added to Python's standard library.  It
540contains two classes, :class:`Decimal` and :class:`Context`.  :class:`Decimal`
541instances represent numbers, and :class:`Context` instances are used to wrap up
542various settings such as the precision and default rounding mode.
543
544:class:`Decimal` instances are immutable, like regular Python integers and FP
545numbers; once it's been created, you can't change the value an instance
546represents.  :class:`Decimal` instances can be created from integers or
547strings::
548
549   >>> import decimal
550   >>> decimal.Decimal(1972)
551   Decimal("1972")
552   >>> decimal.Decimal("1.1")
553   Decimal("1.1")
554
555You can also provide tuples containing the sign, the mantissa represented  as a
556tuple of decimal digits, and the exponent::
557
558   >>> decimal.Decimal((1, (1, 4, 7, 5), -2))
559   Decimal("-14.75")
560
561Cautionary note: the sign bit is a Boolean value, so 0 is positive and 1 is
562negative.
563
564Converting from floating-point numbers poses a bit of a problem: should the FP
565number representing 1.1 turn into the decimal number for exactly 1.1, or for 1.1
566plus whatever inaccuracies are introduced? The decision was to dodge the issue
567and leave such a conversion out of the API.  Instead, you should convert the
568floating-point number into a string using the desired precision and pass the
569string to the :class:`Decimal` constructor::
570
571   >>> f = 1.1
572   >>> decimal.Decimal(str(f))
573   Decimal("1.1")
574   >>> decimal.Decimal('%.12f' % f)
575   Decimal("1.100000000000")
576
577Once you have :class:`Decimal` instances, you can perform the usual mathematical
578operations on them.  One limitation: exponentiation requires an integer
579exponent::
580
581   >>> a = decimal.Decimal('35.72')
582   >>> b = decimal.Decimal('1.73')
583   >>> a+b
584   Decimal("37.45")
585   >>> a-b
586   Decimal("33.99")
587   >>> a*b
588   Decimal("61.7956")
589   >>> a/b
590   Decimal("20.64739884393063583815028902")
591   >>> a ** 2
592   Decimal("1275.9184")
593   >>> a**b
594   Traceback (most recent call last):
595     ...
596   decimal.InvalidOperation: x ** (non-integer)
597
598You can combine :class:`Decimal` instances with integers, but not with floating-
599point numbers::
600
601   >>> a + 4
602   Decimal("39.72")
603   >>> a + 4.5
604   Traceback (most recent call last):
605     ...
606   TypeError: You can interact Decimal only with int, long or Decimal data types.
607   >>>
608
609:class:`Decimal` numbers can be used with the :mod:`math` and :mod:`cmath`
610modules, but note that they'll be immediately converted to  floating-point
611numbers before the operation is performed, resulting in a possible loss of
612precision and accuracy.  You'll also get back a regular floating-point number
613and not a :class:`Decimal`.   ::
614
615   >>> import math, cmath
616   >>> d = decimal.Decimal('123456789012.345')
617   >>> math.sqrt(d)
618   351364.18288201344
619   >>> cmath.sqrt(-d)
620   351364.18288201344j
621
622:class:`Decimal` instances have a :meth:`sqrt` method that returns a
623:class:`Decimal`, but if you need other things such as trigonometric functions
624you'll have to implement them. ::
625
626   >>> d.sqrt()
627   Decimal("351364.1828820134592177245001")
628
629
630The :class:`Context` type
631-------------------------
632
633Instances of the :class:`Context` class encapsulate several settings for
634decimal operations:
635
636* :attr:`prec` is the precision, the number of decimal places.
637
638* :attr:`rounding` specifies the rounding mode.  The :mod:`decimal` module has
639  constants for the various possibilities: :const:`ROUND_DOWN`,
640  :const:`ROUND_CEILING`,  :const:`ROUND_HALF_EVEN`, and various others.
641
642* :attr:`traps` is a dictionary specifying what happens on encountering certain
643  error conditions: either  an exception is raised or  a value is returned.  Some
644  examples of error conditions are division by zero, loss of precision, and
645  overflow.
646
647There's a thread-local default context available by calling :func:`getcontext`;
648you can change the properties of this context to alter the default precision,
649rounding, or trap handling.  The following example shows the effect of changing
650the precision of the default context::
651
652   >>> decimal.getcontext().prec
653   28
654   >>> decimal.Decimal(1) / decimal.Decimal(7)
655   Decimal("0.1428571428571428571428571429")
656   >>> decimal.getcontext().prec = 9
657   >>> decimal.Decimal(1) / decimal.Decimal(7)
658   Decimal("0.142857143")
659
660The default action for error conditions is selectable; the module can either
661return a special value such as infinity or not-a-number, or exceptions can be
662raised::
663
664   >>> decimal.Decimal(1) / decimal.Decimal(0)
665   Traceback (most recent call last):
666     ...
667   decimal.DivisionByZero: x / 0
668   >>> decimal.getcontext().traps[decimal.DivisionByZero] = False
669   >>> decimal.Decimal(1) / decimal.Decimal(0)
670   Decimal("Infinity")
671   >>>
672
673The :class:`Context` instance also has various methods for formatting  numbers
674such as :meth:`to_eng_string` and :meth:`to_sci_string`.
675
676For more information, see the documentation for the :mod:`decimal` module, which
677includes a quick-start tutorial and a reference.
678
679
680.. seealso::
681
682   :pep:`327` - Decimal Data Type
683      Written by Facundo Batista and implemented by Facundo Batista, Eric Price,
684      Raymond Hettinger, Aahz, and Tim Peters.
685
686   http://www.lahey.com/float.htm
687      The article uses Fortran code to illustrate many of the problems that floating-
688      point inaccuracy can cause.
689
690   http://speleotrove.com/decimal/
691      A description of a decimal-based representation.  This representation is being
692      proposed as a standard, and underlies the new Python decimal type.  Much of this
693      material was written by Mike Cowlishaw, designer of the Rexx language.
694
695.. ======================================================================
696
697
698PEP 328: Multi-line Imports
699===========================
700
701One language change is a small syntactic tweak aimed at making it easier to
702import many names from a module.  In a ``from module import names`` statement,
703*names* is a sequence of names separated by commas.  If the sequence is  very
704long, you can either write multiple imports from the same module, or you can use
705backslashes to escape the line endings like this::
706
707   from SimpleXMLRPCServer import SimpleXMLRPCServer,\
708               SimpleXMLRPCRequestHandler,\
709               CGIXMLRPCRequestHandler,\
710               resolve_dotted_attribute
711
712The syntactic change in Python 2.4 simply allows putting the names within
713parentheses.  Python ignores newlines within a parenthesized expression, so the
714backslashes are no longer needed::
715
716   from SimpleXMLRPCServer import (SimpleXMLRPCServer,
717                                   SimpleXMLRPCRequestHandler,
718                                   CGIXMLRPCRequestHandler,
719                                   resolve_dotted_attribute)
720
721The PEP also proposes that all :keyword:`import` statements be absolute imports,
722with a leading ``.`` character to indicate a relative import.  This part of the
723PEP was not implemented for Python 2.4, but was completed for Python 2.5.
724
725
726.. seealso::
727
728   :pep:`328` - Imports: Multi-Line and Absolute/Relative
729      Written by Aahz.  Multi-line imports were implemented by Dima Dorfman.
730
731.. ======================================================================
732
733
734PEP 331: Locale-Independent Float/String Conversions
735====================================================
736
737The :mod:`locale` modules lets Python software select various conversions and
738display conventions that are localized to a particular country or language.
739However, the module was careful to not change the numeric locale because various
740functions in Python's implementation required that the numeric locale remain set
741to the ``'C'`` locale.  Often this was because the code was using the C
742library's :c:func:`atof` function.
743
744Not setting the numeric locale caused trouble for extensions that used third-
745party C libraries, however, because they wouldn't have the correct locale set.
746The motivating example was GTK+, whose user interface widgets weren't displaying
747numbers in the current locale.
748
749The solution described in the PEP is to add three new functions to the Python
750API that perform ASCII-only conversions, ignoring the locale setting:
751
752* :c:func:`PyOS_ascii_strtod(str, ptr)`  and :c:func:`PyOS_ascii_atof(str, ptr)`
753  both convert a string to a C :c:type:`double`.
754
755* :c:func:`PyOS_ascii_formatd(buffer, buf_len, format, d)` converts a
756  :c:type:`double` to an ASCII string.
757
758The code for these functions came from the GLib library
759(https://developer.gnome.org/glib/stable/), whose developers kindly
760relicensed the relevant functions and donated them to the Python Software
761Foundation.  The :mod:`locale` module  can now change the numeric locale,
762letting extensions such as GTK+  produce the correct results.
763
764
765.. seealso::
766
767   :pep:`331` - Locale-Independent Float/String Conversions
768      Written by Christian R. Reis, and implemented by Gustavo Carneiro.
769
770.. ======================================================================
771
772
773Other Language Changes
774======================
775
776Here are all of the changes that Python 2.4 makes to the core Python language.
777
778* Decorators for functions and methods were added (:pep:`318`).
779
780* Built-in :func:`set` and :func:`frozenset` types were  added (:pep:`218`).
781  Other new built-ins include the :func:`reversed(seq)` function (:pep:`322`).
782
783* Generator expressions were added (:pep:`289`).
784
785* Certain numeric expressions no longer return values restricted to 32 or 64
786  bits (:pep:`237`).
787
788* You can now put parentheses around the list of names in a ``from module import
789  names`` statement (:pep:`328`).
790
791* The :meth:`dict.update` method now accepts the same argument forms as the
792  :class:`dict` constructor.  This includes any mapping, any iterable of key/value
793  pairs, and keyword arguments. (Contributed by Raymond Hettinger.)
794
795* The string methods :meth:`ljust`, :meth:`rjust`, and :meth:`center` now take
796  an optional argument for specifying a fill character other than a space.
797  (Contributed by Raymond Hettinger.)
798
799* Strings also gained an :meth:`rsplit` method that works like the :meth:`split`
800  method but splits from the end of the string.   (Contributed by Sean
801  Reifschneider.) ::
802
803     >>> 'www.python.org'.split('.', 1)
804     ['www', 'python.org']
805     'www.python.org'.rsplit('.', 1)
806     ['www.python', 'org']
807
808* Three keyword parameters, *cmp*, *key*, and *reverse*, were added to the
809  :meth:`sort` method of lists. These parameters make some common usages of
810  :meth:`sort` simpler. All of these parameters are optional.
811
812  For the *cmp* parameter, the value should be a comparison function that takes
813  two parameters and returns -1, 0, or +1 depending on how the parameters compare.
814  This function will then be used to sort the list.  Previously this was the only
815  parameter that could be provided to :meth:`sort`.
816
817  *key* should be a single-parameter function that takes a list element and
818  returns a comparison key for the element.  The list is then sorted using the
819  comparison keys.  The following example sorts a list case-insensitively::
820
821     >>> L = ['A', 'b', 'c', 'D']
822     >>> L.sort()                 # Case-sensitive sort
823     >>> L
824     ['A', 'D', 'b', 'c']
825     >>> # Using 'key' parameter to sort list
826     >>> L.sort(key=lambda x: x.lower())
827     >>> L
828     ['A', 'b', 'c', 'D']
829     >>> # Old-fashioned way
830     >>> L.sort(cmp=lambda x,y: cmp(x.lower(), y.lower()))
831     >>> L
832     ['A', 'b', 'c', 'D']
833
834  The last example, which uses the *cmp* parameter, is the old way to perform a
835  case-insensitive sort.  It works but is slower than using a *key* parameter.
836  Using *key* calls :meth:`lower` method once for each element in the list while
837  using *cmp* will call it twice for each comparison, so using *key* saves on
838  invocations of the :meth:`lower` method.
839
840  For simple key functions and comparison functions, it is often possible to avoid
841  a :keyword:`lambda` expression by using an unbound method instead.  For example,
842  the above case-insensitive sort is best written as::
843
844     >>> L.sort(key=str.lower)
845     >>> L
846     ['A', 'b', 'c', 'D']
847
848  Finally, the *reverse* parameter takes a Boolean value.  If the value is true,
849  the list will be sorted into reverse order. Instead of ``L.sort();
850  L.reverse()``, you can now write ``L.sort(reverse=True)``.
851
852  The results of sorting are now guaranteed to be stable.  This means that two
853  entries with equal keys will be returned in the same order as they were input.
854  For example, you can sort a list of people by name, and then sort the list by
855  age, resulting in a list sorted by age where people with the same age are in
856  name-sorted order.
857
858  (All changes to :meth:`sort` contributed by Raymond Hettinger.)
859
860* There is a new built-in function :func:`sorted(iterable)` that works like the
861  in-place :meth:`list.sort` method but can be used in expressions.  The
862  differences are:
863
864* the input may be any iterable;
865
866* a newly formed copy is sorted, leaving the original intact; and
867
868* the expression returns the new sorted copy
869
870  ::
871
872     >>> L = [9,7,8,3,2,4,1,6,5]
873     >>> [10+i for i in sorted(L)]       # usable in a list comprehension
874     [11, 12, 13, 14, 15, 16, 17, 18, 19]
875     >>> L                               # original is left unchanged
876     [9,7,8,3,2,4,1,6,5]
877     >>> sorted('Monty Python')          # any iterable may be an input
878     [' ', 'M', 'P', 'h', 'n', 'n', 'o', 'o', 't', 't', 'y', 'y']
879
880     >>> # List the contents of a dict sorted by key values
881     >>> colormap = dict(red=1, blue=2, green=3, black=4, yellow=5)
882     >>> for k, v in sorted(colormap.iteritems()):
883     ...     print k, v
884     ...
885     black 4
886     blue 2
887     green 3
888     red 1
889     yellow 5
890
891  (Contributed by Raymond Hettinger.)
892
893* Integer operations will no longer trigger an :exc:`OverflowWarning`. The
894  :exc:`OverflowWarning` warning will disappear in Python 2.5.
895
896* The interpreter gained a new switch, :option:`-m`, that takes a name, searches
897  for the corresponding  module on ``sys.path``, and runs the module as a script.
898  For example,  you can now run the Python profiler with ``python -m profile``.
899  (Contributed by Nick Coghlan.)
900
901* The :func:`eval(expr, globals, locals)` and :func:`execfile(filename, globals,
902  locals)` functions and the :keyword:`exec` statement now accept any mapping type
903  for the *locals* parameter.  Previously this had to be a regular Python
904  dictionary.  (Contributed by Raymond Hettinger.)
905
906* The :func:`zip` built-in function and :func:`itertools.izip` now return an
907  empty list if called with no arguments. Previously they raised a
908  :exc:`TypeError` exception.  This makes them more suitable for use with variable
909  length argument lists::
910
911     >>> def transpose(array):
912     ...    return zip(*array)
913     ...
914     >>> transpose([(1,2,3), (4,5,6)])
915     [(1, 4), (2, 5), (3, 6)]
916     >>> transpose([])
917     []
918
919  (Contributed by Raymond Hettinger.)
920
921* Encountering a failure while importing a module no longer leaves a partially-
922  initialized module object in ``sys.modules``.  The incomplete module object left
923  behind would fool further imports of the same module into succeeding, leading to
924  confusing errors.   (Fixed by Tim Peters.)
925
926* :const:`None` is now a constant; code that binds a new value to  the name
927  ``None`` is now a syntax error. (Contributed by Raymond Hettinger.)
928
929.. ======================================================================
930
931
932Optimizations
933-------------
934
935* The inner loops for list and tuple slicing were optimized and now run about
936  one-third faster.  The inner loops for dictionaries were also optimized,
937  resulting in performance boosts for :meth:`keys`, :meth:`values`, :meth:`items`,
938  :meth:`iterkeys`, :meth:`itervalues`, and :meth:`iteritems`. (Contributed by
939  Raymond Hettinger.)
940
941* The machinery for growing and shrinking lists was optimized for speed and for
942  space efficiency.  Appending and popping from lists now runs faster due to more
943  efficient code paths and less frequent use of the underlying system
944  :c:func:`realloc`.  List comprehensions also benefit.   :meth:`list.extend` was
945  also optimized and no longer converts its argument into a temporary list before
946  extending the base list.  (Contributed by Raymond Hettinger.)
947
948* :func:`list`, :func:`tuple`, :func:`map`, :func:`filter`, and :func:`zip` now
949  run several times faster with non-sequence arguments that supply a
950  :meth:`__len__` method.  (Contributed by Raymond Hettinger.)
951
952* The methods :meth:`list.__getitem__`, :meth:`dict.__getitem__`, and
953  :meth:`dict.__contains__` are now implemented as :class:`method_descriptor`
954  objects rather than :class:`wrapper_descriptor` objects.  This form of  access
955  doubles their performance and makes them more suitable for use as arguments to
956  functionals: ``map(mydict.__getitem__, keylist)``. (Contributed by Raymond
957  Hettinger.)
958
959* Added a new opcode, ``LIST_APPEND``, that simplifies the generated bytecode
960  for list comprehensions and speeds them up by about a third.  (Contributed by
961  Raymond Hettinger.)
962
963* The peephole bytecode optimizer has been improved to  produce shorter, faster
964  bytecode; remarkably, the resulting bytecode is  more readable.  (Enhanced by
965  Raymond Hettinger.)
966
967* String concatenations in statements of the form ``s = s + "abc"`` and ``s +=
968  "abc"`` are now performed more efficiently in certain circumstances.  This
969  optimization won't be present in other Python implementations such as Jython, so
970  you shouldn't rely on it; using the :meth:`join` method of strings is still
971  recommended when you want to efficiently glue a large number of strings
972  together. (Contributed by Armin Rigo.)
973
974The net result of the 2.4 optimizations is that Python 2.4 runs the pystone
975benchmark around 5% faster than Python 2.3 and 35% faster than Python 2.2.
976(pystone is not a particularly good benchmark, but it's the most commonly used
977measurement of Python's performance.  Your own applications may show greater or
978smaller benefits from Python 2.4.)
979
980.. pystone is almost useless for comparing different versions of Python;
981   instead, it excels at predicting relative Python performance on different
982   machines.  So, this section would be more informative if it used other tools
983   such as pybench and parrotbench.  For a more application oriented benchmark,
984   try comparing the timings of test_decimal.py under 2.3 and 2.4.
985
986.. ======================================================================
987
988
989New, Improved, and Deprecated Modules
990=====================================
991
992As usual, Python's standard library received a number of enhancements and bug
993fixes.  Here's a partial list of the most notable changes, sorted alphabetically
994by module name. Consult the :file:`Misc/NEWS` file in the source tree for a more
995complete list of changes, or look through the CVS logs for all the details.
996
997* The :mod:`asyncore` module's :func:`loop` function now has a *count* parameter
998  that lets you perform a limited number of passes through the polling loop.  The
999  default is still to loop forever.
1000
1001* The :mod:`base64` module now has more complete RFC 3548 support for Base64,
1002  Base32, and Base16 encoding and decoding, including optional case folding and
1003  optional alternative alphabets. (Contributed by Barry Warsaw.)
1004
1005* The :mod:`bisect` module now has an underlying C implementation for improved
1006  performance. (Contributed by Dmitry Vasiliev.)
1007
1008* The CJKCodecs collections of East Asian codecs, maintained by Hye-Shik Chang,
1009  was integrated into 2.4.   The new encodings are:
1010
1011* Chinese (PRC): gb2312, gbk, gb18030, big5hkscs, hz
1012
1013* Chinese (ROC): big5, cp950
1014
1015* Japanese: cp932, euc-jis-2004, euc-jp, euc-jisx0213, iso-2022-jp,
1016    iso-2022-jp-1, iso-2022-jp-2, iso-2022-jp-3, iso-2022-jp-ext, iso-2022-jp-2004,
1017    shift-jis, shift-jisx0213, shift-jis-2004
1018
1019* Korean: cp949, euc-kr, johab, iso-2022-kr
1020
1021* Some other new encodings were added: HP Roman8,  ISO_8859-11, ISO_8859-16,
1022  PCTP-154, and TIS-620.
1023
1024* The UTF-8 and UTF-16 codecs now cope better with receiving partial input.
1025  Previously the :class:`StreamReader` class would try to read more data, making
1026  it impossible to resume decoding from the stream.  The :meth:`read` method will
1027  now return as much data as it can and future calls will resume decoding where
1028  previous ones left off.  (Implemented by Walter Dörwald.)
1029
1030* There is a new :mod:`collections` module for  various specialized collection
1031  datatypes.   Currently it contains just one type, :class:`deque`,  a double-
1032  ended queue that supports efficiently adding and removing elements from either
1033  end::
1034
1035     >>> from collections import deque
1036     >>> d = deque('ghi')        # make a new deque with three items
1037     >>> d.append('j')           # add a new entry to the right side
1038     >>> d.appendleft('f')       # add a new entry to the left side
1039     >>> d                       # show the representation of the deque
1040     deque(['f', 'g', 'h', 'i', 'j'])
1041     >>> d.pop()                 # return and remove the rightmost item
1042     'j'
1043     >>> d.popleft()             # return and remove the leftmost item
1044     'f'
1045     >>> list(d)                 # list the contents of the deque
1046     ['g', 'h', 'i']
1047     >>> 'h' in d                # search the deque
1048     True
1049
1050  Several modules, such as the :mod:`Queue` and :mod:`threading` modules, now take
1051  advantage of :class:`collections.deque` for improved performance.  (Contributed
1052  by Raymond Hettinger.)
1053
1054* The :mod:`ConfigParser` classes have been enhanced slightly. The :meth:`read`
1055  method now returns a list of the files that were successfully parsed, and the
1056  :meth:`set` method raises :exc:`TypeError` if passed a *value* argument that
1057  isn't a string.   (Contributed by John Belmonte and David Goodger.)
1058
1059* The :mod:`curses` module now supports the ncurses extension
1060  :func:`use_default_colors`.  On platforms where the terminal supports
1061  transparency, this makes it possible to use a transparent background.
1062  (Contributed by Jörg Lehmann.)
1063
1064* The :mod:`difflib` module now includes an :class:`HtmlDiff` class that creates
1065  an HTML table showing a side by side comparison of two versions of a text.
1066  (Contributed by Dan Gass.)
1067
1068* The :mod:`email` package was updated to version 3.0,  which dropped various
1069  deprecated APIs and removes support for Python versions earlier than 2.3.  The
1070  3.0 version of the package uses a new incremental parser for MIME messages,
1071  available in the :mod:`email.FeedParser` module.  The new parser doesn't require
1072  reading the entire message into memory, and doesn't raise exceptions if a
1073  message is malformed; instead it records any problems in the  :attr:`defect`
1074  attribute of the message.  (Developed by Anthony Baxter, Barry Warsaw, Thomas
1075  Wouters, and others.)
1076
1077* The :mod:`heapq` module has been converted to C.  The resulting tenfold
1078  improvement in speed makes the module suitable for handling high volumes of
1079  data.  In addition, the module has two new functions :func:`nlargest` and
1080  :func:`nsmallest` that use heaps to find the N largest or smallest values in a
1081  dataset without the expense of a full sort.  (Contributed by Raymond Hettinger.)
1082
1083* The :mod:`httplib` module now contains constants for HTTP status codes defined
1084  in various HTTP-related RFC documents.  Constants have names such as
1085  :const:`OK`, :const:`CREATED`, :const:`CONTINUE`, and
1086  :const:`MOVED_PERMANENTLY`; use pydoc to get a full list.  (Contributed by
1087  Andrew Eland.)
1088
1089* The :mod:`imaplib` module now supports IMAP's THREAD command (contributed by
1090  Yves Dionne) and new :meth:`deleteacl` and :meth:`myrights` methods (contributed
1091  by Arnaud Mazin).
1092
1093* The :mod:`itertools` module gained a :func:`groupby(iterable[, *func*])`
1094  function. *iterable* is something that can be iterated over to return a stream
1095  of elements, and the optional *func* parameter is a function that takes an
1096  element and returns a key value; if omitted, the key is simply the element
1097  itself.  :func:`groupby` then groups the elements into subsequences which have
1098  matching values of the key, and returns a series of 2-tuples containing the key
1099  value and an iterator over the subsequence.
1100
1101  Here's an example to make this clearer.  The *key* function simply returns
1102  whether a number is even or odd, so the result of :func:`groupby` is to return
1103  consecutive runs of odd or even numbers. ::
1104
1105     >>> import itertools
1106     >>> L = [2, 4, 6, 7, 8, 9, 11, 12, 14]
1107     >>> for key_val, it in itertools.groupby(L, lambda x: x % 2):
1108     ...    print key_val, list(it)
1109     ...
1110     0 [2, 4, 6]
1111     1 [7]
1112     0 [8]
1113     1 [9, 11]
1114     0 [12, 14]
1115     >>>
1116
1117  :func:`groupby` is typically used with sorted input.  The logic for
1118  :func:`groupby` is similar to the Unix ``uniq`` filter which makes it handy for
1119  eliminating, counting, or identifying duplicate elements::
1120
1121     >>> word = 'abracadabra'
1122     >>> letters = sorted(word)   # Turn string into a sorted list of letters
1123     >>> letters
1124     ['a', 'a', 'a', 'a', 'a', 'b', 'b', 'c', 'd', 'r', 'r']
1125     >>> for k, g in itertools.groupby(letters):
1126     ...    print k, list(g)
1127     ...
1128     a ['a', 'a', 'a', 'a', 'a']
1129     b ['b', 'b']
1130     c ['c']
1131     d ['d']
1132     r ['r', 'r']
1133     >>> # List unique letters
1134     >>> [k for k, g in groupby(letters)]
1135     ['a', 'b', 'c', 'd', 'r']
1136     >>> # Count letter occurrences
1137     >>> [(k, len(list(g))) for k, g in groupby(letters)]
1138     [('a', 5), ('b', 2), ('c', 1), ('d', 1), ('r', 2)]
1139
1140  (Contributed by Hye-Shik Chang.)
1141
1142* :mod:`itertools` also gained a function named :func:`tee(iterator, N)` that
1143  returns *N* independent iterators that replicate *iterator*.  If *N* is omitted,
1144  the default is 2. ::
1145
1146     >>> L = [1,2,3]
1147     >>> i1, i2 = itertools.tee(L)
1148     >>> i1,i2
1149     (<itertools.tee object at 0x402c2080>, <itertools.tee object at 0x402c2090>)
1150     >>> list(i1)               # Run the first iterator to exhaustion
1151     [1, 2, 3]
1152     >>> list(i2)               # Run the second iterator to exhaustion
1153     [1, 2, 3]
1154
1155  Note that :func:`tee` has to keep copies of the values returned  by the
1156  iterator; in the worst case, it may need to keep all of them.   This should
1157  therefore be used carefully if the leading iterator can run far ahead of the
1158  trailing iterator in a long stream of inputs. If the separation is large, then
1159  you might as well use  :func:`list` instead.  When the iterators track closely
1160  with one another, :func:`tee` is ideal.  Possible applications include
1161  bookmarking, windowing, or lookahead iterators. (Contributed by Raymond
1162  Hettinger.)
1163
1164* A number of functions were added to the :mod:`locale`  module, such as
1165  :func:`bind_textdomain_codeset` to specify a particular encoding and a family of
1166  :func:`l\*gettext` functions that return messages in the chosen encoding.
1167  (Contributed by Gustavo Niemeyer.)
1168
1169* Some keyword arguments were added to the :mod:`logging` package's
1170  :func:`basicConfig` function to simplify log configuration.  The default
1171  behavior is to log messages to standard error, but various keyword arguments can
1172  be specified to log to a particular file, change the logging format, or set the
1173  logging level. For example::
1174
1175     import logging
1176     logging.basicConfig(filename='/var/log/application.log',
1177         level=0,  # Log all messages
1178         format='%(levelname):%(process):%(thread):%(message)')
1179
1180  Other additions to the :mod:`logging` package include a :meth:`log(level, msg)`
1181  convenience method, as well as a :class:`TimedRotatingFileHandler` class that
1182  rotates its log files at a timed interval.  The module already had
1183  :class:`RotatingFileHandler`, which rotated logs once the file exceeded a
1184  certain size.  Both classes derive from a new :class:`BaseRotatingHandler` class
1185  that can be used to implement other rotating handlers.
1186
1187  (Changes implemented by Vinay Sajip.)
1188
1189* The :mod:`marshal` module now shares interned strings on unpacking a  data
1190  structure.  This may shrink the size of certain pickle strings, but the primary
1191  effect is to make :file:`.pyc` files significantly smaller. (Contributed by
1192  Martin von Löwis.)
1193
1194* The :mod:`nntplib` module's :class:`NNTP` class gained :meth:`description` and
1195  :meth:`descriptions` methods to retrieve  newsgroup descriptions for a single
1196  group or for a range of groups. (Contributed by Jürgen A. Erhard.)
1197
1198* Two new functions were added to the :mod:`operator` module,
1199  :func:`attrgetter(attr)` and :func:`itemgetter(index)`. Both functions return
1200  callables that take a single argument and return the corresponding attribute or
1201  item; these callables make excellent data extractors when used with :func:`map`
1202  or :func:`sorted`.  For example::
1203
1204     >>> L = [('c', 2), ('d', 1), ('a', 4), ('b', 3)]
1205     >>> map(operator.itemgetter(0), L)
1206     ['c', 'd', 'a', 'b']
1207     >>> map(operator.itemgetter(1), L)
1208     [2, 1, 4, 3]
1209     >>> sorted(L, key=operator.itemgetter(1)) # Sort list by second tuple item
1210     [('d', 1), ('c', 2), ('b', 3), ('a', 4)]
1211
1212  (Contributed by Raymond Hettinger.)
1213
1214* The :mod:`optparse` module was updated in various ways.  The module now passes
1215  its messages through :func:`gettext.gettext`, making it possible to
1216  internationalize Optik's help and error messages.  Help messages for options can
1217  now include the string ``'%default'``, which will be replaced by the option's
1218  default value.  (Contributed by Greg Ward.)
1219
1220* The long-term plan is to deprecate the :mod:`rfc822` module in some future
1221  Python release in favor of the :mod:`email` package. To this end, the
1222  :func:`email.Utils.formatdate` function has been changed to make it usable as a
1223  replacement for :func:`rfc822.formatdate`.  You may want to write new e-mail
1224  processing code with this in mind.  (Change implemented by Anthony Baxter.)
1225
1226* A new :func:`urandom(n)` function was added to the :mod:`os` module, returning
1227  a string containing *n* bytes of random data.  This function provides access to
1228  platform-specific sources of randomness such as :file:`/dev/urandom` on Linux or
1229  the Windows CryptoAPI.  (Contributed by Trevor Perrin.)
1230
1231* Another new function: :func:`os.path.lexists(path)`  returns true if the file
1232  specified by *path* exists, whether or not it's a symbolic link.  This differs
1233  from the existing :func:`os.path.exists(path)` function, which returns false if
1234  *path* is a symlink that points to a destination that doesn't exist.
1235  (Contributed by Beni Cherniavsky.)
1236
1237* A new :func:`getsid` function was added to the :mod:`posix` module that
1238  underlies the :mod:`os` module. (Contributed by J. Raynor.)
1239
1240* The :mod:`poplib` module now supports POP over SSL.  (Contributed by Hector
1241  Urtubia.)
1242
1243* The :mod:`profile` module can now profile C extension functions. (Contributed
1244  by Nick Bastin.)
1245
1246* The :mod:`random` module has a new method called :meth:`getrandbits(N)` that
1247  returns a long integer *N* bits in length.  The existing :meth:`randrange`
1248  method now uses :meth:`getrandbits` where appropriate, making generation of
1249  arbitrarily large random numbers more efficient.  (Contributed by Raymond
1250  Hettinger.)
1251
1252* The regular expression language accepted by the :mod:`re` module was extended
1253  with simple conditional expressions, written as ``(?(group)A|B)``.  *group* is
1254  either a numeric group ID or a group name defined with ``(?P<group>...)``
1255  earlier in the expression.  If the specified group matched, the regular
1256  expression pattern *A* will be tested against the string; if the group didn't
1257  match, the pattern *B* will be used instead. (Contributed by Gustavo Niemeyer.)
1258
1259* The :mod:`re` module is also no longer recursive, thanks to a massive amount
1260  of work by Gustavo Niemeyer.  In a recursive regular expression engine, certain
1261  patterns result in a large amount of C stack space being consumed, and it was
1262  possible to overflow the stack. For example, if you matched a 30000-byte string
1263  of ``a`` characters against the expression ``(a|b)+``, one stack frame was
1264  consumed per character.  Python 2.3 tried to check for stack overflow and raise
1265  a :exc:`RuntimeError` exception, but certain patterns could sidestep the
1266  checking and if you were unlucky Python could segfault. Python 2.4's regular
1267  expression engine can match this pattern without problems.
1268
1269* The :mod:`signal` module now performs tighter error-checking on the parameters
1270  to the :func:`signal.signal` function.  For example, you can't set a handler on
1271  the :const:`SIGKILL` signal; previous versions of Python would quietly accept
1272  this, but 2.4 will raise a :exc:`RuntimeError` exception.
1273
1274* Two new functions were added to the :mod:`socket` module. :func:`socketpair`
1275  returns a pair of connected sockets and :func:`getservbyport(port)` looks up the
1276  service name for a given port number. (Contributed by Dave Cole and Barry
1277  Warsaw.)
1278
1279* The :func:`sys.exitfunc` function has been deprecated.  Code should be using
1280  the existing :mod:`atexit` module, which correctly handles calling multiple exit
1281  functions.  Eventually :func:`sys.exitfunc` will become a purely internal
1282  interface, accessed only by :mod:`atexit`.
1283
1284* The :mod:`tarfile` module now generates GNU-format tar files by default.
1285  (Contributed by Lars Gustaebel.)
1286
1287* The :mod:`threading` module now has an elegantly simple way to support
1288  thread-local data.  The module contains a :class:`local` class whose attribute
1289  values are local to different threads. ::
1290
1291     import threading
1292
1293     data = threading.local()
1294     data.number = 42
1295     data.url = ('www.python.org', 80)
1296
1297  Other threads can assign and retrieve their own values for the :attr:`number`
1298  and :attr:`url` attributes.  You can subclass :class:`local` to initialize
1299  attributes or to add methods. (Contributed by Jim Fulton.)
1300
1301* The :mod:`timeit` module now automatically disables periodic garbage
1302  collection during the timing loop.  This change makes consecutive timings more
1303  comparable.  (Contributed by Raymond Hettinger.)
1304
1305* The :mod:`weakref` module now supports a wider variety of objects including
1306  Python functions, class instances, sets, frozensets, deques, arrays, files,
1307  sockets, and regular expression pattern objects. (Contributed by Raymond
1308  Hettinger.)
1309
1310* The :mod:`xmlrpclib` module now supports a multi-call extension for
1311  transmitting multiple XML-RPC calls in a single HTTP operation. (Contributed by
1312  Brian Quinlan.)
1313
1314* The :mod:`mpz`, :mod:`rotor`, and :mod:`xreadlines` modules have  been
1315  removed.
1316
1317.. ======================================================================
1318.. whole new modules get described in subsections here
1319.. =====================
1320
1321
1322cookielib
1323---------
1324
1325The :mod:`cookielib` library supports client-side handling for HTTP cookies,
1326mirroring the :mod:`Cookie` module's server-side cookie support. Cookies are
1327stored in cookie jars; the library transparently stores cookies offered by the
1328web server in the cookie jar, and fetches the cookie from the jar when
1329connecting to the server. As in web browsers, policy objects control whether
1330cookies are accepted or not.
1331
1332In order to store cookies across sessions, two implementations of cookie jars
1333are provided: one that stores cookies in the Netscape format so applications can
1334use the Mozilla or Lynx cookie files, and one that stores cookies in the same
1335format as the Perl libwww library.
1336
1337:mod:`urllib2` has been changed to interact with :mod:`cookielib`:
1338:class:`HTTPCookieProcessor` manages a cookie jar that is used when accessing
1339URLs.
1340
1341This module was contributed by John J. Lee.
1342
1343.. ==================
1344
1345
1346doctest
1347-------
1348
1349The :mod:`doctest` module underwent considerable refactoring thanks to Edward
1350Loper and Tim Peters.  Testing can still be as simple as running
1351:func:`doctest.testmod`, but the refactorings allow customizing the module's
1352operation in various ways
1353
1354The new :class:`DocTestFinder` class extracts the tests from a given  object's
1355docstrings::
1356
1357   def f (x, y):
1358       """>>> f(2,2)
1359   4
1360   >>> f(3,2)
1361   6
1362       """
1363       return x*y
1364
1365   finder = doctest.DocTestFinder()
1366
1367   # Get list of DocTest instances
1368   tests = finder.find(f)
1369
1370The new :class:`DocTestRunner` class then runs individual tests and can produce
1371a summary of the results::
1372
1373   runner = doctest.DocTestRunner()
1374   for t in tests:
1375       tried, failed = runner.run(t)
1376
1377   runner.summarize(verbose=1)
1378
1379The above example produces the following output::
1380
1381   1 items passed all tests:
1382      2 tests in f
1383   2 tests in 1 items.
1384   2 passed and 0 failed.
1385   Test passed.
1386
1387:class:`DocTestRunner` uses an instance of the :class:`OutputChecker` class to
1388compare the expected output with the actual output.  This class takes a number
1389of different flags that customize its behaviour; ambitious users can also write
1390a completely new subclass of :class:`OutputChecker`.
1391
1392The default output checker provides a number of handy features. For example,
1393with the :const:`doctest.ELLIPSIS` option flag, an ellipsis (``...``) in the
1394expected output matches any substring,  making it easier to accommodate outputs
1395that vary in minor ways::
1396
1397   def o (n):
1398       """>>> o(1)
1399   <__main__.C instance at 0x...>
1400   >>>
1401   """
1402
1403Another special string, ``<BLANKLINE>``, matches a blank line::
1404
1405   def p (n):
1406       """>>> p(1)
1407   <BLANKLINE>
1408   >>>
1409   """
1410
1411Another new capability is producing a diff-style display of the output by
1412specifying the :const:`doctest.REPORT_UDIFF` (unified diffs),
1413:const:`doctest.REPORT_CDIFF` (context diffs), or :const:`doctest.REPORT_NDIFF`
1414(delta-style) option flags.  For example::
1415
1416   def g (n):
1417       """>>> g(4)
1418   here
1419   is
1420   a
1421   lengthy
1422   >>>"""
1423       L = 'here is a rather lengthy list of words'.split()
1424       for word in L[:n]:
1425           print word
1426
1427Running the above function's tests with :const:`doctest.REPORT_UDIFF` specified,
1428you get the following output:
1429
1430.. code-block:: none
1431
1432   **********************************************************************
1433   File "t.py", line 15, in g
1434   Failed example:
1435       g(4)
1436   Differences (unified diff with -expected +actual):
1437       @@ -2,3 +2,3 @@
1438        is
1439        a
1440       -lengthy
1441       +rather
1442   **********************************************************************
1443
1444.. ======================================================================
1445
1446
1447Build and C API Changes
1448=======================
1449
1450Some of the changes to Python's build process and to the C API are:
1451
1452* Three new convenience macros were added for common return values from
1453  extension functions: :c:macro:`Py_RETURN_NONE`, :c:macro:`Py_RETURN_TRUE`, and
1454  :c:macro:`Py_RETURN_FALSE`. (Contributed by Brett Cannon.)
1455
1456* Another new macro, :c:macro:`Py_CLEAR(obj)`,  decreases the reference count of
1457  *obj* and sets *obj* to the null pointer.  (Contributed by Jim Fulton.)
1458
1459* A new function, :c:func:`PyTuple_Pack(N, obj1, obj2, ..., objN)`, constructs
1460  tuples from a variable length argument list of Python objects.  (Contributed by
1461  Raymond Hettinger.)
1462
1463* A new function, :c:func:`PyDict_Contains(d, k)`, implements fast dictionary
1464  lookups without masking exceptions raised during the look-up process.
1465  (Contributed by Raymond Hettinger.)
1466
1467* The :c:macro:`Py_IS_NAN(X)` macro returns 1 if  its float or double argument
1468  *X* is a NaN.   (Contributed by Tim Peters.)
1469
1470* C code can avoid unnecessary locking by using the new
1471  :c:func:`PyEval_ThreadsInitialized` function to tell  if any thread operations
1472  have been performed.  If this function  returns false, no lock operations are
1473  needed. (Contributed by Nick Coghlan.)
1474
1475* A new function, :c:func:`PyArg_VaParseTupleAndKeywords`, is the same as
1476  :c:func:`PyArg_ParseTupleAndKeywords` but takes a  :c:type:`va_list` instead of a
1477  number of arguments. (Contributed by Greg Chapman.)
1478
1479* A new method flag, :const:`METH_COEXISTS`, allows a function defined in slots
1480  to co-exist with a :c:type:`PyCFunction` having the same name.  This can halve
1481  the access time for a method such as :meth:`set.__contains__`.  (Contributed by
1482  Raymond Hettinger.)
1483
1484* Python can now be built with additional profiling for the interpreter itself,
1485  intended as an aid to people developing the Python core.  Providing
1486  :option:`!--enable-profiling` to the :program:`configure` script will let you
1487  profile the interpreter with :program:`gprof`, and providing the
1488  :option:`!--with-tsc` switch enables profiling using the Pentium's Time-Stamp-
1489  Counter register.  Note that the :option:`!--with-tsc` switch is slightly
1490  misnamed, because the profiling feature also works on the PowerPC platform,
1491  though that processor architecture doesn't call that register "the TSC
1492  register".  (Contributed by Jeremy Hylton.)
1493
1494* The :c:type:`tracebackobject` type has been renamed to
1495  :c:type:`PyTracebackObject`.
1496
1497.. ======================================================================
1498
1499
1500Port-Specific Changes
1501---------------------
1502
1503* The Windows port now builds under MSVC++ 7.1 as well as version 6.
1504  (Contributed by Martin von Löwis.)
1505
1506.. ======================================================================
1507
1508
1509Porting to Python 2.4
1510=====================
1511
1512This section lists previously described changes that may require changes to your
1513code:
1514
1515* Left shifts and hexadecimal/octal constants that are too  large no longer
1516  trigger a :exc:`FutureWarning` and return  a value limited to 32 or 64 bits;
1517  instead they return a long integer.
1518
1519* Integer operations will no longer trigger an :exc:`OverflowWarning`. The
1520  :exc:`OverflowWarning` warning will disappear in Python 2.5.
1521
1522* The :func:`zip` built-in function and :func:`itertools.izip` now return  an
1523  empty list instead of raising a :exc:`TypeError` exception if called with no
1524  arguments.
1525
1526* You can no longer compare the :class:`date` and :class:`~datetime.datetime` instances
1527  provided by the :mod:`datetime` module.  Two  instances of different classes
1528  will now always be unequal, and  relative comparisons (``<``, ``>``) will raise
1529  a :exc:`TypeError`.
1530
1531* :func:`dircache.listdir` now passes exceptions to the caller instead of
1532  returning empty lists.
1533
1534* :func:`LexicalHandler.startDTD` used to receive the public and system IDs in
1535  the wrong order.  This has been corrected; applications relying on the wrong
1536  order need to be fixed.
1537
1538* :func:`fcntl.ioctl` now warns if the *mutate*  argument is omitted and
1539  relevant.
1540
1541* The :mod:`tarfile` module now generates GNU-format tar files by default.
1542
1543* Encountering a failure while importing a module no longer leaves a partially-
1544  initialized module object in ``sys.modules``.
1545
1546* :const:`None` is now a constant; code that binds a new value to  the name
1547  ``None`` is now a syntax error.
1548
1549* The :func:`signals.signal` function now raises a :exc:`RuntimeError` exception
1550  for certain illegal values; previously these errors would pass silently.  For
1551  example, you can no longer set a handler on the :const:`SIGKILL` signal.
1552
1553.. ======================================================================
1554
1555
1556.. _24acks:
1557
1558Acknowledgements
1559================
1560
1561The author would like to thank the following people for offering suggestions,
1562corrections and assistance with various drafts of this article: Koray Can,
1563Hye-Shik Chang, Michael Dyck, Raymond Hettinger, Brian Hurt, Hamish Lawson,
1564Fredrik Lundh, Sean Reifschneider, Sadruddin Rejeb.
1565
1566