.. highlight:: c

**********************
Argument Clinic How-To
**********************

:author: Larry Hastings


.. topic:: Abstract

  Argument Clinic is a preprocessor for CPython C files.
  Its purpose is to automate all the boilerplate involved
  with writing argument parsing code for "builtins".
  This document shows you how to convert your first C
  function to work with Argument Clinic, and then introduces
  some advanced topics on Argument Clinic usage.

  Currently Argument Clinic is considered internal-only
  for CPython.  Its use is not supported for files outside
  CPython, and no guarantees are made regarding backwards
  compatibility for future versions.  In other words: if you
  maintain an external C extension for CPython, you're welcome
  to experiment with Argument Clinic in your own code.  But the
  version of Argument Clinic that ships with the next version
  of CPython *could* be totally incompatible and break all your code.

The Goals Of Argument Clinic
============================

Argument Clinic's primary goal
is to take over responsibility for all argument parsing code
inside CPython.  This means that, when you convert a function
to work with Argument Clinic, that function should no longer
do any of its own argument parsing.  The code generated by
Argument Clinic should be a "black box" to you, where CPython
calls in at the top, and your code gets called at the bottom,
with ``PyObject *args`` (and maybe ``PyObject *kwargs``)
magically converted into the C variables and types you need.

In order for Argument Clinic to accomplish its primary goal,
it must be easy to use.  Currently, working with CPython's
argument parsing library is a chore, requiring maintaining
redundant information in a surprising number of places.
When you use Argument Clinic, you don't have to repeat yourself.

Obviously, no one would want to use Argument Clinic unless
it's solving their problem, and without creating new problems of
its own.
So it's paramount that Argument Clinic generate correct code.
It'd be nice if the code were faster, too, but at the very least
it should not introduce a major speed regression.  (Eventually Argument
Clinic *should* make a major speedup possible: we could
rewrite its code generator to produce tailor-made argument
parsing code, rather than calling the general-purpose CPython
argument parsing library.  That would make for the fastest
argument parsing possible!)

Additionally, Argument Clinic must be flexible enough to
work with any approach to argument parsing.  Python has
some functions with some very strange parsing behaviors;
Argument Clinic's goal is to support all of them.

Finally, the original motivation for Argument Clinic was
to provide introspection "signatures" for CPython builtins.
Previously, the introspection query functions would raise
an exception if you passed in a builtin.  With Argument
Clinic, that's a thing of the past!

One idea you should keep in mind, as you work with
Argument Clinic: the more information you give it, the
better job it'll be able to do.
Argument Clinic is admittedly relatively simple right
now.  But as it evolves it will get more sophisticated,
and it should be able to do many interesting and smart
things with all the information you give it.


Basic Concepts And Usage
========================

Argument Clinic ships with CPython; you'll find it in ``Tools/clinic/clinic.py``.
If you run that script, specifying a C file as an argument:

.. code-block:: shell-session

    $ python3 Tools/clinic/clinic.py foo.c

Argument Clinic will scan over the file looking for lines that
look exactly like this:

.. code-block:: none

    /*[clinic input]

When it finds one, it reads everything up to a line that looks
exactly like this:

.. code-block:: none

    [clinic start generated code]*/

Everything in between these two lines is input for Argument Clinic.
All of these lines, including the beginning and ending comment
lines, are collectively called an Argument Clinic "block".

When Argument Clinic parses one of these blocks, it
generates output.  This output is written into the C file
immediately after the block, followed by a comment containing a checksum.
The Argument Clinic block now looks like this:

.. code-block:: none

    /*[clinic input]
    ... clinic input goes here ...
    [clinic start generated code]*/
    ... clinic output goes here ...
    /*[clinic end generated code: checksum=...]*/

If you run Argument Clinic on the same file a second time, Argument Clinic
will discard the old output and write out the new output with a fresh checksum
line.  However, if the input hasn't changed, the output won't change either.

You should never modify the output portion of an Argument Clinic block.  Instead,
change the input until it produces the output you want.  (That's the purpose of the
checksum: to detect if someone changed the output, as these edits would be lost
the next time Argument Clinic writes out fresh output.)
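
As a rough illustration of how such a check could work, the sketch below hashes
the output text and compares the result with a recorded value.  The helper names
``checksum()`` and ``output_was_edited()`` are hypothetical, and the use of
SHA-1 is an assumption rather than a documented guarantee, although it is
consistent with the checksum this document shows later for a block whose
output is empty.

.. code-block:: python

    import hashlib

    def checksum(output_text):
        """Return a hex digest of the output text (SHA-1 is assumed here)."""
        return hashlib.sha1(output_text.encode("utf-8")).hexdigest()

    def output_was_edited(output_text, recorded_checksum):
        """Report whether the output no longer matches its recorded checksum."""
        return checksum(output_text) != recorded_checksum

    # Record the checksum of an (empty) generated output, then simulate
    # somebody editing that output by hand.
    recorded = checksum("")
    print(recorded)
    print(output_was_edited("int hand_edited;\n", recorded))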

For the sake of clarity, here's the terminology we'll use with Argument Clinic:

* The first line of the comment (``/*[clinic input]``) is the *start line*.
* The last line of the initial comment (``[clinic start generated code]*/``) is the *end line*.
* The last line (``/*[clinic end generated code: checksum=...]*/``) is the *checksum line*.
* In between the start line and the end line is the *input*.
* In between the end line and the checksum line is the *output*.
* All the text collectively, from the start line to the checksum line inclusively,
  is the *block*.  (A block that hasn't been successfully processed by Argument
  Clinic yet doesn't have output or a checksum line, but it's still considered
  a block.)
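
The terminology above can be modelled with a short scanner.  This is only a
simplified sketch, not the code used by ``Tools/clinic/clinic.py``: the function
name ``split_blocks()`` and the dictionary layout are hypothetical, and the
checksum line is recognised purely by its prefix.

.. code-block:: python

    START_LINE = "/*[clinic input]"
    END_LINE = "[clinic start generated code]*/"
    CHECKSUM_PREFIX = "/*[clinic end generated code:"

    def split_blocks(source):
        """Return one dict per block with its input lines and output lines.

        "output" is None for a block that has no checksum line yet.
        """
        lines = source.splitlines()
        blocks = []
        i = 0
        while i < len(lines):
            if lines[i] != START_LINE:
                i += 1
                continue
            try:
                end = lines.index(END_LINE, i + 1)
            except ValueError:
                raise ValueError(f"block starting on line {i + 1} has no end line")
            block = {"input": lines[i + 1:end], "output": None}
            # Look for this block's checksum line before the next start line.
            j = end + 1
            while j < len(lines) and lines[j] != START_LINE:
                if lines[j].startswith(CHECKSUM_PREFIX):
                    block["output"] = lines[end + 1:j]
                    end = j
                    break
                j += 1
            blocks.append(block)
            i = end + 1
        return blocks

    example = "\n".join([
        "/*[clinic input]",
        "module _pickle",
        "[clinic start generated code]*/",
        "/*[clinic end generated code: checksum=...]*/",
        "",
        "/*[clinic input]",
        "_pickle.Pickler.dump",
        "[clinic start generated code]*/",
        "static int unrelated;",
    ])
    for block in split_blocks(example):
        print(block)

In the example, the first block has been processed (its output is empty), while
the second has no checksum line and therefore no output yet.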


Converting Your First Function
==============================

The best way to get a sense of how Argument Clinic works is to
convert a function to work with it.  Here, then, are the bare
minimum steps you'd need to follow to convert a function to
work with Argument Clinic.  Note that for code you plan to
check in to CPython, you really should take the conversion further,
using some of the advanced concepts you'll see later on in
the document (like "return converters" and "self converters").
But we'll keep it simple for this walkthrough so you can learn.

Let's dive in!

0. Make sure you're working with a freshly updated checkout
   of the CPython trunk.

1. Find a Python builtin that calls either :c:func:`PyArg_ParseTuple`
   or :c:func:`PyArg_ParseTupleAndKeywords`, and hasn't been converted
   to work with Argument Clinic yet.
   For my example I'm using ``_pickle.Pickler.dump()``.

2. If the call to the ``PyArg_Parse`` function uses any of the
   following format units:

   .. code-block:: none

       O&
       O!
       es
       es#
       et
       et#

   or if it has multiple calls to :c:func:`PyArg_ParseTuple`,
   you should choose a different function.  Argument Clinic *does*
   support all of these scenarios.  But these are advanced
   topics; let's do something simpler for your first function.

   Also, if the function has multiple calls to :c:func:`PyArg_ParseTuple`
   or :c:func:`PyArg_ParseTupleAndKeywords` where it supports different
   types for the same argument, or if the function uses something besides
   ``PyArg_Parse`` functions to parse its arguments, it probably
   isn't suitable for conversion to Argument Clinic.  Argument Clinic
   doesn't support generic functions or polymorphic parameters.

3. Add the following boilerplate above the function, creating our block::

    /*[clinic input]
    [clinic start generated code]*/

4. Cut the docstring and paste it in between the ``[clinic]`` lines,
   removing all the junk that makes it a properly quoted C string.
   When you're done you should have just the text, based at the left
   margin, with no line wider than 80 characters.
   (Argument Clinic will preserve indents inside the docstring.)

   If the old docstring had a first line that looked like a function
   signature, throw that line away.  (The docstring doesn't need it
   anymore: when you use ``help()`` on your builtin in the future,
   the first line will be built automatically based on the function's
   signature.)

   Sample::

    /*[clinic input]
    Write a pickled representation of obj to the open file.
    [clinic start generated code]*/

5. If your docstring doesn't have a "summary" line, Argument Clinic will
   complain.  So let's make sure it has one.  The "summary" line should
   be a paragraph consisting of a single 80-column line
   at the beginning of the docstring.

   (Our example docstring consists solely of a summary line, so the sample
   code doesn't have to change for this step.)

6. Above the docstring, enter the name of the function, followed
   by a blank line.  This should be the Python name of the function,
   and should be the full dotted path
   to the function: it should start with the name of the module,
   include any sub-modules, and if the function is a method on
   a class it should include the class name too.

   Sample::

    /*[clinic input]
    _pickle.Pickler.dump

    Write a pickled representation of obj to the open file.
    [clinic start generated code]*/

7. If this is the first time that module or class has been used with Argument
   Clinic in this C file,
   you must declare the module and/or class.  Proper Argument Clinic hygiene
   prefers declaring these in a separate block somewhere near the
   top of the C file, in the same way that include files and statics go at
   the top.  (In our sample code we'll just show the two blocks next to
   each other.)

   The name of the class and module should be the same as the one
   seen by Python.  Check the name defined in the :c:type:`PyModuleDef`
   or :c:type:`PyTypeObject` as appropriate.

   When you declare a class, you must also specify two aspects of its type
   in C: the type declaration you'd use for a pointer to an instance of
   this class, and a pointer to the :c:type:`PyTypeObject` for this class.

   Sample::

       /*[clinic input]
       module _pickle
       class _pickle.Pickler "PicklerObject *" "&Pickler_Type"
       [clinic start generated code]*/

       /*[clinic input]
       _pickle.Pickler.dump

       Write a pickled representation of obj to the open file.
       [clinic start generated code]*/

8. Declare each of the parameters to the function.  Each parameter
   should get its own line.  All the parameter lines should be
   indented from the function name and the docstring.

   The general form of these parameter lines is as follows:

   .. code-block:: none

       name_of_parameter: converter

   If the parameter has a default value, add that after the
   converter:

   .. code-block:: none

       name_of_parameter: converter = default_value

   Argument Clinic's support for "default values" is quite sophisticated;
   please see :ref:`the section below on default values <default_values>`
   for more information.

   Add a blank line below the parameters.

   What's a "converter"?  It establishes both the type
   of the variable used in C, and the method to convert the Python
   value into a C value at runtime.
   For now you're going to use what's called a "legacy converter", a
   convenience syntax intended to make porting old code into Argument
   Clinic easier.

   For each parameter, copy the "format unit" for that
   parameter from the ``PyArg_Parse()`` format argument and
   specify *that* as its converter, as a quoted
   string.  ("format unit" is the formal name for the one-to-three
   character substring of the ``format`` parameter that tells
   the argument parsing function what the type of the variable
   is and how to convert it.  For more on format units please
   see :ref:`arg-parsing`.)

   For multicharacter format units like ``z#``, use the
   entire two-or-three character string.

   Sample::

        /*[clinic input]
        module _pickle
        class _pickle.Pickler "PicklerObject *" "&Pickler_Type"
        [clinic start generated code]*/

        /*[clinic input]
        _pickle.Pickler.dump

            obj: 'O'

        Write a pickled representation of obj to the open file.
        [clinic start generated code]*/

9. If your function has ``|`` in the format string, meaning some
   parameters have default values, you can ignore it.  Argument
   Clinic infers which parameters are optional based on whether
   or not they have default values.

   If your function has ``$`` in the format string, meaning it
   takes keyword-only arguments, specify ``*`` on a line by
   itself before the first keyword-only argument, indented the
   same as the parameter lines.

   (``_pickle.Pickler.dump`` has neither, so our sample is unchanged.)


10. If the existing C function calls :c:func:`PyArg_ParseTuple`
    (as opposed to :c:func:`PyArg_ParseTupleAndKeywords`), then all its
    arguments are positional-only.

    To mark all parameters as positional-only in Argument Clinic,
    add a ``/`` on a line by itself after the last parameter,
    indented the same as the parameter lines.

    Currently this is all-or-nothing; either all parameters are
    positional-only, or none of them are.  (In the future Argument
    Clinic may relax this restriction.)

    Sample::

        /*[clinic input]
        module _pickle
        class _pickle.Pickler "PicklerObject *" "&Pickler_Type"
        [clinic start generated code]*/

        /*[clinic input]
        _pickle.Pickler.dump

            obj: 'O'
            /

        Write a pickled representation of obj to the open file.
        [clinic start generated code]*/

11. It's helpful to write a per-parameter docstring for each parameter.
    But per-parameter docstrings are optional; you can skip this step
    if you prefer.

    Here's how to add a per-parameter docstring.  The first line
    of the per-parameter docstring must be indented further than the
    parameter definition.  The left margin of this first line establishes
    the left margin for the whole per-parameter docstring; all the text
    you write will be outdented by this amount.  You can write as much
    text as you like, across multiple lines if you wish.

    Sample::

        /*[clinic input]
        module _pickle
        class _pickle.Pickler "PicklerObject *" "&Pickler_Type"
        [clinic start generated code]*/

        /*[clinic input]
        _pickle.Pickler.dump

            obj: 'O'
                The object to be pickled.
            /

        Write a pickled representation of obj to the open file.
        [clinic start generated code]*/

12. Save and close the file, then run ``Tools/clinic/clinic.py`` on
    it.  With luck everything worked: your block now has output, and
    a ``.c.h`` file has been generated! Reopen the file in your
    text editor to see::

       /*[clinic input]
       _pickle.Pickler.dump

           obj: 'O'
               The object to be pickled.
           /

       Write a pickled representation of obj to the open file.
       [clinic start generated code]*/

       static PyObject *
       _pickle_Pickler_dump(PicklerObject *self, PyObject *obj)
       /*[clinic end generated code: output=87ecad1261e02ac7 input=552eb1c0f52260d9]*/

    Obviously, if Argument Clinic didn't produce any output, it's because
    it found an error in your input.  Keep fixing your errors and retrying
    until Argument Clinic processes your file without complaint.

    For readability, most of the glue code has been generated to a ``.c.h``
    file.  You'll need to include that in your original ``.c`` file,
    typically right after the clinic module block::

       #include "clinic/_pickle.c.h"

13. Double-check that the argument-parsing code Argument Clinic generated
    looks basically the same as the existing code.

    First, ensure both places use the same argument-parsing function.
    The existing code must call either
    :c:func:`PyArg_ParseTuple` or :c:func:`PyArg_ParseTupleAndKeywords`;
    ensure that the code generated by Argument Clinic calls the
    *exact* same function.

    Second, the format string passed in to :c:func:`PyArg_ParseTuple` or
    :c:func:`PyArg_ParseTupleAndKeywords` should be *exactly* the same
    as the hand-written one in the existing function, up to the colon
    or semi-colon.

    (Argument Clinic always generates its format strings
    with a ``:`` followed by the name of the function.  If the
    existing code's format string ends with ``;``, to provide
    usage help, this change is harmless; don't worry about it.)

    Third, for parameters whose format units require two arguments
    (like a length variable, or an encoding string, or a pointer
    to a conversion function), ensure that the second argument is
    *exactly* the same between the two invocations.

    Fourth, inside the output portion of the block you'll find a preprocessor
    macro defining the appropriate static :c:type:`PyMethodDef` structure for
    this builtin::

        #define _PICKLE_PICKLER_DUMP_METHODDEF    \
        {"dump", (PyCFunction)_pickle_Pickler_dump, METH_O, _pickle_Pickler_dump__doc__},

    This static structure should be *exactly* the same as the existing static
    :c:type:`PyMethodDef` structure for this builtin.

    If any of these items differ in *any way*,
    adjust your Argument Clinic function specification and rerun
    ``Tools/clinic/clinic.py`` until they *are* the same.


14. Notice that the last line of the block's output is the declaration
    of your "impl" function.  This is where the builtin's implementation goes.
    Delete the existing prototype of the function you're modifying, but leave
    the opening curly brace.  Now delete its argument parsing code and the
    declarations of all the variables it dumps the arguments into.
    Notice how the Python arguments are now arguments to this impl function;
    if the implementation used different names for these variables, fix it.

    Let's reiterate, just because it's kind of weird.  Your code should now
    look like this::

        static return_type
        your_function_impl(...)
        /*[clinic end generated code: checksum=...]*/
        {
        ...

    Argument Clinic generated the checksum line and the function prototype just
    above it.  You should write the opening (and closing) curly braces for the
    function, and the implementation inside.

    Sample::

        /*[clinic input]
        module _pickle
        class _pickle.Pickler "PicklerObject *" "&Pickler_Type"
        [clinic start generated code]*/
        /*[clinic end generated code: checksum=da39a3ee5e6b4b0d3255bfef95601890afd80709]*/

        /*[clinic input]
        _pickle.Pickler.dump

            obj: 'O'
                The object to be pickled.
            /

        Write a pickled representation of obj to the open file.
        [clinic start generated code]*/

        PyDoc_STRVAR(_pickle_Pickler_dump__doc__,
        "Write a pickled representation of obj to the open file.\n"
        "\n"
        ...
        static PyObject *
        _pickle_Pickler_dump_impl(PicklerObject *self, PyObject *obj)
        /*[clinic end generated code: checksum=3bd30745bf206a48f8b576a1da3d90f55a0a4187]*/
        {
            /* Check whether the Pickler was initialized correctly (issue3664).
               Developers often forget to call __init__() in their subclasses, which
               would trigger a segfault without this check. */
            if (self->write == NULL) {
                PyErr_Format(PicklingError,
                             "Pickler.__init__() was not called by %s.__init__()",
                             Py_TYPE(self)->tp_name);
                return NULL;
            }

            if (_Pickler_ClearBuffer(self) < 0)
                return NULL;

            ...

15. Remember the macro with the :c:type:`PyMethodDef` structure for this
    function?  Find the existing :c:type:`PyMethodDef` structure for this
    function and replace it with a reference to the macro.  (If the builtin
    is at module scope, this will probably be very near the end of the file;
    if the builtin is a class method, this will probably be below but relatively
    near to the implementation.)

    Note that the body of the macro contains a trailing comma.  So when you
    replace the existing static :c:type:`PyMethodDef` structure with the macro,
    *don't* add a comma to the end.

    Sample::

        static struct PyMethodDef Pickler_methods[] = {
            _PICKLE_PICKLER_DUMP_METHODDEF
            _PICKLE_PICKLER_CLEAR_MEMO_METHODDEF
            {NULL, NULL}                /* sentinel */
        };


16. Compile, then run the relevant portions of the regression-test suite.
    This change should not introduce any new compile-time warnings or errors,
    and there should be no externally-visible change to Python's behavior.

    Well, except for one difference: ``inspect.signature()`` run on your function
    should now provide a valid signature!

    Congratulations, you've ported your first function to work with Argument Clinic!
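
The signature mentioned in the final step can be inspected from Python.  The
snippet below is a small sketch using the walkthrough's example method; it
assumes a Python build in which ``pickle.Pickler`` is available, and the exact
text printed may vary between Python versions and implementations.

.. code-block:: python

    import inspect
    import pickle

    # pickle.Pickler is the C implementation when the _pickle accelerator
    # module is available; otherwise it is the pure-Python fallback.
    signature = inspect.signature(pickle.Pickler.dump)
    print(signature)

    # Show each parameter together with its kind (for example POSITIONAL_ONLY).
    for name, parameter in signature.parameters.items():
        print(name, parameter.kind.name)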

Advanced Topics
===============

Now that you've had some experience working with Argument Clinic, it's time
for some advanced topics.


Symbolic default values
-----------------------

The default value you provide for a parameter can't be any arbitrary
expression.  Currently the following are explicitly supported:

* Numeric constants (integer and float)
* String constants
* ``True``, ``False``, and ``None``
* Simple symbolic constants like ``sys.maxsize``, which must
  start with the name of the module

In case you're curious, this is implemented in ``from_builtin()``
in ``Lib/inspect.py``.

(In the future, this may need to get even more elaborate,
to allow full expressions like ``CONSTANT - 1``.)
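
To make the list above concrete, here is a minimal sketch of a checker that
accepts only those kinds of default values.  It is a simplified model written
for this document, not the code in ``Lib/inspect.py``; the function name
``is_supported_default()`` is hypothetical, and treating a negated number such
as ``-1`` as a numeric constant is an assumption.

.. code-block:: python

    import ast

    def is_supported_default(text):
        """Return True if text looks like one of the supported default values."""
        try:
            node = ast.parse(text, mode="eval").body
        except SyntaxError:
            return False
        # Assumption: a negated number counts as a numeric constant.
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            operand = node.operand
            return (isinstance(operand, ast.Constant)
                    and type(operand.value) in (int, float))
        # Numbers, strings, True, False and None all parse as constants.
        if isinstance(node, ast.Constant):
            return node.value is None or type(node.value) in (int, float, str, bool)
        # Dotted names such as sys.maxsize: attribute access rooted at a name.
        if isinstance(node, ast.Attribute):
            while isinstance(node, ast.Attribute):
                node = node.value
            return isinstance(node, ast.Name)
        return False

    for candidate in ["42", "'abc'", "None", "sys.maxsize", "CONSTANT - 1"]:
        print(candidate, is_supported_default(candidate))

A bare name without a module prefix and a full expression such as
``CONSTANT - 1`` are both rejected by this sketch.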


Renaming the C functions and variables generated by Argument Clinic
-------------------------------------------------------------------

Argument Clinic automatically names the functions it generates for you.
Occasionally this may cause a problem, if the generated name collides with
the name of an existing C function.  There's an easy solution: override the names
used for the C functions.  Just add the keyword ``"as"``
to your function declaration line, followed by the function name you wish to use.
Argument Clinic will use that function name for the base (generated) function,
then add ``"_impl"`` to the end and use that for the name of the impl function.

For example, if we wanted to rename the C function names generated for
``pickle.Pickler.dump``, it'd look like this::

    /*[clinic input]
    pickle.Pickler.dump as pickler_dumper

    ...

The base function would now be named ``pickler_dumper()``,
and the impl function would now be named ``pickler_dumper_impl()``.
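
The naming rule can be summarised in a few lines of Python.  This is only an
illustrative sketch: ``c_names()`` is a hypothetical helper, and the default
rule it uses (replace each dot with an underscore) is inferred from the
``_pickle_Pickler_dump`` names shown in the walkthrough.

.. code-block:: python

    def c_names(declaration):
        """Return (base_name, impl_name) for a function declaration line."""
        python_name, _, override = declaration.partition(" as ")
        # Without an "as" override, derive the C name from the dotted path.
        base = override.strip() or python_name.strip().replace(".", "_")
        return base, base + "_impl"

    print(c_names("pickle.Pickler.dump as pickler_dumper"))
    print(c_names("_pickle.Pickler.dump"))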


Similarly, you may have a problem where you want to give a parameter
a specific Python name, but that name may be inconvenient in C.  Argument
Clinic allows you to give a parameter different names in Python and in C,
using the same ``"as"`` syntax::

    /*[clinic input]
    pickle.Pickler.dump

        obj: object
        file as file_obj: object
        protocol: object = NULL
        *
        fix_imports: bool = True

Here, the name used in Python (in the signature and the ``keywords``
array) would be ``file``, but the C variable would be named ``file_obj``.

You can use this to rename the ``self`` parameter too!
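
The shape of a parameter line, including the optional ``as`` rename and the
optional default value, can be modelled with a small splitter.  The function
``parse_parameter()`` and its result keys are hypothetical, and the splitting is
deliberately naive; it is a sketch of the syntax, not Argument Clinic's parser.

.. code-block:: python

    def parse_parameter(line):
        """Split 'name [as c_name]: converter [= default]' into its parts.

        Naive on purpose: a converter that itself contains ':' or '=' (for
        example one written with keyword arguments) would need a real parser.
        """
        names, _, rest = line.strip().partition(":")
        python_name, _, c_name = names.partition(" as ")
        converter, _, default = rest.partition("=")
        return {
            "python_name": python_name.strip(),
            # Without "as", the C variable keeps the Python name.
            "c_name": (c_name or python_name).strip(),
            "converter": converter.strip(),
            "default": default.strip() or None,
        }

    for line in ["file as file_obj: object", "protocol: object = NULL"]:
        print(parse_parameter(line))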


Converting functions using PyArg_UnpackTuple
--------------------------------------------

To convert a function parsing its arguments with :c:func:`PyArg_UnpackTuple`,
simply write out all the arguments, specifying each as an ``object``.  You
may specify the ``type`` argument to cast the type as appropriate.  All
arguments should be marked positional-only (add a ``/`` on a line by itself
after the last argument).

Currently the generated code will use :c:func:`PyArg_ParseTuple`, but this
will change soon.

Optional Groups
---------------

Some legacy functions have a tricky approach to parsing their arguments:
they count the number of positional arguments, then use a ``switch`` statement
to call one of several different :c:func:`PyArg_ParseTuple` calls depending on
how many positional arguments there are.  (These functions cannot accept
keyword-only arguments.)  This approach was used to simulate optional
arguments back before :c:func:`PyArg_ParseTupleAndKeywords` was created.

While functions using this approach can often be converted to
use :c:func:`PyArg_ParseTupleAndKeywords`, optional arguments, and default values,
it's not always possible.  Some of these legacy functions have
behaviors :c:func:`PyArg_ParseTupleAndKeywords` doesn't directly support.
The most obvious example is the builtin function ``range()``, which has
an optional argument on the *left* side of its required argument!
Another example is ``curses.window.addch()``, which has a group of two
arguments that must always be specified together.  (The arguments are
called ``x`` and ``y``; if you call the function passing in ``x``,
you must also pass in ``y``, and if you don't pass in ``x`` you may not
pass in ``y`` either.)

In any case, the goal of Argument Clinic is to support argument parsing
for all existing CPython builtins without changing their semantics.
Therefore Argument Clinic supports
this alternate approach to parsing, using what are called *optional groups*.
Optional groups are groups of arguments that must all be passed in together.
They can be to the left or the right of the required arguments.  They
can *only* be used with positional-only parameters.

.. note:: Optional groups are *only* intended for use when converting
          functions that make multiple calls to :c:func:`PyArg_ParseTuple`!
          Functions that use *any* other approach for parsing arguments
          should *almost never* be converted to Argument Clinic using
          optional groups.  Functions using optional groups currently
          cannot have accurate signatures in Python, because Python just
          doesn't understand the concept.  Please avoid using optional
          groups wherever possible.

To specify an optional group, add a ``[`` on a line by itself before
the parameters you wish to group together, and a ``]`` on a line by itself
after these parameters.  As an example, here's how ``curses.window.addch``
uses optional groups to make the first two parameters and the last
parameter optional::

    /*[clinic input]

    curses.window.addch

        [
        x: int
          X-coordinate.
        y: int
          Y-coordinate.
        ]

        ch: object
          Character to add.

        [
        attr: long
          Attributes for the character.
        ]
        /

    ...


Notes:

* For every optional group, one additional parameter will be passed into the
  impl function representing the group.  The parameter will be an int named
  ``group_{direction}_{number}``,
  where ``{direction}`` is either ``left`` or ``right`` depending on whether the group
  is before or after the required parameters, and ``{number}`` is a monotonically
  increasing number (starting at 1) indicating how far away the group is from
  the required parameters.  When the impl is called, this parameter will be set
  to zero if this group was unused, and set to non-zero if this group was used.
  (By used or unused, I mean whether or not the parameters received arguments
  in this invocation.)

* If there are no required arguments, the optional groups will behave
  as if they're to the right of the required arguments.

* In the case of ambiguity, the argument parsing code
  favors parameters on the left (before the required parameters).

* Optional groups can only contain positional-only parameters.

* Optional groups are *only* intended for legacy code.  Please do not
  use optional groups for new code.
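
The count-based dispatch described in this section can be modelled in Python
for the ``addch`` example.  This is a sketch of the idea only, not the C code
that Argument Clinic generates: ``parse_addch()`` is a hypothetical function,
the parameter order follows the sample block above, and the error message is
made up for illustration.

.. code-block:: python

    def parse_addch(*args):
        """Assign positional arguments to parameters based on their count."""
        x = y = attr = None
        group_left_1 = group_right_1 = 0
        if len(args) == 1:
            (ch,) = args
        elif len(args) == 2:
            ch, attr = args
            group_right_1 = 1
        elif len(args) == 3:
            x, y, ch = args
            group_left_1 = 1
        elif len(args) == 4:
            x, y, ch, attr = args
            group_left_1 = group_right_1 = 1
        else:
            raise TypeError("addch requires 1 to 4 arguments")
        # The group flags tell the impl function which groups received values.
        return {
            "group_left_1": group_left_1, "x": x, "y": y,
            "ch": ch,
            "group_right_1": group_right_1, "attr": attr,
        }

    print(parse_addch("a"))
    print(parse_addch(3, 5, "a"))

Because ``x`` and ``y`` form one group, there is no argument count for which
only one of them receives a value.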


Using real Argument Clinic converters, instead of "legacy converters"
---------------------------------------------------------------------

To save time, and to minimize how much you need to learn
to achieve your first port to Argument Clinic, the walkthrough above tells
you to use "legacy converters".  "Legacy converters" are a convenience,
designed explicitly to make porting existing code to Argument Clinic
easier.  And to be clear, their use is acceptable when porting code for
Python 3.4.

However, in the long term we probably want all our blocks to
use Argument Clinic's real syntax for converters.  Why?  A couple
reasons:

* The proper converters are far easier to read and clearer in their intent.
* There are some format units that are unsupported as "legacy converters",
  because they require arguments, and the legacy converter syntax doesn't
  support specifying arguments.
* In the future we may have a new argument parsing library that isn't
  restricted to what :c:func:`PyArg_ParseTuple` supports; this flexibility
  won't be available to parameters using legacy converters.

Therefore, if you don't mind a little extra effort, please use the normal
converters instead of legacy converters.

In a nutshell, the syntax for Argument Clinic (non-legacy) converters
looks like a Python function call.  However, if there are no explicit
arguments to the function (all arguments take their default values),
you may omit the parentheses.  Thus ``bool`` and ``bool()`` are exactly
the same converters.
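
Since the converter syntax resembles a Python call, a sketch of how such a
specification could be read is easy to write with the ``ast`` module.  The
function ``parse_converter()`` is hypothetical and is not taken from
Argument Clinic's source; it only illustrates that a bare name and an empty
call are equivalent.

.. code-block:: python

    import ast

    def parse_converter(text):
        """Return (converter_name, keyword_arguments) for a converter spec."""
        node = ast.parse(text.strip(), mode="eval").body
        # A bare name such as "bool" means the same as a call without arguments.
        if isinstance(node, ast.Name):
            return node.id, {}
        if isinstance(node, ast.Call) and isinstance(node.func, ast.Name):
            if node.args:
                raise ValueError("positional converter arguments are not accepted")
            # Keyword values are restricted to Python literals in this sketch.
            arguments = {kw.arg: ast.literal_eval(kw.value) for kw in node.keywords}
            return node.func.id, arguments
        raise ValueError(f"not a converter specification: {text!r}")

    print(parse_converter("bool"))
    print(parse_converter("bool()"))
    print(parse_converter("object(type='PyUnicodeObject *')"))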

All arguments to Argument Clinic converters are keyword-only.
All Argument Clinic converters accept the following arguments:

  ``c_default``
    The default value for this parameter when defined in C.
    Specifically, this will be the initializer for the variable declared
    in the "parse function".  See :ref:`the section on default values <default_values>`
    for how to use this.
    Specified as a string.

  ``annotation``
    The annotation value for this parameter.  Not currently supported,
    because :pep:`8` mandates that the Python library may not use
    annotations.

In addition, some converters accept additional arguments.  Here is a list
of these arguments, along with their meanings:

  ``accept``
    A set of Python types (and possibly pseudo-types);
    this restricts the allowable Python argument to values of these types.
    (This is not a general-purpose facility; as a rule it only supports
    specific lists of types as shown in the legacy converter table.)

    To accept ``None``, add ``NoneType`` to this set.

  ``bitwise``
    Only supported for unsigned integers.  The native integer value of this
    Python argument will be written to the parameter without any range checking,
    even for negative values.

  ``converter``
    Only supported by the ``object`` converter.  Specifies the name of a
    :ref:`C "converter function" <o_ampersand>`
    to use to convert this object to a native type.

  ``encoding``
    Only supported for strings.  Specifies the encoding to use when converting
    this string from a Python str (Unicode) value into a C ``char *`` value.

  ``subclass_of``
    Only supported for the ``object`` converter.  Requires that the Python
    value be a subclass of a Python type, as expressed in C.

  ``type``
    Only supported for the ``object`` and ``self`` converters.  Specifies
    the C type that will be used to declare the variable.  Default value is
    ``"PyObject *"``.

  ``zeroes``
    Only supported for strings.  If true, embedded NUL bytes (``'\0'``) are
    permitted inside the value.  The length of the string will be passed in
    to the impl function, just after the string parameter, as a parameter named
    ``<parameter_name>_length``.

Please note, not every possible combination of arguments will work.
    Only supported for strings.  If true, embedded NUL bytes (``'\0'``) are
808    permitted inside the value.  The length of the string will be passed in
809    to the impl function, just after the string parameter, as a parameter named
810    ``<parameter_name>_length``.
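
To illustrate why ``zeroes=True`` passes an explicit length, here is a
plain-C sketch of a function shaped like such an impl function: the string
parameter is followed by its ``<parameter_name>_length``.  It is a standalone
model, not Argument Clinic output; the function name is hypothetical and
``ptrdiff_t`` is an assumed stand-in for Python's own size type.

.. code-block:: c

    #include <stddef.h>

    /* Plain-C stand-in for an impl function whose "data" parameter was
     * declared with zeroes=True: the string is followed by data_length.
     * (Real impl functions use Python's own size type, not ptrdiff_t.) */
    static ptrdiff_t
    count_nul_bytes(const char *data, ptrdiff_t data_length)
    {
        ptrdiff_t count = 0;

        /* Walk the whole buffer; strlen(data) would stop at the first NUL. */
        for (ptrdiff_t i = 0; i < data_length; i++) {
            if (data[i] == '\0') {
                count++;
            }
        }
        return count;
    }

For example, ``count_nul_bytes("ab\0cd", 5)`` counts one embedded NUL byte,
whereas ``strlen`` on the same data would report a length of 2.
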
811
812Please note, not every possible combination of arguments will work.
813Usually these arguments are implemented by specific ``PyArg_ParseTuple``
814*format units*, with specific behavior.  For example, currently you cannot
815call ``unsigned_short`` without also specifying ``bitwise=True``.
816Although it's perfectly reasonable to think this would work, these semantics don't
817map to any existing format unit.  So Argument Clinic doesn't support it.  (Or, at
818least, not yet.)
819
820Below is a table showing the mapping of legacy converters into real
821Argument Clinic converters.  On the left is the legacy converter,
822on the right is the text you'd replace it with.
823
824=========   =================================================================================
825``'B'``     ``unsigned_char(bitwise=True)``
826``'b'``     ``unsigned_char``
827``'c'``     ``char``
828``'C'``     ``int(accept={str})``
829``'d'``     ``double``
830``'D'``     ``Py_complex``
831``'es'``    ``str(encoding='name_of_encoding')``
832``'es#'``   ``str(encoding='name_of_encoding', zeroes=True)``
833``'et'``    ``str(encoding='name_of_encoding', accept={bytes, bytearray, str})``
834``'et#'``   ``str(encoding='name_of_encoding', accept={bytes, bytearray, str}, zeroes=True)``
835``'f'``     ``float``
836``'h'``     ``short``
837``'H'``     ``unsigned_short(bitwise=True)``
838``'i'``     ``int``
839``'I'``     ``unsigned_int(bitwise=True)``
840``'k'``     ``unsigned_long(bitwise=True)``
841``'K'``     ``unsigned_long_long(bitwise=True)``
842``'l'``     ``long``
``'L'``     ``long_long``
844``'n'``     ``Py_ssize_t``
845``'O'``     ``object``
846``'O!'``    ``object(subclass_of='&PySomething_Type')``
847``'O&'``    ``object(converter='name_of_c_function')``
848``'p'``     ``bool``
849``'S'``     ``PyBytesObject``
850``'s'``     ``str``
851``'s#'``    ``str(zeroes=True)``
852``'s*'``    ``Py_buffer(accept={buffer, str})``
853``'U'``     ``unicode``
854``'u'``     ``Py_UNICODE``
855``'u#'``    ``Py_UNICODE(zeroes=True)``
856``'w*'``    ``Py_buffer(accept={rwbuffer})``
857``'Y'``     ``PyByteArrayObject``
858``'y'``     ``str(accept={bytes})``
859``'y#'``    ``str(accept={robuffer}, zeroes=True)``
860``'y*'``    ``Py_buffer``
861``'Z'``     ``Py_UNICODE(accept={str, NoneType})``
862``'Z#'``    ``Py_UNICODE(accept={str, NoneType}, zeroes=True)``
863``'z'``     ``str(accept={str, NoneType})``
864``'z#'``    ``str(accept={str, NoneType}, zeroes=True)``
865``'z*'``    ``Py_buffer(accept={buffer, str, NoneType})``
866=========   =================================================================================
867
868As an example, here's our sample ``pickle.Pickler.dump`` using the proper
869converter::
870
871    /*[clinic input]
872    pickle.Pickler.dump
873
874        obj: object
875            The object to be pickled.
876        /
877
878    Write a pickled representation of obj to the open file.
879    [clinic start generated code]*/
880
881One advantage of real converters is that they're more flexible than legacy
882converters.  For example, the ``unsigned_int`` converter (and all the
883``unsigned_`` converters) can be specified without ``bitwise=True``.  Their
884default behavior performs range checking on the value, and they won't accept
885negative numbers.  You just can't do that with a legacy converter!
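
The difference between the two behaviors can be modeled in plain C.  The
sketch below is a standalone illustration, not the code Argument Clinic
generates; both function names are hypothetical, and a ``long long`` stands
in for the Python integer being converted.

.. code-block:: c

    #include <limits.h>
    #include <stdbool.h>

    /* Range-checked conversion: rejects negative values and values that
     * do not fit, like an unsigned converter used without bitwise=True. */
    static bool
    to_unsigned_int_checked(long long value, unsigned int *out)
    {
        if (value < 0 || (unsigned long long)value > UINT_MAX) {
            return false;   /* real code would raise a Python exception */
        }
        *out = (unsigned int)value;
        return true;
    }

    /* Bitwise conversion: no range checking; the value is reduced
     * modulo UINT_MAX + 1, so -1 becomes UINT_MAX. */
    static unsigned int
    to_unsigned_int_bitwise(long long value)
    {
        return (unsigned int)value;
    }

With these definitions, the checked conversion refuses ``-1``, while the
bitwise conversion silently turns it into ``UINT_MAX``.
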
886
887Argument Clinic will show you all the converters it has
888available.  For each converter it'll show you all the parameters
889it accepts, along with the default value for each parameter.
890Just run ``Tools/clinic/clinic.py --converters`` to see the full list.
891
892Py_buffer
893---------
894
895When using the ``Py_buffer`` converter
(or the ``'s*'``, ``'w*'``, ``'y*'``, or ``'z*'`` legacy converters),
897you *must* not call :c:func:`PyBuffer_Release` on the provided buffer.
898Argument Clinic generates code that does it for you (in the parsing function).
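
As a sketch of what this means in practice, consider the body of an impl
function for a hypothetical ``example.checksum`` function that declares
``data: Py_buffer``.  The module, the function name, and the exact impl
signature are assumptions made for illustration; the point is only that the
body never releases the buffer::

    static PyObject *
    example_checksum_impl(PyObject *module, Py_buffer *data)
    {
        const unsigned char *bytes = (const unsigned char *)data->buf;
        unsigned long total = 0;

        for (Py_ssize_t i = 0; i < data->len; i++) {
            total += bytes[i];
        }
        /* No PyBuffer_Release(data) here: the generated parsing
           function releases the buffer after this function returns. */
        return PyLong_FromUnsignedLong(total);
    }
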
899
900
901
902Advanced converters
903-------------------
904
905Remember those format units you skipped for your first
906time because they were advanced?  Here's how to handle those too.
907
908The trick is, all those format units take arguments—either
909conversion functions, or types, or strings specifying an encoding.
910(But "legacy converters" don't support arguments.  That's why we
911skipped them for your first function.)  The argument you specified
912to the format unit is now an argument to the converter; this
913argument is either ``converter`` (for ``O&``), ``subclass_of`` (for ``O!``),
914or ``encoding`` (for all the format units that start with ``e``).
915
916When using ``subclass_of``, you may also want to use the other
917custom argument for ``object()``: ``type``, which lets you set the type
918actually used for the parameter.  For example, if you want to ensure
919that the object is a subclass of ``PyUnicode_Type``, you probably want
920to use the converter ``object(type='PyUnicodeObject *', subclass_of='&PyUnicode_Type')``.
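
As a sketch, the three kinds of argument might look like this in a parameter
list.  The parameter names, and the choice of ``PyUnicode_FSConverter`` and
``'utf-8'``, are illustrative assumptions:

.. code-block:: none

    path: object(converter='PyUnicode_FSConverter')
    text: object(type='PyUnicodeObject *', subclass_of='&PyUnicode_Type')
    name: str(encoding='utf-8')
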
921
922One possible problem with using Argument Clinic: it takes away some possible
923flexibility for the format units starting with ``e``.  When writing a
924``PyArg_Parse`` call by hand, you could theoretically decide at runtime what
925encoding string to pass in to :c:func:`PyArg_ParseTuple`.   But now this string must
926be hard-coded at Argument-Clinic-preprocessing-time.  This limitation is deliberate;
927it made supporting this format unit much easier, and may allow for future optimizations.
928This restriction doesn't seem unreasonable; CPython itself always passes in static
929hard-coded encoding strings for parameters whose format units start with ``e``.
930
931
932.. _default_values:
933
934Parameter default values
935------------------------
936
937Default values for parameters can be any of a number of values.
938At their simplest, they can be string, int, or float literals:
939
940.. code-block:: none
941
942    foo: str = "abc"
943    bar: int = 123
944    bat: float = 45.6
945
946They can also use any of Python's built-in constants:
947
948.. code-block:: none
949
950    yep:  bool = True
951    nope: bool = False
952    nada: object = None
953
954There's also special support for a default value of ``NULL``, and
955for simple expressions, documented in the following sections.
956
957
958The ``NULL`` default value
959--------------------------
960
961For string and object parameters, you can set them to ``None`` to indicate
962that there's no default.  However, that means the C variable will be
initialized to ``Py_None``.  For convenience's sake, there's a special
964value called ``NULL`` for just this reason: from Python's perspective it
965behaves like a default value of ``None``, but the C variable is initialized
966with ``NULL``.
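
For example, a hypothetical optional ``callback`` parameter could be declared
like this:

.. code-block:: none

    callback: object = NULL

Inside the impl function the C variable is then a null pointer when the
caller omitted the argument, so it can be tested directly (``callback`` is an
assumed name, not taken from CPython)::

    if (callback == NULL) {
        /* The caller did not pass the argument. */
    }
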
967
968Expressions specified as default values
969---------------------------------------
970
971The default value for a parameter can be more than just a literal value.
972It can be an entire expression, using math operators and looking up attributes
973on objects.  However, this support isn't exactly simple, because of some
974non-obvious semantics.
975
976Consider the following example:
977
978.. code-block:: none
979
980    foo: Py_ssize_t = sys.maxsize - 1
981
982``sys.maxsize`` can have different values on different platforms.  Therefore
983Argument Clinic can't simply evaluate that expression locally and hard-code it
984in C.  So it stores the default in such a way that it will get evaluated at
985runtime, when the user asks for the function's signature.
986
987What namespace is available when the expression is evaluated?  It's evaluated
988in the context of the module the builtin came from.  So, if your module has an
989attribute called "``max_widgets``", you may simply use it:
990
991.. code-block:: none
992
993    foo: Py_ssize_t = max_widgets
994
995If the symbol isn't found in the current module, it fails over to looking in
996``sys.modules``.  That's how it can find ``sys.maxsize`` for example.  (Since you
997don't know in advance what modules the user will load into their interpreter,
998it's best to restrict yourself to modules that are preloaded by Python itself.)
999
1000Evaluating default values only at runtime means Argument Clinic can't compute
1001the correct equivalent C default value.  So you need to tell it explicitly.
1002When you use an expression, you must also specify the equivalent expression
1003in C, using the ``c_default`` parameter to the converter:
1004
1005.. code-block:: none
1006
1007    foo: Py_ssize_t(c_default="PY_SSIZE_T_MAX - 1") = sys.maxsize - 1
1008
1009Another complication: Argument Clinic can't know in advance whether or not the
1010expression you supply is valid.  It parses it to make sure it looks legal, but
it can't *actually* know.  When using expressions, you must be very careful to
specify values that are guaranteed to be valid at runtime!
1013
1014Finally, because expressions must be representable as static C values, there
1015are many restrictions on legal expressions.  Here's a list of Python features
1016you're not permitted to use:
1017
1018* Function calls.
1019* Inline if statements (``3 if foo else 5``).
1020* Automatic sequence unpacking (``*[1, 2, 3]``).
1021* List/set/dict comprehensions and generator expressions.
1022* Tuple/list/set/dict literals.
1023
1024
1025
1026Using a return converter
1027------------------------
1028
1029By default the impl function Argument Clinic generates for you returns ``PyObject *``.
1030But your C function often computes some C type, then converts it into the ``PyObject *``
1031at the last moment.  Argument Clinic handles converting your inputs from Python types
1032into native C types—why not have it convert your return value from a native C type
1033into a Python type too?
1034
1035That's what a "return converter" does.  It changes your impl function to return
1036some C type, then adds code to the generated (non-impl) function to handle converting
1037that value into the appropriate ``PyObject *``.
1038
1039The syntax for return converters is similar to that of parameter converters.
1040You specify the return converter like it was a return annotation on the
1041function itself.  Return converters behave much the same as parameter converters;
1042they take arguments, the arguments are all keyword-only, and if you're not changing
1043any of the default arguments you can omit the parentheses.
1044
1045(If you use both ``"as"`` *and* a return converter for your function,
1046the ``"as"`` should come before the return converter.)
1047
1048There's one additional complication when using return converters: how do you
1049indicate an error has occurred?  Normally, a function returns a valid (non-``NULL``)
1050pointer for success, and ``NULL`` for failure.  But if you use an integer return converter,
1051all integers are valid.  How can Argument Clinic detect an error?  Its solution: each return
1052converter implicitly looks for a special value that indicates an error.  If you return
1053that value, and an error has been set (``PyErr_Occurred()`` returns a true
1054value), then the generated code will propagate the error.  Otherwise it will
1055encode the value you return like normal.
1056
1057Currently Argument Clinic supports only a few return converters:
1058
1059.. code-block:: none
1060
1061    bool
1062    int
1063    unsigned int
1064    long
    unsigned long
1066    size_t
1067    Py_ssize_t
1068    float
1069    double
1070    DecodeFSDefault
1071
1072None of these take parameters.  For the first three, return -1 to indicate
1073error.  For ``DecodeFSDefault``, the return type is ``const char *``; return a ``NULL``
1074pointer to indicate an error.
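
As a sketch of how this fits together, here is a hypothetical function using
the ``int`` return converter.  The module, function, and parameter names are
assumptions made for illustration, and the code Clinic generates between the
input block and the function body is omitted; only the ``-> int`` annotation
and the ``-1`` error convention come from the text above::

    /*[clinic input]
    example.widget_count -> int

        limit: int
        /

    Return the number of widgets, up to limit.
    [clinic start generated code]*/

    static int
    example_widget_count_impl(PyObject *module, int limit)
    {
        if (limit < 0) {
            /* Set an exception *and* return -1, so the generated code
               propagates the error instead of returning the integer -1. */
            PyErr_SetString(PyExc_ValueError, "limit must be non-negative");
            return -1;
        }
        return limit < 3 ? limit : 3;
    }
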
1075
1076(There's also an experimental ``NoneType`` converter, which lets you
1077return ``Py_None`` on success or ``NULL`` on failure, without having
1078to increment the reference count on ``Py_None``.  I'm not sure it adds
1079enough clarity to be worth using.)
1080
1081To see all the return converters Argument Clinic supports, along with
1082their parameters (if any),
1083just run ``Tools/clinic/clinic.py --converters`` for the full list.
1084
1085
1086Cloning existing functions
1087--------------------------
1088
1089If you have a number of functions that look similar, you may be able to
1090use Clinic's "clone" feature.  When you clone an existing function,
1091you reuse:
1092
1093* its parameters, including
1094
1095  * their names,
1096
1097  * their converters, with all parameters,
1098
1099  * their default values,
1100
1101  * their per-parameter docstrings,
1102
1103  * their *kind* (whether they're positional only,
1104    positional or keyword, or keyword only), and
1105
1106* its return converter.
1107
1108The only thing not copied from the original function is its docstring;
1109the syntax allows you to specify a new docstring.
1110
1111Here's the syntax for cloning a function::
1112
1113    /*[clinic input]
1114    module.class.new_function [as c_basename] = module.class.existing_function
1115
1116    Docstring for new_function goes here.
1117    [clinic start generated code]*/
1118
1119(The functions can be in different modules or classes.  I wrote
1120``module.class`` in the sample just to illustrate that you must
1121use the full path to *both* functions.)
1122
1123Sorry, there's no syntax for partially-cloning a function, or cloning a function
then modifying it.  Cloning is an all-or-nothing proposition.
1125
1126Also, the function you are cloning from must have been previously defined
1127in the current file.
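
For instance, assuming a function ``example.Widget.grow`` was defined earlier
in the same file (both names here are hypothetical), a sibling with the same
signature could be declared as::

    /*[clinic input]
    example.Widget.shrink = example.Widget.grow

    Shrink the widget by the given amount.
    [clinic start generated code]*/
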
1128
1129Calling Python code
1130-------------------
1131
1132The rest of the advanced topics require you to write Python code
1133which lives inside your C file and modifies Argument Clinic's
runtime state.  This is simple: you define a Python block.
1135
1136A Python block uses different delimiter lines than an Argument
1137Clinic function block.  It looks like this::
1138
1139    /*[python input]
1140    # python code goes here
1141    [python start generated code]*/
1142
1143All the code inside the Python block is executed at the
1144time it's parsed.  All text written to stdout inside the block
1145is redirected into the "output" after the block.
1146
1147As an example, here's a Python block that adds a static integer
1148variable to the C code::
1149
1150    /*[python input]
1151    print('static int __ignored_unused_variable__ = 0;')
1152    [python start generated code]*/
1153    static int __ignored_unused_variable__ = 0;
1154    /*[python checksum:...]*/
1155
1156
1157Using a "self converter"
1158------------------------
1159
1160Argument Clinic automatically adds a "self" parameter for you
1161using a default converter.  It automatically sets the ``type``
1162of this parameter to the "pointer to an instance" you specified
1163when you declared the type.  However, you can override
1164Argument Clinic's converter and specify one yourself.
1165Just add your own ``self`` parameter as the first parameter in a
1166block, and ensure that its converter is an instance of
1167``self_converter`` or a subclass thereof.
1168
1169What's the point?  This lets you override the type of ``self``,
1170or give it a different default name.
1171
1172How do you specify the custom type you want to cast ``self`` to?
1173If you only have one or two functions with the same type for ``self``,
1174you can directly use Argument Clinic's existing ``self`` converter,
1175passing in the type you want to use as the ``type`` parameter::
1176
1177    /*[clinic input]
1178
1179    _pickle.Pickler.dump
1180
1181      self: self(type="PicklerObject *")
1182      obj: object
1183      /
1184
1185    Write a pickled representation of the given object to the open file.
1186    [clinic start generated code]*/
1187
1188On the other hand, if you have a lot of functions that will use the same
1189type for ``self``, it's best to create your own converter, subclassing
1190``self_converter`` but overwriting the ``type`` member::
1191
1192    /*[python input]
1193    class PicklerObject_converter(self_converter):
1194        type = "PicklerObject *"
1195    [python start generated code]*/
1196
1197    /*[clinic input]
1198
1199    _pickle.Pickler.dump
1200
1201      self: PicklerObject
1202      obj: object
1203      /
1204
1205    Write a pickled representation of the given object to the open file.
1206    [clinic start generated code]*/
1207
1208
1209Using a "defining class" converter
1210----------------------------------
1211
1212Argument Clinic facilitates gaining access to the defining class of a method.
1213This is useful for :ref:`heap type <heap-types>` methods that need to fetch
1214module level state.  Use :c:func:`PyType_FromModuleAndSpec` to associate a new
1215heap type with a module.  You can now use :c:func:`PyType_GetModuleState` on
1216the defining class to fetch the module state, for example from a module method.
1217
1218Example from ``Modules/zlibmodule.c``.  First, ``defining_class`` is added to
1219the clinic input::
1220
1221    /*[clinic input]
1222    zlib.Compress.compress
1223
1224      cls: defining_class
1225      data: Py_buffer
1226        Binary data to be compressed.
1227      /
1228
1229
1230After running the Argument Clinic tool, the following function signature is
1231generated::
1232
1233    /*[clinic start generated code]*/
1234    static PyObject *
1235    zlib_Compress_compress_impl(compobject *self, PyTypeObject *cls,
1236                                Py_buffer *data)
1237    /*[clinic end generated code: output=6731b3f0ff357ca6 input=04d00f65ab01d260]*/
1238
1239
1240The following code can now use ``PyType_GetModuleState(cls)`` to fetch the
1241module state::
1242
1243    zlibstate *state = PyType_GetModuleState(cls);
1244
1245
1246Each method may only have one argument using this converter, and it must appear
1247after ``self``, or, if ``self`` is not used, as the first argument.  The argument
1248will be of type ``PyTypeObject *``.  The argument will not appear in the
1249``__text_signature__``.
1250
1251The ``defining_class`` converter is not compatible with ``__init__`` and ``__new__``
1252methods, which cannot use the ``METH_METHOD`` convention.
1253
1254It is not possible to use ``defining_class`` with slot methods.  In order to
1255fetch the module state from such methods, use ``_PyType_GetModuleByDef`` to
1256look up the module and then :c:func:`PyModule_GetState` to fetch the module
1257state.  Example from the ``setattro`` slot method in
1258``Modules/_threadmodule.c``::
1259
1260    static int
1261    local_setattro(localobject *self, PyObject *name, PyObject *v)
1262    {
1263        PyObject *module = _PyType_GetModuleByDef(Py_TYPE(self), &thread_module);
1264        thread_module_state *state = get_thread_state(module);
1265        ...
1266    }
1267
1268
1269See also :pep:`573`.
1270
1271
1272Writing a custom converter
1273--------------------------
1274
As hinted at in the section on self converters, you can write your own converters!
A converter is simply a Python class that inherits from ``CConverter``.
The main purpose of a custom converter is to handle a parameter that uses
the ``O&`` format unit; parsing such a parameter means calling
a :c:func:`PyArg_ParseTuple` "converter function".
1280
1281Your converter class should be named ``*something*_converter``.
1282If the name follows this convention, then your converter class
1283will be automatically registered with Argument Clinic; its name
1284will be the name of your class with the ``_converter`` suffix
1285stripped off.  (This is accomplished with a metaclass.)
1286
You shouldn't override ``CConverter.__init__``.  Instead, you should
1288write a ``converter_init()`` function.  ``converter_init()``
1289always accepts a ``self`` parameter; after that, all additional
1290parameters *must* be keyword-only.  Any arguments passed in to
1291the converter in Argument Clinic will be passed along to your
1292``converter_init()``.
1293
1294There are some additional members of ``CConverter`` you may wish
1295to specify in your subclass.  Here's the current list:
1296
1297``type``
1298    The C type to use for this variable.
1299    ``type`` should be a Python string specifying the type, e.g. ``int``.
1300    If this is a pointer type, the type string should end with ``' *'``.
1301
1302``default``
1303    The Python default value for this parameter, as a Python value.
1304    Or the magic value ``unspecified`` if there is no default.
1305
1306``py_default``
1307    ``default`` as it should appear in Python code,
1308    as a string.
1309    Or ``None`` if there is no default.
1310
1311``c_default``
1312    ``default`` as it should appear in C code,
1313    as a string.
1314    Or ``None`` if there is no default.
1315
1316``c_ignored_default``
1317    The default value used to initialize the C variable when
1318    there is no default, but not specifying a default may
1319    result in an "uninitialized variable" warning.  This can
1320    easily happen when using option groups—although
1321    properly-written code will never actually use this value,
1322    the variable does get passed in to the impl, and the
1323    C compiler will complain about the "use" of the
1324    uninitialized value.  This value should always be a
1325    non-empty string.
1326
1327``converter``
1328    The name of the C converter function, as a string.
1329
1330``impl_by_reference``
1331    A boolean value.  If true,
1332    Argument Clinic will add a ``&`` in front of the name of
1333    the variable when passing it into the impl function.
1334
1335``parse_by_reference``
1336    A boolean value.  If true,
1337    Argument Clinic will add a ``&`` in front of the name of
1338    the variable when passing it into :c:func:`PyArg_ParseTuple`.
1339
1340
1341Here's the simplest example of a custom converter, from ``Modules/zlibmodule.c``::
1342
1343    /*[python input]
1344
1345    class ssize_t_converter(CConverter):
1346        type = 'Py_ssize_t'
1347        converter = 'ssize_t_converter'
1348
1349    [python start generated code]*/
1350    /*[python end generated code: output=da39a3ee5e6b4b0d input=35521e4e733823c7]*/
1351
1352This block adds a converter to Argument Clinic named ``ssize_t``.  Parameters
1353declared as ``ssize_t`` will be declared as type ``Py_ssize_t``, and will
1354be parsed by the ``'O&'`` format unit, which will call the
1355``ssize_t_converter`` converter function.  ``ssize_t`` variables
1356automatically support default values.
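
The converter function itself follows the usual ``O&`` contract of
:c:func:`PyArg_ParseTuple`: it receives the Python object and a ``void *``
pointing at the C variable, and returns ``1`` on success or ``0`` (with an
exception set) on failure.  A minimal sketch of such a function, written for
illustration and not copied from ``Modules/zlibmodule.c``::

    /* Hypothetical O& converter function: returns 1 on success,
       0 on failure with an exception set. */
    static int
    ssize_t_converter(PyObject *obj, void *ptr)
    {
        Py_ssize_t value = PyNumber_AsSsize_t(obj, PyExc_OverflowError);

        if (value == -1 && PyErr_Occurred()) {
            return 0;
        }
        *(Py_ssize_t *)ptr = value;
        return 1;
    }
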
1357
1358More sophisticated custom converters can insert custom C code to
1359handle initialization and cleanup.
1360You can see more examples of custom converters in the CPython
1361source tree; grep the C files for the string ``CConverter``.
1362
1363Writing a custom return converter
1364---------------------------------
1365
Writing a custom return converter is much like writing
a custom converter, except it's somewhat simpler, because return
converters are themselves much simpler.
1369
1370Return converters must subclass ``CReturnConverter``.
1371There are no examples yet of custom return converters,
1372because they are not widely used yet.  If you wish to
1373write your own return converter, please read ``Tools/clinic/clinic.py``,
1374specifically the implementation of ``CReturnConverter`` and
1375all its subclasses.
1376
METH_O and METH_NOARGS
----------------------
1379
1380To convert a function using ``METH_O``, make sure the function's
1381single argument is using the ``object`` converter, and mark the
1382arguments as positional-only::
1383
1384    /*[clinic input]
1385    meth_o_sample
1386
1387         argument: object
1388         /
1389    [clinic start generated code]*/
1390
1391
1392To convert a function using ``METH_NOARGS``, just don't specify
1393any arguments.
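
For example, the following block declares no parameters at all (the function
name is hypothetical, mirroring ``meth_o_sample`` above)::

    /*[clinic input]
    meth_noargs_sample

    Docstring for meth_noargs_sample goes here.
    [clinic start generated code]*/
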
1394
1395You can still use a self converter, a return converter, and specify
1396a ``type`` argument to the object converter for ``METH_O``.
1397
tp_new and tp_init functions
----------------------------
1400
1401You can convert ``tp_new`` and ``tp_init`` functions.  Just name
1402them ``__new__`` or ``__init__`` as appropriate.  Notes:
1403
1404* The function name generated for ``__new__`` doesn't end in ``__new__``
1405  like it would by default.  It's just the name of the class, converted
1406  into a valid C identifier.
1407
1408* No ``PyMethodDef`` ``#define`` is generated for these functions.
1409
1410* ``__init__`` functions return ``int``, not ``PyObject *``.
1411
1412* Use the docstring as the class docstring.
1413
1414* Although ``__new__`` and ``__init__`` functions must always
1415  accept both the ``args`` and ``kwargs`` objects, when converting
1416  you may specify any signature for these functions that you like.
1417  (If your function doesn't support keywords, the parsing function
  generated will raise an exception if it receives any.)
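
As an illustration, a block for the ``__init__`` of a hypothetical
``example.Widget`` class might look like this.  The class name, parameters,
and docstring are assumptions, and the ``class`` declaration Clinic needs
beforehand is omitted::

    /*[clinic input]
    example.Widget.__init__

        width: int = 0
        height: int = 0

    A widget with the given width and height.
    [clinic start generated code]*/

Per the notes above, the impl function generated for this block returns
``int``, and its docstring serves as the class docstring.
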
1419
1420Changing and redirecting Clinic's output
1421----------------------------------------
1422
1423It can be inconvenient to have Clinic's output interspersed with
1424your conventional hand-edited C code.  Luckily, Clinic is configurable:
1425you can buffer up its output for printing later (or earlier!), or write
1426its output to a separate file.  You can also add a prefix or suffix to
1427every line of Clinic's generated output.
1428
1429While changing Clinic's output in this manner can be a boon to readability,
1430it may result in Clinic code using types before they are defined, or
1431your code attempting to use Clinic-generated code before it is defined.
1432These problems can be easily solved by rearranging the declarations in your file,
1433or moving where Clinic's generated code goes.  (This is why the default behavior
1434of Clinic is to output everything into the current block; while many people
1435consider this hampers readability, it will never require rearranging your
1436code to fix definition-before-use problems.)
1437
1438Let's start with defining some terminology:
1439
1440*field*
1441  A field, in this context, is a subsection of Clinic's output.
1442  For example, the ``#define`` for the ``PyMethodDef`` structure
1443  is a field, called ``methoddef_define``.  Clinic has seven
1444  different fields it can output per function definition:
1445
1446  .. code-block:: none
1447
1448      docstring_prototype
1449      docstring_definition
1450      methoddef_define
1451      impl_prototype
1452      parser_prototype
1453      parser_definition
1454      impl_definition
1455
1456  All the names are of the form ``"<a>_<b>"``,
1457  where ``"<a>"`` is the semantic object represented (the parsing function,
1458  the impl function, the docstring, or the methoddef structure) and ``"<b>"``
1459  represents what kind of statement the field is.  Field names that end in
1460  ``"_prototype"``
1461  represent forward declarations of that thing, without the actual body/data
1462  of the thing; field names that end in ``"_definition"`` represent the actual
1463  definition of the thing, with the body/data of the thing.  (``"methoddef"``
1464  is special, it's the only one that ends with ``"_define"``, representing that
1465  it's a preprocessor #define.)
1466
1467*destination*
1468  A destination is a place Clinic can write output to.  There are
1469  five built-in destinations:
1470
1471  ``block``
1472    The default destination: printed in the output section of
1473    the current Clinic block.
1474
1475  ``buffer``
1476    A text buffer where you can save text for later.  Text sent
1477    here is appended to the end of any existing text.  It's an
1478    error to have any text left in the buffer when Clinic finishes
1479    processing a file.
1480
1481  ``file``
1482    A separate "clinic file" that will be created automatically by Clinic.
1483    The filename chosen for the file is ``{basename}.clinic{extension}``,
1484    where ``basename`` and ``extension`` were assigned the output
1485    from ``os.path.splitext()`` run on the current file.  (Example:
1486    the ``file`` destination for ``_pickle.c`` would be written to
1487    ``_pickle.clinic.c``.)
1488
1489    **Important: When using a** ``file`` **destination, you**
1490    *must check in* **the generated file!**
1491
1492  ``two-pass``
1493    A buffer like ``buffer``.  However, a two-pass buffer can only
1494    be dumped once, and it prints out all text sent to it during
1495    all processing, even from Clinic blocks *after* the dumping point.
1496
1497  ``suppress``
1498    The text is suppressed—thrown away.
1499
1500
1501Clinic defines five new directives that let you reconfigure its output.
1502
1503The first new directive is ``dump``:
1504
1505.. code-block:: none
1506
1507   dump <destination>
1508
1509This dumps the current contents of the named destination into the output of
1510the current block, and empties it.  This only works with ``buffer`` and
1511``two-pass`` destinations.
1512
1513The second new directive is ``output``.  The most basic form of ``output``
1514is like this:
1515
1516.. code-block:: none
1517
1518    output <field> <destination>
1519
1520This tells Clinic to output *field* to *destination*.  ``output`` also
1521supports a special meta-destination, called ``everything``, which tells
1522Clinic to output *all* fields to that *destination*.
1523
1524``output`` has a number of other functions:
1525
1526.. code-block:: none
1527
1528    output push
1529    output pop
1530    output preset <preset>
1531
1532
1533``output push`` and ``output pop`` allow you to push and pop
1534configurations on an internal configuration stack, so that you
1535can temporarily modify the output configuration, then easily restore
1536the previous configuration.  Simply push before your change to save
1537the current configuration, then pop when you wish to restore the
1538previous configuration.
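
As a sketch, a temporary change of configuration might be bracketed like
this.  It assumes the directives are written inside Clinic input blocks, and
sending ``everything`` to ``buffer`` is only an illustrative choice::

    /*[clinic input]
    output push
    output everything buffer
    [clinic start generated code]*/

    /* ... Clinic blocks whose output is saved in the buffer ... */

    /*[clinic input]
    dump buffer
    output pop
    [clinic start generated code]*/
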
1539
1540``output preset`` sets Clinic's output to one of several built-in
1541preset configurations, as follows:
1542
1543  ``block``
1544    Clinic's original starting configuration.  Writes everything
1545    immediately after the input block.
1546
1547    Suppress the ``parser_prototype``
1548    and ``docstring_prototype``, write everything else to ``block``.
1549
1550  ``file``
1551    Designed to write everything to the "clinic file" that it can.
1552    You then ``#include`` this file near the top of your file.
1553    You may need to rearrange your file to make this work, though
1554    usually this just means creating forward declarations for various
1555    ``typedef`` and ``PyTypeObject`` definitions.
1556
1557    Suppress the ``parser_prototype``
1558    and ``docstring_prototype``, write the ``impl_definition`` to
1559    ``block``, and write everything else to ``file``.
1560
1561    The default filename is ``"{dirname}/clinic/{basename}.h"``.
1562
1563  ``buffer``
    Save up most of the output from Clinic, to be written into
    your file near the end.  For C files implementing Python modules
    or builtin types, it's recommended that you dump the buffer
    just above the static structures for your module or
    builtin type; these are normally very near the end.  Using
    ``buffer`` may require even more editing than ``file``, if
    your file has static ``PyMethodDef`` arrays defined in the
    middle of the file.
1572
    Suppress the ``parser_prototype``, ``impl_prototype``,
    and ``docstring_prototype``, write the ``impl_definition`` to
    ``block``, and write everything else to ``buffer``.
1576
1577  ``two-pass``
    Similar to the ``buffer`` preset, but writes forward declarations to
    the ``two-pass`` buffer, and definitions to the ``buffer``.
    This may require less editing than the ``buffer`` preset.
    Dump the ``two-pass`` buffer near the top of your file, and dump
    the ``buffer`` near the end just like you would when using the
    ``buffer`` preset.

    Suppress the ``impl_prototype``, write the ``impl_definition``
    to ``block``, write ``docstring_prototype``, ``methoddef_define``,
    and ``parser_prototype`` to ``two-pass``, and write everything else
    to ``buffer``.
1589
1590  ``partial-buffer``
1591    Similar to the ``buffer`` preset, but writes more things to ``block``,
1592    only writing the really big chunks of generated code to ``buffer``.
1593    This avoids the definition-before-use problem of ``buffer`` completely,
1594    at the small cost of having slightly more stuff in the block's output.
1595    Dump the ``buffer`` near the end, just like you would when using
1596    the ``buffer`` preset.
1597
    Suppress the ``impl_prototype``, write the ``docstring_definition``
    and ``parser_definition`` to ``buffer``, and write everything else to ``block``.
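
As a reading aid, the presets above can be written down as a lookup table.
This is a sketch transcribed from the prose, not Clinic's internal data
structure; the ``PRESETS`` layout and the ``destination_for`` helper are
hypothetical, and only fields named in this section appear as overrides.

.. code-block:: python3

    # Each preset: explicit field -> destination overrides, plus the
    # destination used for "everything else".
    PRESETS = {
        "block": ({"parser_prototype": "suppress",
                   "docstring_prototype": "suppress"}, "block"),
        "file": ({"parser_prototype": "suppress",
                  "docstring_prototype": "suppress",
                  "impl_definition": "block"}, "file"),
        # Default "buffer": this preset saves most output for a later dump.
        "buffer": ({"parser_prototype": "suppress",
                    "impl_prototype": "suppress",
                    "docstring_prototype": "suppress",
                    "impl_definition": "block"}, "buffer"),
        "two-pass": ({"impl_prototype": "suppress",
                      "impl_definition": "block",
                      "docstring_prototype": "two-pass",
                      "methoddef_define": "two-pass",
                      "parser_prototype": "two-pass"}, "buffer"),
        "partial-buffer": ({"impl_prototype": "suppress",
                            "docstring_definition": "buffer",
                            "parser_definition": "buffer"}, "block"),
    }

    def destination_for(preset, field):
        """Return where *field* goes under *preset* in this model."""
        overrides, default = PRESETS[preset]
        return overrides.get(field, default)

For example, ``destination_for("partial-buffer", "parser_definition")``
gives ``"buffer"``, while any field without an explicit override under that
preset falls through to ``"block"``.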
1600
1601The third new directive is ``destination``:
1602
1603.. code-block:: none
1604
1605    destination <name> <command> [...]
1606
1607This performs an operation on the destination named ``name``.
1608
1609There are two defined subcommands: ``new`` and ``clear``.
1610
1611The ``new`` subcommand works like this:
1612
1613.. code-block:: none
1614
1615    destination <name> new <type>
1616
1617This creates a new destination with name ``<name>`` and type ``<type>``.
1618
1619There are five destination types:
1620
1621    ``suppress``
1622        Throws the text away.
1623
1624    ``block``
1625        Writes the text to the current block.  This is what Clinic
1626        originally did.
1627
1628    ``buffer``
1629        A simple text buffer, like the "buffer" builtin destination above.
1630
1631    ``file``
1632        A text file.  The file destination takes an extra argument,
1633        a template to use for building the filename, like so:
1634
1635            destination <name> new <type> <file_template>
1636
        The template can use five strings internally that will be replaced
        by bits of the filename:
1639
1640            {path}
1641                The full path to the file, including directory and full filename.
1642            {dirname}
1643                The name of the directory the file is in.
1644            {basename}
1645                Just the name of the file, not including the directory.
1646            {basename_root}
1647                Basename with the extension clipped off
1648                (everything up to but not including the last '.').
1649            {basename_extension}
1650                The last '.' and everything after it.  If the basename
1651                does not contain a period, this will be the empty string.
1652
        If there are no periods in the filename, {basename} and {basename_root}
        are the same, and {basename_extension} is empty.
        "{basename_root}{basename_extension}" is always exactly the same as
        "{basename}".
1656
1657    ``two-pass``
1658        A two-pass buffer, like the "two-pass" builtin destination above.
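
The filename template used by ``file`` destinations can be approximated
with ``str.format()``.  The helper below is a hypothetical sketch, not
Clinic's own code; it uses ``os.path.splitext()`` to stand in for the
"last period" rule described for ``{basename_root}`` and
``{basename_extension}``.

.. code-block:: python3

    import os.path

    def expand_file_template(template, path):
        """Expand a ``file`` destination template for *path* (a sketch)."""
        dirname, basename = os.path.split(path)
        basename_root, basename_extension = os.path.splitext(basename)
        return template.format(
            path=path,
            dirname=dirname,
            basename=basename,
            basename_root=basename_root,
            basename_extension=basename_extension,
        )

    # The default template of the ``file`` preset, applied to _pickle.c.
    result = expand_file_template("{dirname}/clinic/{basename}.h",
                                  "Modules/_pickle.c")
    # Root plus extension always rebuilds the basename.
    rebuilt = expand_file_template("{basename_root}{basename_extension}",
                                   "Modules/_pickle.c")

Here ``result`` is ``"Modules/clinic/_pickle.c.h"`` and ``rebuilt`` is
``"_pickle.c"``.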
1659
1660
1661The ``clear`` subcommand works like this:
1662
1663.. code-block:: none
1664
1665    destination <name> clear
1666
1667It removes all the accumulated text up to this point in the destination.
1668(I don't know what you'd need this for, but I thought maybe it'd be
1669useful while someone's experimenting.)
1670
1671The fourth new directive is ``set``:
1672
1673.. code-block:: none
1674
1675    set line_prefix "string"
1676    set line_suffix "string"
1677
1678``set`` lets you set two internal variables in Clinic.
1679``line_prefix`` is a string that will be prepended to every line of Clinic's output;
1680``line_suffix`` is a string that will be appended to every line of Clinic's output.
1681
1682Both of these support two format strings:
1683
1684  ``{block comment start}``
1685    Turns into the string ``/*``, the start-comment text sequence for C files.
1686
1687  ``{block comment end}``
1688    Turns into the string ``*/``, the end-comment text sequence for C files.
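
The effect of ``line_prefix`` and ``line_suffix`` can be illustrated with a
few lines of Python.  The ``decorate`` function is a hypothetical stand-in
written for this example; it only mirrors the behaviour described above.

.. code-block:: python3

    def decorate(text, line_prefix="", line_suffix=""):
        """Apply a prefix and suffix to every line of *text* (a sketch)."""
        def expand(s):
            # Substitute the two supported format strings.
            return (s.replace("{block comment start}", "/*")
                     .replace("{block comment end}", "*/"))
        prefix, suffix = expand(line_prefix), expand(line_suffix)
        return "\n".join(prefix + line + suffix for line in text.split("\n"))

    commented = decorate("first line\nsecond line",
                         line_prefix="{block comment start} ",
                         line_suffix=" {block comment end}")

With these settings each output line is wrapped in its own C comment, so
``commented`` holds ``/* first line */`` followed by ``/* second line */``.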
1689
1690The final new directive is one you shouldn't need to use directly,
1691called ``preserve``:
1692
1693.. code-block:: none
1694
1695    preserve
1696
1697This tells Clinic that the current contents of the output should be kept, unmodified.
1698This is used internally by Clinic when dumping output into ``file`` files; wrapping
1699it in a Clinic block lets Clinic use its existing checksum functionality to ensure
1700the file was not modified by hand before it gets overwritten.
1701
1702
The #ifdef trick
----------------
1705
1706If you're converting a function that isn't available on all platforms,
1707there's a trick you can use to make life a little easier.  The existing
1708code probably looks like this::
1709
1710    #ifdef HAVE_FUNCTIONNAME
1711    static module_functionname(...)
1712    {
1713    ...
1714    }
1715    #endif /* HAVE_FUNCTIONNAME */
1716
1717And then in the ``PyMethodDef`` structure at the bottom the existing code
1718will have:
1719
1720.. code-block:: none
1721
1722    #ifdef HAVE_FUNCTIONNAME
    {"functionname", ... },
1724    #endif /* HAVE_FUNCTIONNAME */
1725
1726In this scenario, you should enclose the body of your impl function inside the ``#ifdef``,
1727like so::
1728
1729    #ifdef HAVE_FUNCTIONNAME
1730    /*[clinic input]
1731    module.functionname
1732    ...
1733    [clinic start generated code]*/
1734    static module_functionname(...)
1735    {
1736    ...
1737    }
1738    #endif /* HAVE_FUNCTIONNAME */
1739
1740Then, remove those three lines from the ``PyMethodDef`` structure,
1741replacing them with the macro Argument Clinic generated:
1742
1743.. code-block:: none
1744
1745    MODULE_FUNCTIONNAME_METHODDEF
1746
1747(You can find the real name for this macro inside the generated code.
1748Or you can calculate it yourself: it's the name of your function as defined
1749on the first line of your block, but with periods changed to underscores,
1750uppercased, and ``"_METHODDEF"`` added to the end.)
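
The naming rule just described can be written as a one-line Python
function.  The function name is made up for this example; the
transformation itself follows the rule stated above.

.. code-block:: python3

    def methoddef_macro_name(full_name):
        """Derive the macro name from the first line of a Clinic block."""
        # Periods become underscores, the result is uppercased, and
        # "_METHODDEF" is appended.
        return full_name.replace(".", "_").upper() + "_METHODDEF"

    macro = methoddef_macro_name("module.functionname")

For the example block above, ``macro`` is
``"MODULE_FUNCTIONNAME_METHODDEF"``.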
1751
1752Perhaps you're wondering: what if ``HAVE_FUNCTIONNAME`` isn't defined?
1753The ``MODULE_FUNCTIONNAME_METHODDEF`` macro won't be defined either!
1754
1755Here's where Argument Clinic gets very clever.  It actually detects that the
1756Argument Clinic block might be deactivated by the ``#ifdef``.  When that
1757happens, it generates a little extra code that looks like this::
1758
1759    #ifndef MODULE_FUNCTIONNAME_METHODDEF
1760        #define MODULE_FUNCTIONNAME_METHODDEF
1761    #endif /* !defined(MODULE_FUNCTIONNAME_METHODDEF) */
1762
1763That means the macro always works.  If the function is defined, this turns
1764into the correct structure, including the trailing comma.  If the function is
1765undefined, this turns into nothing.
1766
1767However, this causes one ticklish problem: where should Argument Clinic put this
1768extra code when using the "block" output preset?  It can't go in the output block,
1769because that could be deactivated by the ``#ifdef``.  (That's the whole point!)
1770
1771In this situation, Argument Clinic writes the extra code to the "buffer" destination.
1772This may mean that you get a complaint from Argument Clinic:
1773
1774.. code-block:: none
1775
1776    Warning in file "Modules/posixmodule.c" on line 12357:
1777    Destination buffer 'buffer' not empty at end of file, emptying.
1778
1779When this happens, just open your file, find the ``dump buffer`` block that
1780Argument Clinic added to your file (it'll be at the very bottom), then
1781move it above the ``PyMethodDef`` structure where that macro is used.
1782
1783
1784
1785Using Argument Clinic in Python files
1786-------------------------------------
1787
1788It's actually possible to use Argument Clinic to preprocess Python files.
1789There's no point to using Argument Clinic blocks, of course, as the output
1790wouldn't make any sense to the Python interpreter.  But using Argument Clinic
1791to run Python blocks lets you use Python as a Python preprocessor!
1792
1793Since Python comments are different from C comments, Argument Clinic
1794blocks embedded in Python files look slightly different.  They look like this:
1795
1796.. code-block:: python3
1797
1798    #/*[python input]
1799    #print("def foo(): pass")
1800    #[python start generated code]*/
1801    def foo(): pass
1802    #/*[python checksum:...]*/
1803