
# Tooling to generate interpreters

Documentation for the instruction definitions in `Python/bytecodes.c`
("the DSL") is [here](interpreter_definition.md).

What's currently here:

- `analyzer.py`: code for converting the `AST` generated by `Parser`
  into a higher-level structure that is easier to work with
- `lexer.py`: lexer for C, originally written by Mark Shannon
- `plexer.py`: OO interface on top of lexer.py; main class: `PLexer`
- `parsing.py`: Parser for instruction definition DSL; main class: `Parser`
- `parser.py`: helper for interactions with `parsing.py`
- `tierN_generator.py`: a couple of driver scripts to read `Python/bytecodes.c` and
  write `Python/generated_cases.c.h` (and several other files)
- `optimizer_generator.py`: reads `Python/bytecodes.c` and
  `Python/optimizer_bytecodes.c` and writes
  `Python/optimizer_cases.c.h`
- `stack.py`: code to handle generalized stack effects
- `cwriter.py`: code which understands tokens and how to format C code;
  main class: `CWriter`
- `generators_common.py`: helpers for generators
- `opcode_id_generator.py`: generate a list of opcodes and write them to
  `Include/opcode_ids.h`
- `opcode_metadata_generator.py`: reads the instruction definitions and
  writes the metadata to `Include/internal/pycore_opcode_metadata.h`
- `py_metadata_generator.py`: reads the instruction definitions and
  writes the metadata to `Lib/_opcode_metadata.py`
- `target_generator.py`: generate targets for computed goto dispatch and
  write them to `Python/opcode_targets.h`
- `uop_id_generator.py`: generate a list of uop IDs and write them to
  `Include/internal/pycore_uop_ids.h`
- `uop_metadata_generator.py`: reads the instruction definitions and
  writes the metadata to `Include/internal/pycore_uop_metadata.h`

Note that there is some dummy C code at the top and bottom of
`Python/bytecodes.c` to fool text editors like VS Code into believing
this is valid C code.

## A bit about the parser

The parser class uses a pretty standard recursive descent scheme,
but with unlimited backtracking.
The `PLexer` class tokenizes the entire input before parsing starts.
We do not run the C preprocessor.
Each parsing method returns either an AST node (a `Node` instance)
or `None`, or raises `SyntaxError` (showing the error in the C source).

Most parsing methods are decorated with `@contextual`, which automatically
resets the tokenizer input position when `None` is returned.
When a parsing method returns `None`, it is possible that after backtracking
a different parsing method returns a valid AST node.
A `SyntaxError` raised by a parsing method, by contrast, is not recoverable:
it propagates instead of triggering backtracking.
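
The backtracking scheme described above can be illustrated with a small,
self-contained sketch. This is not the code in `parsing.py` or `plexer.py`:
`ToyParser`, its whitespace-based tokenizer, and the simplified `Node` class
are hypothetical stand-ins, and the `contextual` decorator shown here only
assumes the behavior described above (restore the input position when `None`
is returned).

```python
import functools
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    """Hypothetical, simplified stand-in for an AST node."""
    kind: str
    name: str


def contextual(func):
    """Reset the token position if the wrapped parsing method returns None."""
    @functools.wraps(func)
    def wrapper(self):
        start = self.pos
        result = func(self)
        if result is None:
            self.pos = start  # backtrack so another method can try
        return result
    return wrapper


class ToyParser:
    """Toy recursive descent parser (illustration only)."""

    def __init__(self, source: str) -> None:
        # Tokenize the entire input up front; splitting on whitespace
        # is a deliberate simplification of a real lexer.
        self.tokens = source.split()
        self.pos = 0

    def next(self) -> Optional[str]:
        """Consume and return the next token, or None at end of input."""
        if self.pos < len(self.tokens):
            tok = self.tokens[self.pos]
            self.pos += 1
            return tok
        return None

    def _keyword_then_name(self, keyword: str) -> Optional[Node]:
        if self.next() != keyword:
            return None  # not this alternative; caller's decorator rewinds
        name = self.next()
        if name is None:
            # The keyword matched, so the input is definitely malformed.
            raise SyntaxError(f"expected a name after {keyword!r}")
        return Node(keyword, name)

    @contextual
    def inst(self) -> Optional[Node]:
        return self._keyword_then_name("inst")

    @contextual
    def macro(self) -> Optional[Node]:
        return self._keyword_then_name("macro")

    def definition(self) -> Optional[Node]:
        """Try each alternative in turn; the first non-None result wins."""
        for alternative in (self.inst, self.macro):
            node = alternative()
            if node is not None:
                return node
        return None
```

For the input `"macro FOO"`, `inst()` consumes one token, returns `None`, and
the decorator rewinds the position, so `macro()` then starts from the
beginning and succeeds. For the input `"inst"`, the missing name raises
`SyntaxError`, and no further alternative is tried.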

Neither the lexer nor the parsers are complete or fully correct.
Most known issues are tersely indicated by `# TODO:` comments.
We plan to fix issues as they become relevant.