1===================== 2TableGen Fundamentals 3===================== 4 5.. contents:: 6 :local: 7 8Introduction 9============ 10 11TableGen's purpose is to help a human develop and maintain records of 12domain-specific information. Because there may be a large number of these 13records, it is specifically designed to allow writing flexible descriptions and 14for common features of these records to be factored out. This reduces the 15amount of duplication in the description, reduces the chance of error, and makes 16it easier to structure domain specific information. 17 18The core part of TableGen `parses a file`_, instantiates the declarations, and 19hands the result off to a domain-specific `TableGen backend`_ for processing. 20The current major user of TableGen is the `LLVM code 21generator <CodeGenerator.html>`_. 22 23Note that if you work on TableGen much, and use emacs or vim, that you can find 24an emacs "TableGen mode" and a vim language file in the ``llvm/utils/emacs`` and 25``llvm/utils/vim`` directories of your LLVM distribution, respectively. 26 27.. _intro: 28 29Basic concepts 30-------------- 31 32TableGen files consist of two key parts: 'classes' and 'definitions', both of 33which are considered 'records'. 34 35**TableGen records** have a unique name, a list of values, and a list of 36superclasses. The list of values is the main data that TableGen builds for each 37record; it is this that holds the domain specific information for the 38application. The interpretation of this data is left to a specific `TableGen 39backend`_, but the structure and format rules are taken care of and are fixed by 40TableGen. 41 42**TableGen definitions** are the concrete form of 'records'. These generally do 43not have any undefined values, and are marked with the '``def``' keyword. 44 45**TableGen classes** are abstract records that are used to build and describe 46other records. These 'classes' allow the end-user to build abstractions for 47either the domain they are targeting (such as "Register", "RegisterClass", and 48"Instruction" in the LLVM code generator) or for the implementor to help factor 49out common properties of records (such as "FPInst", which is used to represent 50floating point instructions in the X86 backend). TableGen keeps track of all of 51the classes that are used to build up a definition, so the backend can find all 52definitions of a particular class, such as "Instruction". 53 54**TableGen multiclasses** are groups of abstract records that are instantiated 55all at once. Each instantiation can result in multiple TableGen definitions. 56If a multiclass inherits from another multiclass, the definitions in the 57sub-multiclass become part of the current multiclass, as if they were declared 58in the current multiclass. 59 60.. _described above: 61 62An example record 63----------------- 64 65With no other arguments, TableGen parses the specified file and prints out all 66of the classes, then all of the definitions. This is a good way to see what the 67various definitions expand to fully. Running this on the ``X86.td`` file prints 68this (at the time of this writing): 69 70.. code-block:: llvm 71 72 ... 73 def ADD32rr { // Instruction X86Inst I 74 string Namespace = "X86"; 75 dag OutOperandList = (outs GR32:$dst); 76 dag InOperandList = (ins GR32:$src1, GR32:$src2); 77 string AsmString = "add{l}\t{$src2, $dst|$dst, $src2}"; 78 list<dag> Pattern = [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]; 79 list<Register> Uses = []; 80 list<Register> Defs = [EFLAGS]; 81 list<Predicate> Predicates = []; 82 int CodeSize = 3; 83 int AddedComplexity = 0; 84 bit isReturn = 0; 85 bit isBranch = 0; 86 bit isIndirectBranch = 0; 87 bit isBarrier = 0; 88 bit isCall = 0; 89 bit canFoldAsLoad = 0; 90 bit mayLoad = 0; 91 bit mayStore = 0; 92 bit isImplicitDef = 0; 93 bit isConvertibleToThreeAddress = 1; 94 bit isCommutable = 1; 95 bit isTerminator = 0; 96 bit isReMaterializable = 0; 97 bit isPredicable = 0; 98 bit hasDelaySlot = 0; 99 bit usesCustomInserter = 0; 100 bit hasCtrlDep = 0; 101 bit isNotDuplicable = 0; 102 bit hasSideEffects = 0; 103 bit neverHasSideEffects = 0; 104 InstrItinClass Itinerary = NoItinerary; 105 string Constraints = ""; 106 string DisableEncoding = ""; 107 bits<8> Opcode = { 0, 0, 0, 0, 0, 0, 0, 1 }; 108 Format Form = MRMDestReg; 109 bits<6> FormBits = { 0, 0, 0, 0, 1, 1 }; 110 ImmType ImmT = NoImm; 111 bits<3> ImmTypeBits = { 0, 0, 0 }; 112 bit hasOpSizePrefix = 0; 113 bit hasAdSizePrefix = 0; 114 bits<4> Prefix = { 0, 0, 0, 0 }; 115 bit hasREX_WPrefix = 0; 116 FPFormat FPForm = ?; 117 bits<3> FPFormBits = { 0, 0, 0 }; 118 } 119 ... 120 121This definition corresponds to the 32-bit register-register ``add`` instruction 122of the x86 architecture. ``def ADD32rr`` defines a record named 123``ADD32rr``, and the comment at the end of the line indicates the superclasses 124of the definition. The body of the record contains all of the data that 125TableGen assembled for the record, indicating that the instruction is part of 126the "X86" namespace, the pattern indicating how the instruction should be 127emitted into the assembly file, that it is a two-address instruction, has a 128particular encoding, etc. The contents and semantics of the information in the 129record are specific to the needs of the X86 backend, and are only shown as an 130example. 131 132As you can see, a lot of information is needed for every instruction supported 133by the code generator, and specifying it all manually would be unmaintainable, 134prone to bugs, and tiring to do in the first place. Because we are using 135TableGen, all of the information was derived from the following definition: 136 137.. code-block:: llvm 138 139 let Defs = [EFLAGS], 140 isCommutable = 1, // X = ADD Y,Z --> X = ADD Z,Y 141 isConvertibleToThreeAddress = 1 in // Can transform into LEA. 142 def ADD32rr : I<0x01, MRMDestReg, (outs GR32:$dst), 143 (ins GR32:$src1, GR32:$src2), 144 "add{l}\t{$src2, $dst|$dst, $src2}", 145 [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; 146 147This definition makes use of the custom class ``I`` (extended from the custom 148class ``X86Inst``), which is defined in the X86-specific TableGen file, to 149factor out the common features that instructions of its class share. A key 150feature of TableGen is that it allows the end-user to define the abstractions 151they prefer to use when describing their information. 152 153Each ``def`` record has a special entry called "NAME". This is the name of the 154record ("``ADD32rr``" above). In the general case ``def`` names can be formed 155from various kinds of string processing expressions and ``NAME`` resolves to the 156final value obtained after resolving all of those expressions. The user may 157refer to ``NAME`` anywhere she desires to use the ultimate name of the ``def``. 158``NAME`` should not be defined anywhere else in user code to avoid conflicts. 159 160Running TableGen 161---------------- 162 163TableGen runs just like any other LLVM tool. The first (optional) argument 164specifies the file to read. If a filename is not specified, ``llvm-tblgen`` 165reads from standard input. 166 167To be useful, one of the `TableGen backends`_ must be used. These backends are 168selectable on the command line (type '``llvm-tblgen -help``' for a list). For 169example, to get a list of all of the definitions that subclass a particular type 170(which can be useful for building up an enum list of these records), use the 171``-print-enums`` option: 172 173.. code-block:: bash 174 175 $ llvm-tblgen X86.td -print-enums -class=Register 176 AH, AL, AX, BH, BL, BP, BPL, BX, CH, CL, CX, DH, DI, DIL, DL, DX, EAX, EBP, EBX, 177 ECX, EDI, EDX, EFLAGS, EIP, ESI, ESP, FP0, FP1, FP2, FP3, FP4, FP5, FP6, IP, 178 MM0, MM1, MM2, MM3, MM4, MM5, MM6, MM7, R10, R10B, R10D, R10W, R11, R11B, R11D, 179 R11W, R12, R12B, R12D, R12W, R13, R13B, R13D, R13W, R14, R14B, R14D, R14W, R15, 180 R15B, R15D, R15W, R8, R8B, R8D, R8W, R9, R9B, R9D, R9W, RAX, RBP, RBX, RCX, RDI, 181 RDX, RIP, RSI, RSP, SI, SIL, SP, SPL, ST0, ST1, ST2, ST3, ST4, ST5, ST6, ST7, 182 XMM0, XMM1, XMM10, XMM11, XMM12, XMM13, XMM14, XMM15, XMM2, XMM3, XMM4, XMM5, 183 XMM6, XMM7, XMM8, XMM9, 184 185 $ llvm-tblgen X86.td -print-enums -class=Instruction 186 ABS_F, ABS_Fp32, ABS_Fp64, ABS_Fp80, ADC32mi, ADC32mi8, ADC32mr, ADC32ri, 187 ADC32ri8, ADC32rm, ADC32rr, ADC64mi32, ADC64mi8, ADC64mr, ADC64ri32, ADC64ri8, 188 ADC64rm, ADC64rr, ADD16mi, ADD16mi8, ADD16mr, ADD16ri, ADD16ri8, ADD16rm, 189 ADD16rr, ADD32mi, ADD32mi8, ADD32mr, ADD32ri, ADD32ri8, ADD32rm, ADD32rr, 190 ADD64mi32, ADD64mi8, ADD64mr, ADD64ri32, ... 191 192The default backend prints out all of the records, as `described above`_. 193 194If you plan to use TableGen, you will most likely have to `write a backend`_ 195that extracts the information specific to what you need and formats it in the 196appropriate way. 197 198.. _parses a file: 199 200TableGen syntax 201=============== 202 203TableGen doesn't care about the meaning of data (that is up to the backend to 204define), but it does care about syntax, and it enforces a simple type system. 205This section describes the syntax and the constructs allowed in a TableGen file. 206 207TableGen primitives 208------------------- 209 210TableGen comments 211^^^^^^^^^^^^^^^^^ 212 213TableGen supports BCPL style "``//``" comments, which run to the end of the 214line, and it also supports **nestable** "``/* */``" comments. 215 216.. _TableGen type: 217 218The TableGen type system 219^^^^^^^^^^^^^^^^^^^^^^^^ 220 221TableGen files are strongly typed, in a simple (but complete) type-system. 222These types are used to perform automatic conversions, check for errors, and to 223help interface designers constrain the input that they allow. Every `value 224definition`_ is required to have an associated type. 225 226TableGen supports a mixture of very low-level types (such as ``bit``) and very 227high-level types (such as ``dag``). This flexibility is what allows it to 228describe a wide range of information conveniently and compactly. The TableGen 229types are: 230 231``bit`` 232 A 'bit' is a boolean value that can hold either 0 or 1. 233 234``int`` 235 The 'int' type represents a simple 32-bit integer value, such as 5. 236 237``string`` 238 The 'string' type represents an ordered sequence of characters of arbitrary 239 length. 240 241``bits<n>`` 242 A 'bits' type is an arbitrary, but fixed, size integer that is broken up 243 into individual bits. This type is useful because it can handle some bits 244 being defined while others are undefined. 245 246``list<ty>`` 247 This type represents a list whose elements are some other type. The 248 contained type is arbitrary: it can even be another list type. 249 250Class type 251 Specifying a class name in a type context means that the defined value must 252 be a subclass of the specified class. This is useful in conjunction with 253 the ``list`` type, for example, to constrain the elements of the list to a 254 common base class (e.g., a ``list<Register>`` can only contain definitions 255 derived from the "``Register``" class). 256 257``dag`` 258 This type represents a nestable directed graph of elements. 259 260``code`` 261 This represents a big hunk of text. This is lexically distinct from string 262 values because it doesn't require escaping double quotes and other common 263 characters that occur in code. 264 265To date, these types have been sufficient for describing things that TableGen 266has been used for, but it is straight-forward to extend this list if needed. 267 268.. _TableGen expressions: 269 270TableGen values and expressions 271^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 272 273TableGen allows for a pretty reasonable number of different expression forms 274when building up values. These forms allow the TableGen file to be written in a 275natural syntax and flavor for the application. The current expression forms 276supported include: 277 278``?`` 279 uninitialized field 280 281``0b1001011`` 282 binary integer value 283 284``07654321`` 285 octal integer value (indicated by a leading 0) 286 287``7`` 288 decimal integer value 289 290``0x7F`` 291 hexadecimal integer value 292 293``"foo"`` 294 string value 295 296``[{ ... }]`` 297 code fragment 298 299``[ X, Y, Z ]<type>`` 300 list value. <type> is the type of the list element and is usually optional. 301 In rare cases, TableGen is unable to deduce the element type in which case 302 the user must specify it explicitly. 303 304``{ a, b, c }`` 305 initializer for a "bits<3>" value 306 307``value`` 308 value reference 309 310``value{17}`` 311 access to one bit of a value 312 313``value{15-17}`` 314 access to multiple bits of a value 315 316``DEF`` 317 reference to a record definition 318 319``CLASS<val list>`` 320 reference to a new anonymous definition of CLASS with the specified template 321 arguments. 322 323``X.Y`` 324 reference to the subfield of a value 325 326``list[4-7,17,2-3]`` 327 A slice of the 'list' list, including elements 4,5,6,7,17,2, and 3 from it. 328 Elements may be included multiple times. 329 330``foreach <var> = [ <list> ] in { <body> }`` 331 332``foreach <var> = [ <list> ] in <def>`` 333 Replicate <body> or <def>, replacing instances of <var> with each value 334 in <list>. <var> is scoped at the level of the ``foreach`` loop and must 335 not conflict with any other object introduced in <body> or <def>. Currently 336 only ``def``\s are expanded within <body>. 337 338``foreach <var> = 0-15 in ...`` 339 340``foreach <var> = {0-15,32-47} in ...`` 341 Loop over ranges of integers. The braces are required for multiple ranges. 342 343``(DEF a, b)`` 344 a dag value. The first element is required to be a record definition, the 345 remaining elements in the list may be arbitrary other values, including 346 nested ```dag``' values. 347 348``!strconcat(a, b)`` 349 A string value that is the result of concatenating the 'a' and 'b' strings. 350 351``str1#str2`` 352 "#" (paste) is a shorthand for !strconcat. It may concatenate things that 353 are not quoted strings, in which case an implicit !cast<string> is done on 354 the operand of the paste. 355 356``!cast<type>(a)`` 357 A symbol of type *type* obtained by looking up the string 'a' in the symbol 358 table. If the type of 'a' does not match *type*, TableGen aborts with an 359 error. !cast<string> is a special case in that the argument must be an 360 object defined by a 'def' construct. 361 362``!subst(a, b, c)`` 363 If 'a' and 'b' are of string type or are symbol references, substitute 'b' 364 for 'a' in 'c.' This operation is analogous to $(subst) in GNU make. 365 366``!foreach(a, b, c)`` 367 For each member 'b' of dag or list 'a' apply operator 'c.' 'b' is a dummy 368 variable that should be declared as a member variable of an instantiated 369 class. This operation is analogous to $(foreach) in GNU make. 370 371``!head(a)`` 372 The first element of list 'a.' 373 374``!tail(a)`` 375 The 2nd-N elements of list 'a.' 376 377``!empty(a)`` 378 An integer {0,1} indicating whether list 'a' is empty. 379 380``!if(a,b,c)`` 381 'b' if the result of 'int' or 'bit' operator 'a' is nonzero, 'c' otherwise. 382 383``!eq(a,b)`` 384 'bit 1' if string a is equal to string b, 0 otherwise. This only operates 385 on string, int and bit objects. Use !cast<string> to compare other types of 386 objects. 387 388Note that all of the values have rules specifying how they convert to values 389for different types. These rules allow you to assign a value like "``7``" 390to a "``bits<4>``" value, for example. 391 392Classes and definitions 393----------------------- 394 395As mentioned in the `intro`_, classes and definitions (collectively known as 396'records') in TableGen are the main high-level unit of information that TableGen 397collects. Records are defined with a ``def`` or ``class`` keyword, the record 398name, and an optional list of "`template arguments`_". If the record has 399superclasses, they are specified as a comma separated list that starts with a 400colon character ("``:``"). If `value definitions`_ or `let expressions`_ are 401needed for the class, they are enclosed in curly braces ("``{}``"); otherwise, 402the record ends with a semicolon. 403 404Here is a simple TableGen file: 405 406.. code-block:: llvm 407 408 class C { bit V = 1; } 409 def X : C; 410 def Y : C { 411 string Greeting = "hello"; 412 } 413 414This example defines two definitions, ``X`` and ``Y``, both of which derive from 415the ``C`` class. Because of this, they both get the ``V`` bit value. The ``Y`` 416definition also gets the Greeting member as well. 417 418In general, classes are useful for collecting together the commonality between a 419group of records and isolating it in a single place. Also, classes permit the 420specification of default values for their subclasses, allowing the subclasses to 421override them as they wish. 422 423.. _value definition: 424.. _value definitions: 425 426Value definitions 427^^^^^^^^^^^^^^^^^ 428 429Value definitions define named entries in records. A value must be defined 430before it can be referred to as the operand for another value definition or 431before the value is reset with a `let expression`_. A value is defined by 432specifying a `TableGen type`_ and a name. If an initial value is available, it 433may be specified after the type with an equal sign. Value definitions require 434terminating semicolons. 435 436.. _let expression: 437.. _let expressions: 438.. _"let" expressions within a record: 439 440'let' expressions 441^^^^^^^^^^^^^^^^^ 442 443A record-level let expression is used to change the value of a value definition 444in a record. This is primarily useful when a superclass defines a value that a 445derived class or definition wants to override. Let expressions consist of the 446'``let``' keyword followed by a value name, an equal sign ("``=``"), and a new 447value. For example, a new class could be added to the example above, redefining 448the ``V`` field for all of its subclasses: 449 450.. code-block:: llvm 451 452 class D : C { let V = 0; } 453 def Z : D; 454 455In this case, the ``Z`` definition will have a zero value for its ``V`` value, 456despite the fact that it derives (indirectly) from the ``C`` class, because the 457``D`` class overrode its value. 458 459.. _template arguments: 460 461Class template arguments 462^^^^^^^^^^^^^^^^^^^^^^^^ 463 464TableGen permits the definition of parameterized classes as well as normal 465concrete classes. Parameterized TableGen classes specify a list of variable 466bindings (which may optionally have defaults) that are bound when used. Here is 467a simple example: 468 469.. code-block:: llvm 470 471 class FPFormat<bits<3> val> { 472 bits<3> Value = val; 473 } 474 def NotFP : FPFormat<0>; 475 def ZeroArgFP : FPFormat<1>; 476 def OneArgFP : FPFormat<2>; 477 def OneArgFPRW : FPFormat<3>; 478 def TwoArgFP : FPFormat<4>; 479 def CompareFP : FPFormat<5>; 480 def CondMovFP : FPFormat<6>; 481 def SpecialFP : FPFormat<7>; 482 483In this case, template arguments are used as a space efficient way to specify a 484list of "enumeration values", each with a "``Value``" field set to the specified 485integer. 486 487The more esoteric forms of `TableGen expressions`_ are useful in conjunction 488with template arguments. As an example: 489 490.. code-block:: llvm 491 492 class ModRefVal<bits<2> val> { 493 bits<2> Value = val; 494 } 495 496 def None : ModRefVal<0>; 497 def Mod : ModRefVal<1>; 498 def Ref : ModRefVal<2>; 499 def ModRef : ModRefVal<3>; 500 501 class Value<ModRefVal MR> { 502 // Decode some information into a more convenient format, while providing 503 // a nice interface to the user of the "Value" class. 504 bit isMod = MR.Value{0}; 505 bit isRef = MR.Value{1}; 506 507 // other stuff... 508 } 509 510 // Example uses 511 def bork : Value<Mod>; 512 def zork : Value<Ref>; 513 def hork : Value<ModRef>; 514 515This is obviously a contrived example, but it shows how template arguments can 516be used to decouple the interface provided to the user of the class from the 517actual internal data representation expected by the class. In this case, 518running ``llvm-tblgen`` on the example prints the following definitions: 519 520.. code-block:: llvm 521 522 def bork { // Value 523 bit isMod = 1; 524 bit isRef = 0; 525 } 526 def hork { // Value 527 bit isMod = 1; 528 bit isRef = 1; 529 } 530 def zork { // Value 531 bit isMod = 0; 532 bit isRef = 1; 533 } 534 535This shows that TableGen was able to dig into the argument and extract a piece 536of information that was requested by the designer of the "Value" class. For 537more realistic examples, please see existing users of TableGen, such as the X86 538backend. 539 540Multiclass definitions and instances 541^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 542 543While classes with template arguments are a good way to factor commonality 544between two instances of a definition, multiclasses allow a convenient notation 545for defining multiple definitions at once (instances of implicitly constructed 546classes). For example, consider an 3-address instruction set whose instructions 547come in two forms: "``reg = reg op reg``" and "``reg = reg op imm``" 548(e.g. SPARC). In this case, you'd like to specify in one place that this 549commonality exists, then in a separate place indicate what all the ops are. 550 551Here is an example TableGen fragment that shows this idea: 552 553.. code-block:: llvm 554 555 def ops; 556 def GPR; 557 def Imm; 558 class inst<int opc, string asmstr, dag operandlist>; 559 560 multiclass ri_inst<int opc, string asmstr> { 561 def _rr : inst<opc, !strconcat(asmstr, " $dst, $src1, $src2"), 562 (ops GPR:$dst, GPR:$src1, GPR:$src2)>; 563 def _ri : inst<opc, !strconcat(asmstr, " $dst, $src1, $src2"), 564 (ops GPR:$dst, GPR:$src1, Imm:$src2)>; 565 } 566 567 // Instantiations of the ri_inst multiclass. 568 defm ADD : ri_inst<0b111, "add">; 569 defm SUB : ri_inst<0b101, "sub">; 570 defm MUL : ri_inst<0b100, "mul">; 571 ... 572 573The name of the resultant definitions has the multidef fragment names appended 574to them, so this defines ``ADD_rr``, ``ADD_ri``, ``SUB_rr``, etc. A defm may 575inherit from multiple multiclasses, instantiating definitions from each 576multiclass. Using a multiclass this way is exactly equivalent to instantiating 577the classes multiple times yourself, e.g. by writing: 578 579.. code-block:: llvm 580 581 def ops; 582 def GPR; 583 def Imm; 584 class inst<int opc, string asmstr, dag operandlist>; 585 586 class rrinst<int opc, string asmstr> 587 : inst<opc, !strconcat(asmstr, " $dst, $src1, $src2"), 588 (ops GPR:$dst, GPR:$src1, GPR:$src2)>; 589 590 class riinst<int opc, string asmstr> 591 : inst<opc, !strconcat(asmstr, " $dst, $src1, $src2"), 592 (ops GPR:$dst, GPR:$src1, Imm:$src2)>; 593 594 // Instantiations of the ri_inst multiclass. 595 def ADD_rr : rrinst<0b111, "add">; 596 def ADD_ri : riinst<0b111, "add">; 597 def SUB_rr : rrinst<0b101, "sub">; 598 def SUB_ri : riinst<0b101, "sub">; 599 def MUL_rr : rrinst<0b100, "mul">; 600 def MUL_ri : riinst<0b100, "mul">; 601 ... 602 603A ``defm`` can also be used inside a multiclass providing several levels of 604multiclass instanciations. 605 606.. code-block:: llvm 607 608 class Instruction<bits<4> opc, string Name> { 609 bits<4> opcode = opc; 610 string name = Name; 611 } 612 613 multiclass basic_r<bits<4> opc> { 614 def rr : Instruction<opc, "rr">; 615 def rm : Instruction<opc, "rm">; 616 } 617 618 multiclass basic_s<bits<4> opc> { 619 defm SS : basic_r<opc>; 620 defm SD : basic_r<opc>; 621 def X : Instruction<opc, "x">; 622 } 623 624 multiclass basic_p<bits<4> opc> { 625 defm PS : basic_r<opc>; 626 defm PD : basic_r<opc>; 627 def Y : Instruction<opc, "y">; 628 } 629 630 defm ADD : basic_s<0xf>, basic_p<0xf>; 631 ... 632 633 // Results 634 def ADDPDrm { ... 635 def ADDPDrr { ... 636 def ADDPSrm { ... 637 def ADDPSrr { ... 638 def ADDSDrm { ... 639 def ADDSDrr { ... 640 def ADDY { ... 641 def ADDX { ... 642 643``defm`` declarations can inherit from classes too, the rule to follow is that 644the class list must start after the last multiclass, and there must be at least 645one multiclass before them. 646 647.. code-block:: llvm 648 649 class XD { bits<4> Prefix = 11; } 650 class XS { bits<4> Prefix = 12; } 651 652 class I<bits<4> op> { 653 bits<4> opcode = op; 654 } 655 656 multiclass R { 657 def rr : I<4>; 658 def rm : I<2>; 659 } 660 661 multiclass Y { 662 defm SS : R, XD; 663 defm SD : R, XS; 664 } 665 666 defm Instr : Y; 667 668 // Results 669 def InstrSDrm { 670 bits<4> opcode = { 0, 0, 1, 0 }; 671 bits<4> Prefix = { 1, 1, 0, 0 }; 672 } 673 ... 674 def InstrSSrr { 675 bits<4> opcode = { 0, 1, 0, 0 }; 676 bits<4> Prefix = { 1, 0, 1, 1 }; 677 } 678 679File scope entities 680------------------- 681 682File inclusion 683^^^^^^^^^^^^^^ 684 685TableGen supports the '``include``' token, which textually substitutes the 686specified file in place of the include directive. The filename should be 687specified as a double quoted string immediately after the '``include``' keyword. 688Example: 689 690.. code-block:: llvm 691 692 include "foo.td" 693 694'let' expressions 695^^^^^^^^^^^^^^^^^ 696 697"Let" expressions at file scope are similar to `"let" expressions within a 698record`_, except they can specify a value binding for multiple records at a 699time, and may be useful in certain other cases. File-scope let expressions are 700really just another way that TableGen allows the end-user to factor out 701commonality from the records. 702 703File-scope "let" expressions take a comma-separated list of bindings to apply, 704and one or more records to bind the values in. Here are some examples: 705 706.. code-block:: llvm 707 708 let isTerminator = 1, isReturn = 1, isBarrier = 1, hasCtrlDep = 1 in 709 def RET : I<0xC3, RawFrm, (outs), (ins), "ret", [(X86retflag 0)]>; 710 711 let isCall = 1 in 712 // All calls clobber the non-callee saved registers... 713 let Defs = [EAX, ECX, EDX, FP0, FP1, FP2, FP3, FP4, FP5, FP6, ST0, 714 MM0, MM1, MM2, MM3, MM4, MM5, MM6, MM7, 715 XMM0, XMM1, XMM2, XMM3, XMM4, XMM5, XMM6, XMM7, EFLAGS] in { 716 def CALLpcrel32 : Ii32<0xE8, RawFrm, (outs), (ins i32imm:$dst,variable_ops), 717 "call\t${dst:call}", []>; 718 def CALL32r : I<0xFF, MRM2r, (outs), (ins GR32:$dst, variable_ops), 719 "call\t{*}$dst", [(X86call GR32:$dst)]>; 720 def CALL32m : I<0xFF, MRM2m, (outs), (ins i32mem:$dst, variable_ops), 721 "call\t{*}$dst", []>; 722 } 723 724File-scope "let" expressions are often useful when a couple of definitions need 725to be added to several records, and the records do not otherwise need to be 726opened, as in the case with the ``CALL*`` instructions above. 727 728It's also possible to use "let" expressions inside multiclasses, providing more 729ways to factor out commonality from the records, specially if using several 730levels of multiclass instanciations. This also avoids the need of using "let" 731expressions within subsequent records inside a multiclass. 732 733.. code-block:: llvm 734 735 multiclass basic_r<bits<4> opc> { 736 let Predicates = [HasSSE2] in { 737 def rr : Instruction<opc, "rr">; 738 def rm : Instruction<opc, "rm">; 739 } 740 let Predicates = [HasSSE3] in 741 def rx : Instruction<opc, "rx">; 742 } 743 744 multiclass basic_ss<bits<4> opc> { 745 let IsDouble = 0 in 746 defm SS : basic_r<opc>; 747 748 let IsDouble = 1 in 749 defm SD : basic_r<opc>; 750 } 751 752 defm ADD : basic_ss<0xf>; 753 754Looping 755^^^^^^^ 756 757TableGen supports the '``foreach``' block, which textually replicates the loop 758body, substituting iterator values for iterator references in the body. 759Example: 760 761.. code-block:: llvm 762 763 foreach i = [0, 1, 2, 3] in { 764 def R#i : Register<...>; 765 def F#i : Register<...>; 766 } 767 768This will create objects ``R0``, ``R1``, ``R2`` and ``R3``. ``foreach`` blocks 769may be nested. If there is only one item in the body the braces may be 770elided: 771 772.. code-block:: llvm 773 774 foreach i = [0, 1, 2, 3] in 775 def R#i : Register<...>; 776 777Code Generator backend info 778=========================== 779 780Expressions used by code generator to describe instructions and isel patterns: 781 782``(implicit a)`` 783 an implicitly defined physical register. This tells the dag instruction 784 selection emitter the input pattern's extra definitions matches implicit 785 physical register definitions. 786 787.. _TableGen backend: 788.. _TableGen backends: 789.. _write a backend: 790 791TableGen backends 792================= 793 794Until we get a step-by-step HowTo for writing TableGen backends, you can at 795least grab the boilerplate (build system, new files, etc.) from Clang's 796r173931. 797 798TODO: How they work, how to write one. This section should not contain details 799about any particular backend, except maybe ``-print-enums`` as an example. This 800should highlight the APIs in ``TableGen/Record.h``. 801