[Comgr][hotswap] Add DecodedInst + basic kernel decoder by tgymnich · Pull Request #2517 · ROCm/llvm-project

tgymnich · 2026-05-13T14:23:37Z

Wraps the MC-layer disassembler in a hotswap-shaped API:

decoded_inst.h — DecodedInst value type bundling MCInst, raw bytes, kernel offset, TSFlags, and an explicit IsBranch / BranchTarget decoration so Phase-3 BB layout can run without re-querying MCInstrDesc.
parsed_reg.h — ParsedReg{Kind, BaseIdx, NReg} value type so handlers can match on register class without re-parsing MCOperand::getReg() at every consumer.
decode.{h,cpp} — decodeKernel walks the .text section once, populates a DecodeResult{Insts, BlockStarts, Offsets} view used by every subsequent raise phase. Block boundaries derive from the branch target set discovered during the linear sweep so Phase 3 can pre-create LLVM BasicBlocks before any IR builder lands.
mc_state.cpp — minor reshuffle to expose the MCDisassembler / MCSubtargetInfo pair to the new decode entry point.

tgymnich · 2026-05-13T14:24:01Z

Warning

Stacked PR

ftynse

Overall, the DecodedInst data structure looks needlessly fat and redundant. A lot of its fields are duplicating information from MCInst, which it also stores, and should be converted into accessor functions. Some flags are relevant for any instruction with a specific name/class, not a particular instance, so there is no value in storing them per-instance except micro-optimization on the hot path (that would require data prior to implementing them). Even when it's a hot path, a plausible solution would be to have a look-aside table with that information and store pointers into that table instead.

There are also comments about string parsing, though I don't really see any code performing, and would complain about it unless the approach is unavoidable. The only place with string parsing seems to be vcmp that I already complained about in the previous PR.

ftynse · 2026-05-18T11:21:06Z

+}
+
+namespace COMGR::hotswap {


Suggested change

}

namespace COMGR::hotswap {

ftynse · 2026-05-18T11:21:13Z

+
+namespace COMGR::hotswap {
+
+struct DecodedInst {


This needs a doc. All of the fields need doc.

ftynse · 2026-05-18T11:21:40Z

+
+} // namespace COMGR::hotswap
+
+#endif


Nit: add a comment saying which #if this closes.

ftynse · 2026-05-18T11:24:46Z

+  std::string Mnemonic;
+  std::string RawMnemonic;


Do we really need to store the textual mnemonic, twice? For every instance of an instruction. Can't we rather have an enum entry and/or the opcode. It also looks like we have the MCInst below, so this may even be redundant.

ftynse · 2026-05-18T11:26:53Z

+struct DecodedInst {
+  std::string Mnemonic;
+  std::string RawMnemonic;
+  std::string FullText;


If we really need to store textual representation of the instruction for some reason, as opposed to generating it on-the-fly from MCInt, maybe we can generate full textual assembly once into a memory mapped file or something like that and only store pointers/StringRefs to that instead of piecemeal dynamic allocations of strings for every instance.

ftynse · 2026-05-18T12:32:44Z

+    DecodedInst Di;
+    Di.RawMnemonic = getMnemonic(Mc, Inst);
+    {
+      std::string S;


SmallString to avoid dynamic allocations.

ftynse · 2026-05-18T12:33:16Z

+      Mc.Printer->printInst(&Inst, 0, "", *Mc.SubtargetInfo, Os);
+      Di.FullText = StringRef(S).ltrim().str();


See comment above about printing the whole function and keeping references instead, assuming this is needed at all.

ftynse · 2026-05-18T12:33:36Z

+      Mc.Printer->printInst(&Inst, 0, "", *Mc.SubtargetInfo, Os);
+      Di.FullText = StringRef(S).ltrim().str();
+    }
+    Di.Mnemonic = stripEncoding(StringRef(Di.RawMnemonic)).str();


Can't this be done on the fly?

ftynse · 2026-05-18T12:34:56Z

+      break;
+    }
+    Off += InstSize;
+  }


Perhaps we want to signal how many bytes we processed if it's not the whole input.

ftynse · 2026-05-18T12:42:47Z

+  static constexpr unsigned KMaxSrcs = 24;
+  unsigned SrcMap[KMaxSrcs] = {};
+  unsigned ModMap[KMaxSrcs] = {};


So we are storing 48*4=192 bytes of information per instruction here where the overwhelming majority of instructions won't ever use 24 sources. Sounds ridiculously wasteful. Just use a SmallVector.

ftynse · 2026-05-18T12:45:06Z

A separate question is testing. We need a way to test this as well.

Wraps the MC-layer disassembler in a hotswap-shaped API: * decoded_inst.h — `DecodedInst` value type bundling MCInst, raw bytes, kernel offset, TSFlags, and an explicit `IsBranch` / `BranchTarget` decoration so Phase-3 BB layout can run without re-querying MCInstrDesc. * parsed_reg.h — `ParsedReg{Kind, BaseIdx, NReg}` value type so handlers can match on register class without re-parsing `MCOperand::getReg()` at every consumer. * decode.{h,cpp} — `decodeKernel` walks the .text section once, populates a `DecodeResult{Insts, BlockStarts, Offsets}` view used by every subsequent raise phase. Block boundaries derive from the branch target set discovered during the linear sweep so Phase 3 can pre-create LLVM BasicBlocks before any IR builder lands. * mc_state.cpp — minor reshuffle to expose the `MCDisassembler` / `MCSubtargetInfo` pair to the new decode entry point. Co-Authored-By: Tim Gymnich <tim@gymni.ch> Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

tgymnich requested review from chinmaydd and lamb-j as code owners May 13, 2026 14:23

tgymnich force-pushed the users/tgymnich/hotswap-pr-03-mc-state branch from 963fd34 to 5f3d5aa Compare May 13, 2026 15:40

tgymnich force-pushed the users/tgymnich/hotswap-pr-04-decoder branch from 6e6f193 to 4d23bed Compare May 13, 2026 15:40

tgymnich requested a review from martin-luecke May 13, 2026 15:56

tgymnich force-pushed the users/tgymnich/hotswap-pr-03-mc-state branch from 5f3d5aa to 5a5c860 Compare May 15, 2026 11:16

tgymnich force-pushed the users/tgymnich/hotswap-pr-04-decoder branch from 4d23bed to f434f7f Compare May 15, 2026 11:16

This was referenced May 15, 2026

[Comgr][hotswap] Add raise_failure + RaiseContext + register file #2551

Open

[Comgr][hotswap] Add raise_cli + lit harness for hotswap-raise fixtures #2552

Open

[Comgr][hotswap] Add SOPP + SOPC handlers #2553

Open

tgymnich requested a review from ftynse May 15, 2026 11:44

tgymnich force-pushed the users/tgymnich/hotswap-pr-03-mc-state branch from 5a5c860 to a098906 Compare May 15, 2026 13:06

tgymnich force-pushed the users/tgymnich/hotswap-pr-04-decoder branch from f434f7f to 5b815e6 Compare May 15, 2026 13:06

tgymnich force-pushed the users/tgymnich/hotswap-pr-03-mc-state branch from a098906 to 4eda47a Compare May 15, 2026 15:28

tgymnich force-pushed the users/tgymnich/hotswap-pr-04-decoder branch from 5b815e6 to 634765b Compare May 15, 2026 15:28

tgymnich force-pushed the users/tgymnich/hotswap-pr-03-mc-state branch from 4eda47a to 4b9f8b4 Compare May 15, 2026 16:07

tgymnich force-pushed the users/tgymnich/hotswap-pr-04-decoder branch from 634765b to a32b336 Compare May 15, 2026 16:07

tgymnich force-pushed the users/tgymnich/hotswap-pr-03-mc-state branch from 4b9f8b4 to 97a7c25 Compare May 15, 2026 16:18

tgymnich force-pushed the users/tgymnich/hotswap-pr-04-decoder branch from a32b336 to 5249422 Compare May 15, 2026 16:18

tgymnich force-pushed the users/tgymnich/hotswap-pr-03-mc-state branch from 97a7c25 to 4dbeb97 Compare May 15, 2026 18:55

tgymnich force-pushed the users/tgymnich/hotswap-pr-04-decoder branch from 5249422 to eac9fc5 Compare May 15, 2026 18:55

tgymnich force-pushed the users/tgymnich/hotswap-pr-03-mc-state branch from 4dbeb97 to 8cbf5bd Compare May 15, 2026 20:45

tgymnich force-pushed the users/tgymnich/hotswap-pr-04-decoder branch from eac9fc5 to c1bbdbf Compare May 15, 2026 20:46

ftynse reviewed May 18, 2026

View reviewed changes

tgymnich force-pushed the users/tgymnich/hotswap-pr-03-mc-state branch from 8cbf5bd to d3b2b77 Compare May 18, 2026 14:38

tgymnich force-pushed the users/tgymnich/hotswap-pr-04-decoder branch from c1bbdbf to 36b17dd Compare May 18, 2026 14:38

tgymnich mentioned this pull request May 18, 2026

[Comgr][hotswap] Add setpc analysis + sync-translation doc #2588

Open

chinmaydd added comgr Related to Code Object Manager hotswap Related to the Comgr Hotswap feature labels May 20, 2026

		Mc.Printer->printInst(&Inst, 0, "", *Mc.SubtargetInfo, Os);
		Di.FullText = StringRef(S).ltrim().str();

Conversation

tgymnich commented May 13, 2026

Uh oh!

tgymnich commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ftynse left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ftynse commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tgymnich commented May 13, 2026 •

edited

Loading