Fix evaluation of non-dot assignments containing forward references #510

parth-07 · 2025-10-30T11:11:33Z

This commit fixes evaluation of non-dot assignments containing forward references. At a high level, eld evaluates assignments in sequence; consequently, expressions containing forward references need to be reevaluated once the forward-referenced value is computed, so that the results are correct. For example:

```
u = v1;    // Assignment 1
v1 = v2;   // Assignment 2
v2 = 0xa;  // Assignment 3
```

eld evaluates the assignments in order, that is, the evaluation happens as: [Assignment 1, Assignment 2, Assignment 3]. If we follow this evaluation order, the symbols u and v1 will have incorrect values because the value of v2 is unknown when the assignments of u and v1 are evaluated.

This commit fixes evaluation of non-dot assignments containing forward references by making the following two changes:

1) Assignments which cannot be completely evaluated during the
   sequential evaluation of expressions are marked as pending
   assignments. After the layout is done, but before relaxation,
   these pending assignments are recursively reevaluated until there
   is no improvement in the pending assignments or a MaxIteration limit
   is reached.
2) During a layout iteration, assignments may get evaluated multiple times,
   as the layout needs to reset itself under a few conditions
   (a new segment needs to be created, and so on). It is important to reset
   the symbol values if the layout gets reset. Let's
   see why it is important with the help of an example:

```
SECTIONS {
  FOO : {
    u = v; // Assignment 1
    v = 0xa; // Assignment 2
    *(.text.foo)
  }
  BAR : {
    v = 0xb; // Assignment 3
  }
  v = 0xc; // Assignment 4
}
```

The sequential assignment evaluation order is: [A1, A2, A3, A4]. When A1 is evaluated, v is not defined, hence we mark A1 as a pending assignment. However, if the layout gets reset after evaluating A2, then A1 will be evaluated again, but this time v is defined (from the evaluation of assignment 2) and thus A1 can be completely evaluated. This is wrong because A1 should get the value from assignment 4 instead of assignment 2!

We fix this issue by resetting the symbol values
whenever a layout is reset.

The same issue happens when the layout needs to be recomputed after a relaxation pass, and the same solution of resetting the symbol values works for this case as well.
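The two changes described above can be illustrated with a small, self-contained model. This is only a sketch under simplifying assumptions, not eld's implementation: the `Assignment` struct, `SymbolTable`, and the functions `evaluate`, `sequentialPass`, `evaluatePending`, and `valueOfU` are all hypothetical names, every right-hand side is either a constant or a single symbol, and the default iteration limit of 10 is an arbitrary choice.

```cpp
#include <cstddef>
#include <cstdint>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical model of `Target = Source;` or `Target = Constant;`.
struct Assignment {
  std::string Target;
  std::string Source;     // empty when the right-hand side is a constant
  std::uint64_t Constant; // used only when Source is empty
};

using SymbolTable = std::map<std::string, std::uint64_t>;

// Evaluates one assignment. Returns false if it references a symbol that
// has no value yet, i.e. the assignment must be marked as pending.
bool evaluate(const Assignment &A, SymbolTable &Syms) {
  if (A.Source.empty()) {
    Syms[A.Target] = A.Constant;
    return true;
  }
  auto It = Syms.find(A.Source);
  if (It == Syms.end())
    return false;
  Syms[A.Target] = It->second;
  return true;
}

// Change 1, part one: evaluate in script order and collect the indices of
// assignments that could not be completely evaluated.
std::vector<std::size_t> sequentialPass(const std::vector<Assignment> &Script,
                                        SymbolTable &Syms) {
  std::vector<std::size_t> Pending;
  for (std::size_t I = 0; I < Script.size(); ++I)
    if (!evaluate(Script[I], Syms))
      Pending.push_back(I);
  return Pending;
}

// Change 1, part two: reevaluate pending assignments until nothing improves
// or the iteration limit is reached. Returns what is still pending.
std::vector<std::size_t> evaluatePending(const std::vector<Assignment> &Script,
                                         std::vector<std::size_t> Pending,
                                         SymbolTable &Syms,
                                         unsigned MaxIteration = 10) {
  for (unsigned Iter = 0; Iter < MaxIteration && !Pending.empty(); ++Iter) {
    std::vector<std::size_t> StillPending;
    for (std::size_t I : Pending)
      if (!evaluate(Script[I], Syms))
        StillPending.push_back(I);
    if (StillPending.size() == Pending.size())
      break; // no improvement, e.g. a cycle or a truly undefined symbol
    Pending = std::move(StillPending);
  }
  return Pending;
}

// Change 2: models the FOO/BAR script with a layout reset after A2.
// Returns the final value of `u`.
std::uint64_t valueOfU(bool ResetSymbolsOnLayoutReset) {
  std::vector<Assignment> Script = {
      {"u", "v", 0}, {"v", "", 0xa}, {"v", "", 0xb}, {"v", "", 0xc}};
  SymbolTable Syms;
  // First layout attempt: A1 and A2 are evaluated, then the layout resets.
  evaluate(Script[0], Syms);
  evaluate(Script[1], Syms);
  if (ResetSymbolsOnLayoutReset)
    Syms.clear();
  // Second layout attempt runs to completion.
  std::vector<std::size_t> Pending = sequentialPass(Script, Syms);
  evaluatePending(Script, Pending, Syms);
  return Syms.at("u");
}
```

In this model, `valueOfU(false)` returns the stale value `0xa` left over from the aborted layout attempt, while `valueOfU(true)` returns `0xc`, the value from the final assignment to `v`. For the three-assignment chain from the first example, `sequentialPass` leaves two assignments pending and `evaluatePending` resolves both within two iterations.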

This commit also adds a new trace category 'pending-assignments' for tracing pending assignment evaluation.

Resolves #505

quic-seaswara

Can we define a hierarchy of what assignments are evaluated in order? Yes, it makes sense to evaluate in the order that the scripts define them, but users would not know the assignment order. I need this to be documented thoroughly.
Please also add assignment IDs, similar to rule IDs, that we can annotate to let the user know which assignment was picked up when evaluating a symbol.

These two patches can be a prerequisite before bringing this patch in.

A few questions before I review this change in more detail:

What happens with PROVIDE assignments?
An example illustration is symdef files and defsym assignments; fixing the order would be crucial.
ld.eld --defsym foo=1 -T script.t --defsym bar=2 (what is the category of symbol assignments assigned to foo=1 and bar=2)
ld.eld --defsym foo=1 obj.symdef -T script.t --defsym bar=2 (what is the category of symbol assignments assigned to foo=1 and bar=2) when there is a symdef file
Section magic symbols and assigning values to them
How is garbage collection affected by what symbols are evaluated, especially with PROVIDE and non-PROVIDE use cases?

quic-seaswara · 2025-10-31T16:25:05Z

include/eld/Diagnostics/DiagVerbose.inc

+DIAG(verbose_performing_layout_iteration, DiagnosticEngine::Verbose,
+     "Performing layout iteration %0")
+DIAG(verbose_eval_pending_assignments, DiagnosticEngine::Verbose,
+     "Evaluating pending assignments")


add a %0 with the assignment expression

We cannot add assignment expression here. This diagnostic marks the beginning of pending assignment evaluations. We do not report this diagnostic per assignment.

quic-seaswara · 2025-10-31T16:25:41Z

include/eld/Diagnostics/DiagVerbose.inc

 DIAG(verbose_infer_target, DiagnosticEngine::Verbose,
     "Inferred target : %0")
+DIAG(verbose_performing_layout_iteration, DiagnosticEngine::Verbose,
+     "Performing layout iteration %0")


Can we add --trace=layout and report the layout diagnostics when tracing layout iterations?

Yes, this sounds good to me. I think a lot of things can be covered under --trace=layout. Can we handle this as a separate task altogether that analyzes which diagnostics should fall under --trace=layout?

quic-seaswara · 2025-10-31T16:27:27Z

include/eld/Target/GNULDBackend.h


  bool m_NeedPhdr = false;
+
+  std::unordered_set<const ResolveInfo *> PartiallyEvaluatedSymbols;


Shouldn't this be PartiallyEvaluatedAssignments instead? Everything is an assignment.

We need both PartiallyEvaluatedAssignments and PartiallyEvaluatedSymbols. If we stored only assignments, we could not determine whether a symbol was partially evaluated, as we do not maintain a mapping of symbol nodes to assignments.
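The reasoning for keeping both containers can be sketched as follows. This is a hypothetical illustration, not the code in this PR: `ResolveInfo` and `Assignment` here are minimal stand-ins for eld's real classes, and `PendingState`, `markPending`, `isPartiallyEvaluated`, and `dependsOnPartialSymbol` are invented names. The point is only that a symbol-keyed set answers "was this symbol partially evaluated?" directly, without a symbol-to-assignment map.

```cpp
#include <string>
#include <unordered_set>
#include <vector>

// Minimal stand-ins for illustration; eld's real classes are richer.
struct ResolveInfo {
  std::string Name;
};

struct Assignment {
  const ResolveInfo *Target;                 // symbol being assigned
  std::vector<const ResolveInfo *> Operands; // symbols read by the expression
};

struct PendingState {
  // Assignments that must be reevaluated later.
  std::vector<const Assignment *> PartiallyEvaluatedAssignments;
  // Symbols whose current value is not final yet.
  std::unordered_set<const ResolveInfo *> PartiallyEvaluatedSymbols;

  // Records a pending assignment together with its target symbol.
  void markPending(const Assignment &A) {
    PartiallyEvaluatedAssignments.push_back(&A);
    PartiallyEvaluatedSymbols.insert(A.Target);
  }

  // Direct symbol lookup; no symbol-to-assignment mapping is needed.
  bool isPartiallyEvaluated(const ResolveInfo *Sym) const {
    return PartiallyEvaluatedSymbols.count(Sym) != 0;
  }

  // An expression that reads a partially evaluated symbol cannot be
  // considered completely evaluated either.
  bool dependsOnPartialSymbol(const Assignment &A) const {
    for (const ResolveInfo *Op : A.Operands)
      if (isPartiallyEvaluated(Op))
        return true;
    return false;
  }
};
```

With only the assignment list, answering the same question would require scanning every pending assignment for a matching target on each symbol lookup.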

quic-seaswara · 2025-10-31T16:28:20Z

lib/SymbolResolver/IRBuilder.cpp


+  // If symbol originates from a linker script, record it in NamePool.
+  if (Input && Input->isLinkerScript())
+    ThisModule.getNamePool().addScriptSymbol(Result.Info);


What about defsym?

defsym is handled in the exact same way as other linker script assignments.

quic-seaswara · 2025-10-31T16:28:27Z

lib/SymbolResolver/IRBuilder.cpp

  }

+  if (Input && Input->isLinkerScript())
+    ThisModule.getNamePool().addScriptSymbol(OutputSym->resolveInfo());


Why in two places?

One is used by non-PROVIDE linker script assignments, and the other is used by PROVIDE linker script assignments.

parth-07 · 2025-11-14T14:18:58Z

@quic-seaswara Thank you for the detailed review.

Can we define a hierarchy of what assignments are evaluated in order? Yes, it makes sense to evaluate in the order that the scripts define them, but users would not know the assignment order. I need this to be documented thoroughly.

I have added documentation detailing linker script symbol assignment evaluation order in #597.

Please also add assignment IDs, similar to rule IDs, that we can annotate to let the user know which assignment was picked up when evaluating a symbol.

I am not sure if we should make it this complicated. It is either the assignment that came before the symbol reference, or the final assignment for the symbol. Those are the only two valid assignments from which the symbol value can be taken. Can we please prioritize this analysis feature later, as per the requirements?

A few questions before I review this change in more detail:

What happens with PROVIDE assignments?

The framework that determines whether a PROVIDE assignment should provide a symbol runs well before the layout starts. We determine which PROVIDE assignments will provide a symbol in the scanRelocations step, which happens before section merging. The framework for recursively evaluating pending assignments, which is what correctly evaluates non-dot forward references, runs entirely in the layout phase. PROVIDE assignments are considered only if they were previously determined to actually provide a symbol.

ld.eld --defsym foo=1 -T script.t --defsym bar=2 (what is the category of symbol assignments assigned to foo=1 and bar=2)

I have added a note for --defsym in the documentation added in #597. In brief, --defsym is treated as a temporary linker script containing only one symbol assignment.

ld.eld --defsym foo=1 obj.symdef -T script.t --defsym bar=2 (what is the category of symbol assignments assigned to foo=1 and bar=2) when there is a symdef file

Currently, we do not create an assignment node for symdef symbol assignments. So, they do not affect assignment evaluation order.

Section magic symbols and assigning values to them

Can you please explain more about this concern?

How is garbage collection affected by what symbols are evaluated, especially with PROVIDE and non-PROVIDE use cases?

I think this falls outside the scope of this PR. This PR only affects forward-reference evaluation in the layout step. Garbage collection happens much earlier.

Signed-off-by: Parth Arora <partaror@qti.qualcomm.com>

partaror requested a review from quic-seaswara October 30, 2025 11:11

parth-07 force-pushed the NonDotForwRef branch 2 times, most recently from e6be6a3 to 56ebad4 Compare October 30, 2025 14:08

quic-seaswara requested changes Oct 31, 2025

parth-07 force-pushed the NonDotForwRef branch from 56ebad4 to 1006bc5 Compare November 14, 2025 14:35

parth-07 requested a review from quic-seaswara November 14, 2025 14:35

partaror requested a review from quic-areg November 14, 2025 14:38

partaror force-pushed the NonDotForwRef branch from 1006bc5 to fe0bd0a Compare November 24, 2025 08:51

parth-07 force-pushed the NonDotForwRef branch from fe0bd0a to b4c13f1 Compare December 1, 2025 15:52

parth-07 force-pushed the NonDotForwRef branch from b4c13f1 to 56498ff Compare December 29, 2025 18:32

parth-07 force-pushed the NonDotForwRef branch from 56498ff to af10850 Compare December 29, 2025 18:49
