Use a later MIR #543

Nadrieril · 2025-01-31T17:38:32Z

Today we use mir_built, but using mir_drops_elaborated_and_const_checked is required for #152 and #542. It is also likely required for #306.

The text was updated successfully, but these errors were encountered:

Nadrieril · 2025-01-31T17:47:22Z

Here is a sketch of what needs to happen to solve this issue. External help is welcome here.

What I need to know is what happens if instead of mir_built we use mir_drops_elaborated_and_const_checked here:

charon/charon/src/bin/charon-driver/translate/get_mir.rs

Line 31 in 30cab88

let body = tcx.mir_built(local_def_id);

and then run the tests.

The first thing is that our tests are full of incorrect rust (because we don't run all the analyses so not all errors are reported); if that triggers rustc errors I'd welcome a PR that fixes them.

The second thing is that the translation may become incorrect (e.g. because we handle constants wrong) or panic. I'll need reproducers for these issues, or fixes if they're easy to fix.

The final step will be to identify changes that we don't want downstream tools to have to deal with. For example the elaborate_box_derefs pass transforms *b where b: Box<T> into something like *b.0.0.0 which accesses box internals; this is undesirable so we'll need to write a pass that undoes this. I don't know of others, also interested to get an overview of what the changes look like once the translation is correct.

giltho · 2025-01-31T17:52:48Z

For example the elaborate_box_derefs pass transforms *b where b: Box into something like *b.0.0.0 which accesses box internals; this is undesirable so we'll need to write a pass that undoes this.

Why is this undesirable? Not having this means having to make Box a special case during analysis. I understand that in some cases one might want this to not happen, but I would make this pass optional/

giltho · 2025-01-31T18:00:51Z

Also, additional question: in the case of monomorphised MIR, am I correct to expect that most current limitations of Charon would be lifted? E.g. things like impl returns and generic associated types wouldn't be a problem since they would be erased before going to Charon

Nadrieril · 2025-01-31T18:10:20Z

I would make this pass optional

Yes that pass would be optional. It is useful for tools targeting safe rust, for which there should be no raw pointers if there weren't any in their code.

in the case of monomorphised MIR, am I correct to expect that most current limitations of Charon would be lifted? E.g. things like impl returns and generic associated types wouldn't be a problem since they would be erased before going to Charon

That would solve a number limitations yes, that's what the majority of analysis tools do. Another big source of limitations is the proper handling of lifetimes, which isn't relevant if you don't need them. What would be your use-case?

Also note that I believe monomorphized MIR doesn't really exist, i.e. it's built on-the-fly alongside codegen and is not something we can easily retreive.

giltho · 2025-01-31T18:40:43Z

We have two projects, the one that requires Retag does not need lifetime information
The second will need lifetime information though :( from my understanding there is now work done inside the rust compiler that does export this information, but not sure how easy to use

sonmarcho · 2025-01-31T22:40:45Z

For example the elaborate_box_derefs pass transforms *b where b: Box into something like *b.0.0.0 which accesses box internals; this is undesirable so we'll need to write a pass that undoes this.

Why is this undesirable? Not having this means having to make Box a special case during analysis. I understand that in some cases one might want this to not happen, but I would make this pass optional/

Yes, the pass would definitely be optional. But at the same time tools like Aeneas really need not to see raw pointers in place of boxes, so we have to make sure we can reconstruct the boxes before validating a change of the MIR we target.

giltho · 2025-02-01T12:03:40Z

I don't think the target should change
My experience is probably that depending on the project you want some stage of Mir. For me the default should be mir_built and we should be able to ask for another one through a config flag.

sonmarcho · 2025-02-03T09:39:23Z

I don't think the target should change My experience is probably that depending on the project you want some stage of Mir. For me the default should be mir_built and we should be able to ask for another one through a config flag.

Yes, it makes sense to allow the user to choose the stage of MIR they want and it is actually something we used to do at the beginning of the project; we stopped doing it because it revealed too much low level information we couldn't handle yet. As Charon has evolved a lot since then it is probably not an issue anymore, so maybe all we need to do to provide you with the features you need is to add a CLI option so that you can select the MIR stage you want?

On the side, all the tools which use Charon actually need some information such as the result of the drop elaboration which are only computed later in the compilation pipeline. This means we will have to target a later MIR stage at some point anyway, while pretending that we are actually targeting MIR built by reverting some passes like elaborate_box_derefs for the tools that do not want to see the low-level details. But we could do the easy step of adding the CLI option now and do the more complicated changes later. @Nadrieril what do you think?

Nadrieril · 2025-02-03T12:35:27Z

from my understanding there is now work done inside the rust compiler that does export [lifetime] information, but not sure how easy to use

If there is such work, I am not aware of it. I've only heard of tool writers coming up with horrible hacks to get that info out of polonius.

@Nadrieril what do you think?

Agreed, I don't believe the stage of MIR is the right thing we want to tweak. Quite simply, any MIR before drop elaboration is lacking crucial information about the actual code that will be run. My plan is to only get the later MIR and provide options that reconstruct safe abstractions for tools that need them.

R1kM · 2025-02-03T12:39:20Z

from my understanding there is now work done inside the rust compiler that does export [lifetime] information, but not sure how easy to use

If there is such work, I am not aware of it. I've only heard of tool writers coming up with horrible hacks to get that info out of polonius.

Are you thinking of https://doc.rust-lang.org/nightly/nightly-rustc/rustc_borrowck/consumers/fn.get_body_with_borrowck_facts.html maybe?

Nadrieril · 2025-02-03T12:44:27Z

Hm, get_body_with_borrowck_facts has existed for a while, I remember hearing that this wasn't sufficient. For example, this works only on promoted MIR, which is lacking elaborated drops. And I was recently chatting with Ralf about how to get useful lifetime info in runnable code (for Miri) and it seemed like that would take some work.

Nadrieril · 2025-02-03T13:04:19Z

The idea we had was to add a MIR statement that indicates where a given lifetime starts/ends, the benefit being that the statements can be kept across MIR phases without effort. (And apparently such a statement existed in early versions of rust). I'm not aware that anyone is working on implementing this at the moment.

giltho · 2025-02-03T13:58:56Z

If there is such work, I am not aware of it. I've only heard of tool writers coming up with horrible hacks to get that info out of polonius.

Pinging @xldenis who was the tool writer coming up with the horrible hack when I needed that for Gillian-Rust.

giltho · 2025-02-03T14:00:44Z

I know that it is, in general, not enough though. We're are somewhat limited by the proofs we can do because of that

xldenis · 2025-02-03T14:02:26Z

https://github.com/creusot-rs/creusot/blob/6546583679218c1841a824fb8fd547543496bfc0/creusot/src/callbacks.rs#L90-L108

This is what we use in creusot (MirBodyWithBorrowckFacts) what has changed is that last summer we exported the RegionInferenceContext and other helper data which allows us to make meaningful usage of the data the borrowck calculates.

@giltho the problem you're referring to is that we translate some specs from THIR which is before borrowchecking and thus doesn't have this corresponding information.

Nadrieril mentioned this issue Jan 31, 2025

Feature request: Retag statements #542

Open

Nadrieril self-assigned this Feb 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use a later MIR #543

Use a later MIR #543

Nadrieril commented Jan 31, 2025

Nadrieril commented Jan 31, 2025 •

edited

Loading

giltho commented Jan 31, 2025

giltho commented Jan 31, 2025

Nadrieril commented Jan 31, 2025

giltho commented Jan 31, 2025 •

edited

Loading

sonmarcho commented Jan 31, 2025

giltho commented Feb 1, 2025

sonmarcho commented Feb 3, 2025

Nadrieril commented Feb 3, 2025

R1kM commented Feb 3, 2025

Nadrieril commented Feb 3, 2025 •

edited

Loading

Nadrieril commented Feb 3, 2025

giltho commented Feb 3, 2025

giltho commented Feb 3, 2025

xldenis commented Feb 3, 2025

Use a later MIR #543

Use a later MIR #543

Comments

Nadrieril commented Jan 31, 2025

Nadrieril commented Jan 31, 2025 • edited Loading

giltho commented Jan 31, 2025

giltho commented Jan 31, 2025

Nadrieril commented Jan 31, 2025

giltho commented Jan 31, 2025 • edited Loading

sonmarcho commented Jan 31, 2025

giltho commented Feb 1, 2025

sonmarcho commented Feb 3, 2025

Nadrieril commented Feb 3, 2025

R1kM commented Feb 3, 2025

Nadrieril commented Feb 3, 2025 • edited Loading

Nadrieril commented Feb 3, 2025

giltho commented Feb 3, 2025

giltho commented Feb 3, 2025

xldenis commented Feb 3, 2025

Nadrieril commented Jan 31, 2025 •

edited

Loading

giltho commented Jan 31, 2025 •

edited

Loading

Nadrieril commented Feb 3, 2025 •

edited

Loading