Remove disabled optional weak dependencies from Cargo.lock #11938

est31 · 2023-04-06T02:43:49Z

In #10801, it was noted that Cargo treats weak dependencies just like strong ones in Cargo.lock, as in, includes them unconditionally. Cargo does not build these dependencies though, only uses them during resolution.

Cargo has two resolvers:

one that performs aggressive unification and is used to generate Cargo.lock files
one that takes the enabled features on the command line into account, as well as the target platform

The implementation in #8818 only changed the latter resolver, which implemented weak dependency support for most important workflows, like determining which crates are actually built. The former resolver still keeps a list of enabled and disabled features for each crate, and the way more aggressive unification it performs isn't an issue for weak dependency support, so fundamentally one can add weak dependency support to it, too. This PR does that precisely:

it changes the parts of the resolver that generate Cargo.lock to not include weak dependencies.
it adds a new Cargo.lock ResolveVersion::V4 variant, which isn't emitted by default. The idea is though that at some point in the future cargo switches to the new version by default. I'm open to feedback for different suggestions how to phase in the change, maybe via resolver = 3, or maybe additionally by having an unstable flag so that we can iterate before version 4 is stabilized.

Fixes #10801

rustbot · 2023-04-06T02:43:53Z

r? @epage

(rustbot has picked a reviewer for you, use r? to override)

Eh2406 · 2023-04-10T19:23:38Z

One clarification, I don't think "old" is useful piece of terminology (at least at this time). The older Dependency Resolver works on an NP-Hard problem of determining which versions of transitive dependencies to select. When it hits conflicts it solves them by backtracking and attempting another solution. Its output is reified into a lock file, which must be the same even if created on different systems or when building different artifacts. The newer Feature Resolver takes a concrete set of dependency versions, as outputted by the Dependency Resolver, and decides which features and dependencies actually need to be built for the given invocation. Algorithmically this is a much simpler problem, although the implementation gets tricky with all of the interacting filters it supports. Many new pieces of functionality for the Feature Resolver cannot be included in the Dependency Resolver, because it would lead to different architectures or invocations including different information in the lock file.

I do not have enough implementation detail loaded into my brain at the moment to know if this is a problem, but any change to the Dependency Resolver needs to have a solution for backtracking. (Extensive tests, or detailed comments). Specifically, imagine that Foo is in the tree from three different dependencies, A depends on Foo without any optional features, B depends on Foo with a feature that activates Bar, and C depends on Foo with a week dependency on Bar?/qux. For any order these requirements are processed, will they always get correct solution? Now Furthermore, imagine that qux cannot be resolved, all versions are yanked or something. This can be solved by trying a different version of B or C but not A. Will all of those versions be correctly tried?

est31 · 2023-04-10T20:31:46Z

Thanks for your reply, @Eh2406 !

I don't think "old" is useful piece of terminology (at least at this time).

I used "old" because of the explanation in the comments of the resolver module referring to the other one as the "new" one, so I inferred the other one would be the old one, but I will change the PR description.

The issue with calling the new resolver as the feature aware one is that one might think that the Dependency Resolver has no concept of features at all, which is not the case, very importantly so for this PR as it builds on that. Correct me if I'm wrong but the main difference between the Dependency Resolver and the Feature Resolver is that former performs more aggressive unification, and enables more features at the start that might not be needed in the end (like platform specific ones).

but any change to the Dependency Resolver needs to have a solution for backtracking. (Extensive tests, or detailed comments).

I will add more tests once I find time to work on this PR.

Eh2406 · 2023-04-11T18:35:30Z

I used "old" because

That is entirely fair. It is the terminology we use then. And I am sorry for the confusion that terminology has caused over the years.

The issue with calling the new resolver as the feature aware one is that one might think that the Dependency Resolver has no concept of features at all,

That makes sense. And I'm open to better terminology.

main difference between the Dependency Resolver and the Feature Resolver is that former performs more aggressive unification, and enables more features at the start that might not be needed in the end (like platform specific ones).

The Dependency Resolver specifies the version of every dependency (even if optional and activated by a feature) that will be used for any invocation on any system. Whereas the Feature Resolver figures out which features and dependencies are needed for this particular invocation on this particular system. Because of this distinction there will always be two phases, one of which enables more features to generate a lock file followed by one that more aggressively filters.

taoohong · 2023-05-05T09:06:33Z

Is there any other way to solve this problem now?

Eh2406 · 2023-05-05T17:24:40Z

I'm sorry if I wasn't clear. I do think this is a reasonable and correct approach. I think Ed had some concerns about exactly how we opt into it, and I would like to see more testing. But I am supportive of this work moving forward.

epage · 2023-05-05T17:29:42Z

From my perspective, if a new lockfile version is what is needed, ok. I was then waiting on a response to Jacob's request for tests.

est31 · 2023-05-05T18:23:03Z

I'm sorry if I wasn't clear. I do think this is a reasonable and correct approach. I think Ed had some concerns about exactly how we opt into it, and I would like to see more testing.

Good then I understood it correctly. I didn't think you didn't like the approach of this PR. I agree that tests are important. The reason why I haven't worked more on this PR is that I have other projects going on so I didn't get to it. It is on my TODO list. It doesn't help that the weather is really good now and I go outside in my free time 😆 .

ehuss · 2023-05-05T18:41:20Z

I'm pretty sure this approach won't work. There are several situations where weak dependencies are included through mechanisms that don't directly enable those features (like the user didn't specify the feature, or things like deferred features and such). I think the only option to minimize the entries in the resolve is to include all of the logic from the feature resolver in the dependency resolver, and we avoided that due to the level of complexity needed.

est31 · 2023-05-05T19:22:20Z

like the user didn't specify the feature

The Cargo.lock generating resolver enables all features that a crate has. If that enables any features of dependencies, those are also enabled. This will also enable the weak dependencies. This PR changes nothing about that. However, the weak dependencies of dependencies are only enabled if that feature is. That's precisely what the two present tests test for: disabled_weak_direct_dep and disabled_weak_optional_deps.

deferred features

WDYM by that? I grepped the cargo sources for that, the only place I could find was deferred profiles, which was something about debuginfo, and the deferred_weak_dependencies hashmap of the new resolver. Which seems like an implementation detail of the new resolver to me.

ehuss · 2023-05-05T19:50:38Z

Since the dependency resolver doesn't know about dep: features, it isn't able to have a feature associated with the dependency in order to also trigger a weak dependency.

Deferred features is indeed related to the implementation of the feature resolver. When it sees the ? syntax, and it isn't already enabled, it needs to defer it in case something else enables it later.

TODO better commit msg

est31 · 2023-05-05T20:34:28Z

I still don't understand your first paragraph. Would it be possible to give an example?

I now see what you mean in your second paragraph about deferred features. Indeed, if I adapt the deferred test from the testsuite, it fails. I have pushed the failing test to demonstrate it. I was a bit surprised that this didn't work before as similar logic is needed for dependency unification, but now I'm not sure if it's possible to implement such logic via backtracking. IDK.

weihanglo · 2023-05-05T20:37:32Z

FWIW, if we want to bump the lockfile version, I have a list of what could be bumped together.

ehuss · 2023-05-05T21:43:13Z

Here's an example:

[package]
name = "z73"
version = "0.1.0"
edition = "2021"

[dependencies]
serde = {version = "1", optional = true }

[features]
foo = ["dep:serde"]
bar = ["serde?/derive"]

est31 · 2023-05-05T22:28:19Z

@ehuss thanks, that looks like the weak_namespaced test in weak_dep_features.rs. I will try to adapt it to v4 lockfiles, but in theory it should work with little modification as the Cargo.lock creating resolver can also handle FeatureValue::Dep. There might be one 1 line change needed though, the code doesn't look 100% correct.

Edit: pushed now, dep: works with a 1 line change. So only the unification support is an issue then (plus tests, plus any unknown issue).

bors · 2023-06-17T14:32:26Z

☔ The latest upstream changes (presumably #12279) made this pull request unmergeable. Please resolve the merge conflicts.

djc · 2023-07-04T08:22:02Z

@ehuss do you see any path forward here?

It's important to me that my Cargo.lock somewhat accurately reflects the dependencies that are being built for my products, and this makes that substantially harder. (See for example, launchbadge/sqlx#2579.)

est31 · 2023-07-04T09:08:28Z

The blocker is the unification support (see the failing test), which also is kinda important. I need to have a closer look but right now I don't even know if this PR's approach is viable at all. An alternative to this PR might be to use the "features"/new resolver with some different parameters (to make it similar to the "dependencies"/old resolver).

ehuss · 2023-07-06T14:19:27Z

Unfortunately I don't see a path forward here. The current design was intentionally chosen to manage the complexity of the implementation. It might be possible to do post-resolution filtering, but that would require running feature resolution twice which could have a noticeable performance hit.

juancampa · 2023-08-28T05:50:19Z

Should this PR be closed then @ehuss?

est31 · 2023-08-28T06:56:01Z

Yeah I think I will close this, even though I really want the feature. I don't have the time right now to fix the issues. Others are invited to work on this. Maybe in the future the approach could be tried out that we run feature resolution twice. It might have a performance impact, yes, but it will also create more correct output. Of course, if the existing dependency resolver could be made aware, that would be even better, but I'm not sure it is possible. Anyone, feel free to take my PR and make it work.

Sytten · 2023-12-20T19:43:45Z

@ehuss So if I am understanding this correctly. In order to fix the bug, we would need to unify both resolvers to use the same logic? Do we have an idea of the scope of the work and if so would it be possible to provide some pointers. I would be willing to help, but I have no idea where I would start.

I am trying to see if there is a path forward for this issue that lingers and is becoming more problematic as many libraries are adopting the optional dependencies.

Eh2406 · 2023-12-24T03:57:43Z

@Sytten The resolvers were not unified on purpose. The complexity of doing whole graph optimization like the feature resolver while the graph is still being constructed and can be changed like the dependency resolver is likely to be too complex to be maintainable. (There were some other implementation suggestions made above, the details of which I don't think I fully understood.)

The dependency resolver has to go back and reevaluate alternatives when it finds a problem. This makes it very sensitive to what order it tries to evaluate constraints. These are the backtracking concerns I mentioned above. This makes it incredibly difficult to be sure that a handful of tests actually covers all the important permutations of decision-making. (The tests not only needs to hit all the corner cases, it needs to try all of the orders of evaluation.) The only tool I have found useful for finding these bugs is the prop tests, which generate random crate version graphs and make sure the induced resolutions make sense. The most important of these is the SatResolve, where we compare the result to a simplified resolver that uses the varisat library. Unfortunately SatResolve does not (yet) know about features, and the prop test generator doesn't (yet) know how to generate them.

So I would start by:

add support for features to the SatResolve
add support for generating features to the prop test generator
fix the bugs in the dependency resolver exposed by the new test coverage
(Optional) re-factor the dependency resolver to treat features as a more first-class object that can be turned on and off, instead of an aspect of the dependency requirement.
propose new semantics for week dependencies in the dependency resolver, by specifying it in terms of the SatResolve
add support for generating week features to the prop test generator
attempt to add support for these new semantics to the dependency resolver. This PR will be a good place to start. But step 4, may be required to get the backtracking correct.

The easiest way to implement step 7 may be to adopt some wrapper around the pubgrub-rs library. I am busy working on that library to make it production ready for this purpose. I believe the lowering from cargo's understanding of dependency resolution to pubgrub's will automatically require/involve step 4, and should make at least "week" features not too complicated. I would happily support someone who wants to contribute by working on 1 - 6 while I continue to work on pubgrub!

P.S. @epage: having written this to do list up, is there a more appropriate place for it to be recorded?

Sytten · 2023-12-24T13:02:57Z

@Eh2406 This is a really good starter thanks for that. I am going to try to tackle the first point this week. I have goos experience in rust but never contributed to cargo so I will likely need to have some reviews along the way. Is it ok if I tag you in draft PR?

Eh2406 · 2023-12-24T15:25:00Z

@Sytten Absolutely! Also we have Office Hours if you want to chat synchronously. This week schedules may be odd for the US holidays, so ping us on the zulip thread for the slot you want to attend. The SatResolve code is not the best written for understandability, PR's that just refactor it so it makes sense to you are also welcome.

doc: Add README for resolver-tests ### What does this PR try to resolve? This collect the comments from `@Eh2406` about the resolver-tests and add a README to it. ### How should we test and review this PR? ### Additional information Maybe Fixed #13319 See #11938 (comment)

rustbot assigned epage Apr 6, 2023

rustbot added A-dependency-resolution Area: dependency resolution and the resolver A-lockfile Area: Cargo.lock issues S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Apr 6, 2023

est31 force-pushed the weak_deps_cargo_lock branch from 8cda29d to 48abb23 Compare April 6, 2023 02:53

est31 mentioned this pull request Apr 6, 2023

Disabled optional weak dependencies end up in Cargo.lock #10801

Open

est31 force-pushed the weak_deps_cargo_lock branch 2 times, most recently from a0ccf26 to 850fb43 Compare April 6, 2023 14:56

epage mentioned this pull request Apr 19, 2023

MSRV-dependent dependency version resolution #9930

Open

25 tasks

est31 added 3 commits May 5, 2023 22:01

Don't include disabled weak dependencies

2ca2ec2

TODO better commit msg

Add ResolveVersion gating

315023b

Add tests

2539c74

est31 force-pushed the weak_deps_cargo_lock branch from 850fb43 to df83bae Compare May 5, 2023 20:32

Adapt deferred test for optional weak dependencies

4f2d1b0

est31 force-pushed the weak_deps_cargo_lock branch from df83bae to 4f2d1b0 Compare May 5, 2023 20:43

Add test for dep: dependencies

3dd1882

est31 closed this Aug 28, 2023

weihanglo mentioned this pull request Oct 19, 2023

feat: stabilize lockfile v4 #12852

Merged

abonander mentioned this pull request Apr 21, 2024

[KNOWN ISSUE] Unexpected/unwanted crates in Cargo.lock launchbadge/sqlx#3211

Open

linyihai mentioned this pull request May 29, 2024

doc: Add README for resolver-tests #13977

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove disabled optional weak dependencies from Cargo.lock #11938

Remove disabled optional weak dependencies from Cargo.lock #11938

est31 commented Apr 6, 2023 •

edited

Loading

rustbot commented Apr 6, 2023

Eh2406 commented Apr 10, 2023

est31 commented Apr 10, 2023

Eh2406 commented Apr 11, 2023

taoohong commented May 5, 2023 •

edited

Loading

Eh2406 commented May 5, 2023

epage commented May 5, 2023

est31 commented May 5, 2023

ehuss commented May 5, 2023

est31 commented May 5, 2023

ehuss commented May 5, 2023

est31 commented May 5, 2023

weihanglo commented May 5, 2023

ehuss commented May 5, 2023

est31 commented May 5, 2023 •

edited

Loading

bors commented Jun 17, 2023

djc commented Jul 4, 2023

est31 commented Jul 4, 2023

ehuss commented Jul 6, 2023

juancampa commented Aug 28, 2023

est31 commented Aug 28, 2023

Sytten commented Dec 20, 2023

Eh2406 commented Dec 24, 2023

Sytten commented Dec 24, 2023

Eh2406 commented Dec 24, 2023

Remove disabled optional weak dependencies from Cargo.lock #11938

Remove disabled optional weak dependencies from Cargo.lock #11938

Conversation

est31 commented Apr 6, 2023 • edited Loading

rustbot commented Apr 6, 2023

Eh2406 commented Apr 10, 2023

est31 commented Apr 10, 2023

Eh2406 commented Apr 11, 2023

taoohong commented May 5, 2023 • edited Loading

Eh2406 commented May 5, 2023

epage commented May 5, 2023

est31 commented May 5, 2023

ehuss commented May 5, 2023

est31 commented May 5, 2023

ehuss commented May 5, 2023

est31 commented May 5, 2023

weihanglo commented May 5, 2023

ehuss commented May 5, 2023

est31 commented May 5, 2023 • edited Loading

bors commented Jun 17, 2023

djc commented Jul 4, 2023

est31 commented Jul 4, 2023

ehuss commented Jul 6, 2023

juancampa commented Aug 28, 2023

est31 commented Aug 28, 2023

Sytten commented Dec 20, 2023

Eh2406 commented Dec 24, 2023

Sytten commented Dec 24, 2023

Eh2406 commented Dec 24, 2023

est31 commented Apr 6, 2023 •

edited

Loading

taoohong commented May 5, 2023 •

edited

Loading

est31 commented May 5, 2023 •

edited

Loading