Lazy Workspace/Project Discovery #17537

davidbarsky · 2024-07-03T13:15:39Z

A few notes:

This issue is my attempt to consolidate discussion from Zulip, A Plan for Making Rust Analyzer Faster #17491, comments, and Proper support for standalone/detached files #14318.
The term "workspace" is often used in rust-analyzer; I'm going to use "project" because an indexed project might not have a 1:1 correspondence to a VS Code workspace. In fact, there might be multiple indexed projects in a single workspace, or there might be no workspace at all!

Anyway! Project discovery should be entirely lazy. This change makes the following easier:

Monorepos. Projects are often discovered incrementally as the user navigates around a monorepo and it doesn't make sense to do Cargo-style project discovery at startup.
Standalone files, like rustlings. Users would be able to just open a Rust file, and through a rust-analyzer.toml in the rustlings repo, IDE functionality would just work for them.
Cargo scripts, which have a similar dynamic to that of rustlings/monorepos but scaled down from the latter.

To make this change happen, the currently eager (that is, they occur on startup/workspace folder change) ProjectManifest::discover_all + cargo metadata-style operations would become lazy, rust-analyzer.workspace.DiscoverCommand-style operations that only happen after startup. This would mean several things:

Project discovery/indexing wouldn't start until a user opens a Rust file.
cargo_metadata::MetadataCommand::new() would become the default mode for flycheck/src/json_workspace.rs (see this comment). That file would no longer be JSON workspace-specific, and the change would also make "project discovery" a first-class concept in rust-analyzer.
rust-analyzer.linkedProjects would technically be lazily evaluated, but if any value is set, it would effectively be "eagerly" evaluated.
The current "rust-analyzer only searches two directory levels down for a Cargo.toml" behavior, which newer Rust users have often struggled with and complained about, can be removed in favor of "run cargo-metadata in the parent of the Rust file"-esque behavior.
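
As a rough illustration of the last point, here is a minimal sketch of discovery that is triggered by opening a file rather than at startup. It assumes a project can be identified by the nearest Cargo.toml above the opened file and ignores Cargo workspaces, rust-project.json, and rust-analyzer.toml entirely. The names nearest_manifest and LazyProjects are hypothetical and are not part of rust-analyzer.

```rust
use std::collections::HashSet;
use std::path::{Path, PathBuf};

/// Walk upward from an opened Rust file and return the closest `Cargo.toml`.
/// Hypothetical helper for illustration; not rust-analyzer's actual API.
pub fn nearest_manifest(opened_file: &Path) -> Option<PathBuf> {
    opened_file
        .ancestors()
        .skip(1) // skip the file itself and start at its parent directory
        .map(|dir| dir.join("Cargo.toml"))
        .find(|candidate| candidate.is_file())
}

/// Tracks which projects have already been discovered (hypothetical type).
#[derive(Default)]
pub struct LazyProjects {
    discovered: HashSet<PathBuf>,
}

impl LazyProjects {
    /// Called when the editor opens a file. Returns the manifest only the
    /// first time its project is seen, which is the point at which a real
    /// implementation might run `cargo metadata` or a discover command.
    pub fn on_file_opened(&mut self, file: &Path) -> Option<PathBuf> {
        let manifest = nearest_manifest(file)?;
        // `insert` returns true only if the manifest was not already known.
        self.discovered.insert(manifest.clone()).then_some(manifest)
    }
}
```

With this shape, opening a second file from the same project returns None, so no extra discovery work is scheduled, and nothing at all happens until the first Rust file is opened.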

To support this change, I think three things need to happen:

feature: teach rust-analyzer to discover linked_projects #17246 needs to land.
The crate graph should be lifted into a standalone, Salsa database.
- Salsa's interning infrastructure should be used with the crate data as the "key". This is necessary in order to support different feature flags/versions across projects.
As a nice performance bonus, the VFS should be able to load all of a project's files in a single go.
- Today, rust-analyzer doesn't have a meaningful distinction between the user-facing "startup" and "steady-state, using-the-IDE" phases. It is always ready to incrementally update and rebuild the crate graph, which it does many times during project loading. I think it is worthwhile to have this distinction because it'd then be possible to load all relevant files in a single turn extremely quickly.
- This is particularly important on network-backed file systems, like EdenFS. I've observed 180x speedups through some naive usage of Rayon.
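
To make the interning idea from the crate graph item more concrete, here is a small sketch that uses a plain HashMap instead of Salsa's interning infrastructure. The types CrateData, CrateId, and CrateInterner are hypothetical, and the chosen fields (name, version, features) are an assumption about what "crate data" would contain. The point is only that the same crate with different feature flags gets a different key, while identical crate data shared across projects maps to one ID.

```rust
use std::collections::{BTreeSet, HashMap};

/// Hypothetical crate description used as the interning key.
/// Features are kept in a BTreeSet so that ordering does not affect equality.
#[derive(Clone, Debug, PartialEq, Eq, Hash)]
pub struct CrateData {
    pub name: String,
    pub version: String,
    pub features: BTreeSet<String>,
}

/// Small, copyable handle that stands in for a full `CrateData`.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub struct CrateId(u32);

/// A hand-rolled interner; Salsa would provide this in a real implementation.
#[derive(Default)]
pub struct CrateInterner {
    ids: HashMap<CrateData, CrateId>,
    data: Vec<CrateData>,
}

impl CrateInterner {
    /// Return the existing ID for identical crate data, or allocate a new one.
    pub fn intern(&mut self, krate: CrateData) -> CrateId {
        if let Some(&id) = self.ids.get(&krate) {
            return id;
        }
        let id = CrateId(self.data.len() as u32);
        self.data.push(krate.clone());
        self.ids.insert(krate, id);
        id
    }

    /// Recover the crate data behind an ID.
    pub fn lookup(&self, id: CrateId) -> &CrateData {
        &self.data[id.0 as usize]
    }
}
```

Two projects that depend on the same crate with the same features would share a CrateId here, while a project that enables an extra feature would get a distinct one.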


Veykril · 2024-07-03T15:11:44Z

cargo_metadata::MetadataCommand::new() would become the default mode for flycheck/src/json_workspace.rs

I'd expect these to still be different from one another. Unless I misunderstand, this phrasing makes it sound like we are unifying Cargo and rust-project.json-like projects.

The crate graph should be lifted into a standalone, Salsa database.

I don't think this is necessary for this change. It is necessary to support standalone files, but that is a separate issue that can follow afterwards.

A nice performance bonus, the VFS should be able to load all a project's files #17491 (comment).

Likewise, I don't think this is necessary either; this is a separate issue as well. We can implement lazy discovery without this change.

davidbarsky · 2024-07-03T15:57:27Z

cargo_metadata::MetadataCommand::new() would become the default mode for flycheck/src/json_workspace.rs

I'd expect these to still be different from one another. Unless I misunderstand, this phrasing makes it sound like we are unifying Cargo and rust-project.json-like projects.

They are different, sorry! I'm trying to say that these would go through similar codepaths/mechanisms, as opposed to being fully distinct, as they are today.

The crate graph should be lifted into a standalone, Salsa database.

I don't think this is necessary for this change. It is necessary to support standalone files, but that is a separate issue that can follow afterwards.

It's not, strictly speaking, necessary for #17246, but it would simplify the currently complicated state machine.

A nice performance bonus, the VFS should be able to load all a project's files #17491 (comment).

Likewise, I don't think this is necessary either; this is a separate issue as well. We can implement lazy discovery without this change.

Same thing: it's not really required, but I really think it would make a lot of the subtle bugs that crop up substantially easier to reason about.

YPares · 2024-11-22T16:41:01Z

I'm currently working on a big monorepo with several independent Rust crates, each one having a different set of sysdeps (provided via Nix and direnv). This would indeed be very helpful because, in our case, it doesn't even make sense to have rust-analyzer load all the crates, as the sysdeps for some may not even be in scope.

What we do right now is open only a subpart of the monorepo in VSCode, to have just one crate in scope each time.

davidbarsky added the E-medium, C-Architecture, and A-project-model labels Jul 3, 2024

davidbarsky mentioned this issue Jul 7, 2024

Proper support for standalone/detached files #14318

Open

davidbarsky mentioned this issue Jul 15, 2024

feature: teach rust-analyzer to discover linked_projects #17246

Merged

davidbarsky mentioned this issue Aug 1, 2024

rust-analyzer not working in vscode #4894

Closed

alibektas mentioned this issue Aug 1, 2024

Read global ratoml before first config query #17712

Open

This was referenced Aug 9, 2024

Support .rust-project.json (i.e. hidden) #17816

Closed

rust-analyzer wanted features & bugfixes Rust-for-Linux/linux#1051

Open

davidbarsky mentioned this issue Sep 24, 2024

internal: add tracing to project discovery and VFS loading #18181

Merged

This was referenced Oct 11, 2024

"rust-analyzer failed to discover workspace" could be so much more helpful #13226

Open

New files in tests/ give "This file is not included in any crates" #18279

Open
