Martin

martin@friends.mbober.de

Folge

Netzwerkbeiträge

Berlin, Germany

@martin:matrix.mbober.de

Martin

9 Monate her •

Martin
9 Monate her •

I'm trying to get into #rustlang. I've already rewritten a small program and it was an okay experience but today, I hit something weird.

I have a LinkedList and need to iterate over it and, if I found any item that meets some criteria, I need to remove it from the list.

Using C++ iterators, this would be O(N).

Rust's linked_list has an issue open for more than 4 years (!) to add a remove function to linked lists. But even with that function, you'll need an index of the item to be removed and cannot (as far as I can tell) use the iterator you already have from scanning the list.

So in Rust I would need to spend O(N) to scan the list and an additional O(N) for each item I need removed.

Or is there a faster way to do that?

Tracking Issue for linked_list_remove · Issue #69210 · rust-lang/rust

#68705 adds a method in LinkedList to remove an element at a specified index. The feature gate for the issue is #![feature(linked_list_remove)].

^GitHub

#rustlang

teilten dies erneut

Als Antwort auf Martin

Ian Douglas Scott

Als Antwort auf Martin • 9 Monate her • •

Technically that's also O(n) (O(2n) = O(n)).

I don't see much Rust code using `LinkedList`. Probably its more common to just use `Vec`. Which in many cases will end up performing better overall, regardless of language.

Als Antwort auf Ian Douglas Scott

Martin

Als Antwort auf Ian Douglas Scott • 9 Monate her •

The List grows very dynamically, that's why I want to use LinkedList, because adding, removing and moving within the list is normally cheap with a LinkedList.

Als Antwort auf Martin

brettwitty

Als Antwort auf Martin • 9 Monate her • •

Do you have to remove it from that list, or can you iterate and create the filtered list? I'm unsure if you can .collect() into a LinkedList (assuming you really want one)

Als Antwort auf brettwitty

Martin

Als Antwort auf brettwitty • 9 Monate her •

That sounds like a lote of unnecessary memory allocations to me.

Als Antwort auf Martin

brettwitty

Als Antwort auf Martin • 9 Monate her • •

Possibly! Depends on the context. Try and see, you might be surprised. The costs of these things can be a lot more complicated under the hood than a napkin calculation. Good luck!

Martin mag das.

Als Antwort auf Martin

Epic Eric :thinkhappy:

Als Antwort auf Martin • 9 Monate her • •

The "faster way" depends on your exact use-case. If you intend to remove several items at once sporadically, then [Vec::retain()](https://doc.rust-lang.org/std/vec/struct.Vec.html#method.retain) might be performant enough - especially if you're performing full reads of the resulting vector.

Vec in std::vec - Rust

A contiguous growable array type, written as `Vec`, short for ‘vector’.

^{doc.rust-lang.org}

Als Antwort auf Epic Eric :thinkhappy:

Epic Eric :thinkhappy:

Als Antwort auf Epic Eric :thinkhappy: • 9 Monate her • •

In terms of which collection will best suit your needs, I recommend reading through the doc section "When Should You Use Which Collection?": https://doc.rust-lang.org/std/collections/#when-should-you-use-which-collection

std::collections - Rust

Collection types.

^{doc.rust-lang.org}

Als Antwort auf Epic Eric :thinkhappy:

Martin

Als Antwort auf Epic Eric :thinkhappy: • 9 Monate her •

That guide explicitly says I should use a LinkedList when:

You want to efficiently split and append lists.

Which is exactly why I want to use a linked list. Vec seems like it would do a lot of unnecessary memory allocation for that.

Als Antwort auf Martin

Epic Eric :thinkhappy:

Als Antwort auf Martin • 9 Monate her • •

Fair enough! Then the best way forward would be using nightly Rust in order to enable LinkedList features like remove and retain

Dieser Beitrag wurde bearbeitet. (9 Monate her)

Unbekannter Ursprungsbeitrag

Martin

Unbekannter Ursprungsbeitrag • 9 Monate her •

I don't want to use Vec because:

I don't want the collection to hog more memory than it actually needs.
I want to be able to cheaply extend the collection
I want to be able to cheaply remove items from arbitrary locations within the collection

Unbekannter Ursprungsbeitrag

Martin

Unbekannter Ursprungsbeitrag • 9 Monate her •

Doesn't Vec::retain create a whole new copy of the collection, i.e. I'll need twice the memory at that point?

Unbekannter Ursprungsbeitrag

Martin

Unbekannter Ursprungsbeitrag • 9 Monate her •

Yeah, I'm aware of that cursor API and even ChatGPT guided me towards that, but I don't want to do unstable at this point.

Als Antwort auf Martin

Martin

Als Antwort auf Martin • 9 Monate her •

BTW what I want to do is a cache that stores a key/value pair (both are strings). When an entry is hit, I want to move it to the front of the list.

When there is no hit, a network request is issued to fetch the value for the key. The key/value pair is then emplaced at the front of the list. If the list is then over the maximum capacity, the item at the back shall be removed.

Als Antwort auf Martin

laund [old account, moved]

Als Antwort auf Martin • 9 Monate her • •

huh, i never even knew Rust had a linked list in the stdlib. i wonder what the usecase is over Vec/VecDeque - the docs only say to prefer Vec/VecDeque "almost always" without any info on when to actually use a linked list

Als Antwort auf laund [old account, moved]

Martin

Als Antwort auf laund [old account, moved] • 9 Monate her •

In my experience, LinkedLists are great when the container is growing and shrinking dynamically or when you want to move stuff around within the collection while the rest of the items maintain their order. The downside is expensive random access to elements so list are better suited for use cases where you have to iterate through the collection anyways.

Vectors are better for access to random elements and if you know the size of the collection beforehand. It's also cheaper to insert elements at the end if the vector's current size can hold another element. Inserting elements anywhere other than the end or removing an element that is not at the front or back is ridiculously expensive.

Als Antwort auf Martin

laund [old account, moved]

Als Antwort auf Martin • 9 Monate her • •

i mean i kinda understand that, i just found it interesting the docs don't state this.

Though i kinda wanna do a benchmark of linked list inserting/removing elements in the middle vs vec = vec.iter().filter(remove_condition).collect() - how many elements do i need to filter out for a Vec to be faster? how much slower is iteration of linked lists?

theres a old benchmark by matklad which inserts random numbers into a sorted vec/list, which requires tons of insertions in the middle, that claims Vec is still far more performant at this.

Vev: 33365 μs
List: 184282 μs
https://github.com/matklad/vec-vs-list
but this was 8 years ago with a different linked list.

GitHub - matklad/vec-vs-list: Compare vector vs linked list in terms of performance

Compare vector vs linked list in terms of performance - matklad/vec-vs-list

^GitHub

Martin mag das.

Unbekannter Ursprungsbeitrag

Martin

Unbekannter Ursprungsbeitrag • 9 Monate her •

I cannot use swap_remove because the ordering in the list matters. My use case: https://friends.mbober.de/display/b22eb8e8-4167-19df-204d-582842795229

As far as I know common C++ vector implementations, they do not release memory when removing an element from the vector unless many elements were removed since reducing the size of a vector means allocating new memory and copying all remaining elements which is quite expensive. I assume Rust's vec does the same. That is what I meant by "hogging" memory.

As I'm also exploring Rust for embedded applications. Performance and memory efficiency matters to me and I was told that Rust is as good as C++ in this regard.

Unbekannter Ursprungsbeitrag

Martin

Unbekannter Ursprungsbeitrag • 9 Monate her •

Okay, I just read that it works in place. Still all the elements after the removed element must be copied.

Als Antwort auf Martin

laund [old account, moved]

Als Antwort auf Martin • 9 Monate her • •

@sgued Just a general piece of advice:

Don't confuse "As good as C++ in this regard" with "Works the same way as C++" - while the characteristics can be very similar, the way you'll want to implement things are often quite different.

@Soso

Als Antwort auf Martin

laund [old account, moved]

Als Antwort auf Martin • 9 Monate her • •

@sgued You could consider using or taking inspiration from the LRU used by the Servo browser https://github.com/servo/uluru

It uses a fixed size array to store all items, using u16s to create a linked list using the array indices of your items. Since its backed by a constant size array you know exactly how much memory it uses (well, unless your items are heap pointers like Vec, String etc.) - in this case: (your item + 2*u16)*n + 2*u16 which is also going to be more memory efficient than a linked list utilizing pointers on all architectures where u16 < usize

The fact it uses a static array also makes the crate work in no_std environments without an allocator (embedded).

GitHub - servo/uluru: A simple, fast, LRU cache implementation.

A simple, fast, LRU cache implementation. Contribute to servo/uluru development by creating an account on GitHub.

^GitHub

@Soso

Martin mag das.

⇧