-
Notifications
You must be signed in to change notification settings - Fork 291
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
get_many_mut with variable number of keys #332
Comments
Make HashMap extension fn like get_many_mut but for variable sizes See also rust-lang/hashbrown#332
I'd also really like to have this, I'm in a situation right now where I need to have an arbitrary number of mutable references and the way I'm doing it for now isn't great Update: I've made some code that allows you to do this, I can share it if anyone needs it |
The problem with returning a variable number of mutable references is that it requires returning a |
It seems that this would be for the end user to decide, no? |
Keep in mind that |
That seems reasonable on the const size version because those are unlikely to ever be very large, but here indeed that would seem a bit too expensive, so probably flagging entry as currently being borrowed would fix this? By the way, on the const size version, how do you deal with Eq implementations that are inconsistent? ("It's ok I'm equal to nobody - oh wait in fact I'm the same key" -> yields several mutable borrows to the same entry) |
We compare the pointers to the entries that are actually returned by the lookups. |
Ah of course! ^^ The problems becomes similar to this one: |
Unlikely, since you still need to take into account the sort itself |
Currently the implementation is optimized for small |
The sort itself is
For the const-size version it's pretty clear that we want to ensure that we are as optimized as possible for small slice sizes. It's also very practical to be able to pattern-match directly on the result (so we don't want to remove a const size version), and I would imagine as well that the most common case is 2. However it looks like we could provide different APIs through the same function via a Then we could have different versions based on the slice size: let ptrs: Vec<*mut ??> = /* ... */;
if n < 10 { // 10 is the limit for insertion sort in std
// do n^2
} else {
let mut ptrs_sorted = ptrs.clone();
ptrs_sorted.sort_unstable(); // n * log(n)
assert!(ptrs_sorted.windows(2).all(|w| w[0] != w[1]); // n
} having optimal perf for both cases. (That was actually also suggested for the std const version With pointers, it looks like the problem becomes essentially exactly the same as they have in std with indexes. |
Well, as you can see from your own example, you first need to sort, and then go through the array again, looking for the equal pointers. In addition, you do not take into account that sorting will violate the order of the values. That is, the corresponding key will not point to the corresponding value. |
I'm not sure why that is an issue.
I do, that's what the |
Hmm... it might work. It looks like the method you suggested will improve the existing implementation as well. As for variable length. Why not consider using something like https://doc.rust-lang.org/std/array/struct.IntoIter.html. For example, create a separate search array and a separate output array. Then push only unique pointers into the output array and return it as |
I'm afraid I don't understand what you have in mind. Perhaps that would be easier with a code sample? |
This comment was marked as outdated.
This comment was marked as outdated.
Actually I figure out how it can be done. I try to implement it in next two days. |
@Amanieu, @Ten0 What if we provide the following API (draft pull request #408): pub fn try_get_many_key_value_mut<'a, Q, I, const N: usize>(
&mut self,
iter: &mut I,
) -> ArrayIter<(&K, &mut V), N>
where
I: Iterator<Item = &'a Q>,
Q: ?Sized + Hash + Equivalent<K> + 'a,
{
/* implementation */
}
pub unsafe fn try_get_many_key_value_unchecked_mut<'a, Q, I, const N: usize>(
&mut self,
iter: &mut I,
) -> ArrayIter<(&K, &mut V), N>
where
I: Iterator<Item = &'a Q>,
Q: ?Sized + Hash + Equivalent<K> + 'a,
{
/* implementation */
} Where impl<T, const N: usize> ArrayIter<T, N> {
/// Returns an immutable slice of all elements that have not been yielded yet.
#[inline]
pub fn as_slice(&self) -> &[T] {
/* implementation */
}
/// Returns a mutable slice of all elements that have not been yielded yet.
#[inline]
pub fn as_mut_slice(&mut self) -> &mut [T] {
/* implementation */
}
/// Returns an [`ArrayIter`] of the same size as `self`, with function `f`
/// applied to each element in order.
pub fn convert<F, U>(self, mut f: F) -> ArrayIter<U, N>
where
F: FnMut(T) -> U,
{
/* implementation */
}
} |
While your implementation is very clever, I don't think this is an API I actually want in hashbrown. It just ends up being very hard to use correctly (due to the interactions between the As I've said before, you really should just do separate lookups when you actually need the values instead of keeping an arbitrary number of mutable references around. |
get_many_mut
on slices (rust-lang/rust#83608) does not need to handle a variable number of keys (although it might be nice) because, worse case scenario, you can sort your input array of indexes and iterate over it with successivesplit_at_mut
.In this case however, that's not an option, as
HashMap
s don't have the equivalent ofsplit_at_mut
.So it may be useful to be able to lookup a variable (somewhat large) number of distinct keys at the same time in a
HashMap
, and get mutable references back.That is in particular needed when we have a storage of many entries, and we want to run a round of updates on distinct subsets of keys, and want to run these in parallel.
Having
get_many_mut
follow a similar model to what is described at rust-lang/rust#83608 (review) and providing an interface that allows for this would consequently be useful - in fact, that is something I currently would use.The text was updated successfully, but these errors were encountered: