`For` instance for Generators #169

cscherrer · 2020-08-17T17:12:50Z

@zenon noticed in #166 that rand gives incorrect results for Generators. The bug is specific to v0.12; it is already fixed in master.

So first, yes, we need a new release.

But also...

In v0.12, we have the incorrect method definition

@inline function rand(d :: For{F,T,D,X}) where {F,T <: Base.Generator, D, X}
    rand.(Base.Generator(d.θ.f, d.θ.iter))
end

In 0345f04 this was updated to

@inline function rand(d::For{F,T,D,X}) where {F,T<:Base.Generator,D,X}
    return rand.(Base.Generator(d.f, d.θ))
end

But thinking more about this, I wonder if it would be better to have

@inline function rand(d::For{F,T,D,X}) where {F,T<:Base.Generator,D,X}
    return rand.(Base.Generator(d.f ∘ d.θ.f, d.θ.iter))
end

Or even

@inline function rand(d::For{F,T,D,X}) where {F,T<:Base.Generator,D,X}
    return Base.Generator(rand ∘ d.f ∘ d.θ.f, d.θ.iter)
end

This would avoid allocation. We need to test...

Would this lead to better composability?
Would this improve performance?

DilumAluthge · 2020-08-17T17:26:32Z

Could you explain the difference between each of these?

cscherrer · 2020-08-17T17:32:04Z

Sure :)

This one

@inline function rand(d::For{F,T,D,X}) where {F,T<:Base.Generator,D,X}
    return rand.(Base.Generator(d.f, d.θ))
end

Builds a Generator-within-a-Generator, while this

@inline function rand(d::For{F,T,D,X}) where {F,T<:Base.Generator,D,X}
    return rand.(Base.Generator(d.f ∘ d.θ.f, d.θ.iter))
end

unrolls it so you just have a single Generator. These may lead to the same compiled code; I'm not sure. But in both cases, the outer broadcast `rand.(...)` forces the whole thing into an Array.
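To make the nested-versus-flattened distinction concrete, here is a minimal sketch using only Base. The names `inner` and `outer` are hypothetical stand-ins for `d.θ.f` and `d.f`, and `identity.` stands in for the broadcast `rand.`; none of this is Soss code.

```julia
# Hypothetical stand-ins: `inner` plays the role of d.θ.f, `outer` the role of d.f.
inner = sqrt
outer = x -> 2x
iter = 1:5

θ = Base.Generator(inner, iter)                 # same values as (sqrt(p) for p in 1:5)

nested = Base.Generator(outer, θ)               # a Generator wrapping a Generator
flat   = Base.Generator(outer ∘ inner, iter)    # a single Generator over the raw indices

# The two forms differ in structure but yield the same values.
@assert nested.iter isa Base.Generator
@assert flat.iter isa UnitRange
@assert collect(nested) == collect(flat)

# Broadcasting over a Generator collects it first, so the result is an Array.
materialized = identity.(flat)
@assert materialized isa Vector{Float64}
```

Whether the two forms compile to the same code is a separate question that this sketch does not answer.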

The last one is different, because the end result is a Generator over random numbers. When I mentioned composability, I was thinking of things like streaming the results of one simulation into the input of another.

DilumAluthge · 2020-08-18T00:32:12Z

In the very last example (the one that returns a Generator), if you really want an Array, you can just call collect() on the Generator, right?

cscherrer · 2020-08-18T00:51:50Z

Yes, that's right. Also, I just realized the last example has the interesting property that it acts as a suspended computation:

julia> θ = (sqrt(p) for p in 1:10)
Base.Generator{UnitRange{Int64},typeof(sqrt)}(sqrt, 1:10)

julia> d = For(θ) do θj
           Normal(θj, 1)
       end
For{var"#5#6",Base.Generator{UnitRange{Int64},typeof(sqrt)},Normal{Float64},Float64}(var"#5#6"(), Base.Generator{UnitRange{Int64},typeof(sqrt)}(sqrt, 1:10))

julia> r = rand(d)
Base.Generator{UnitRange{Int64},Base.var"#62#63"{Base.var"#62#63"{typeof(rand),var"#5#6"},typeof(sqrt)}}(Base.var"#62#63"{Base.var"#62#63"{typeof(rand),var"#5#6"},typeof(sqrt)}(Base.var"#62#63"{typeof(rand),var"#5#6"}(rand, var"#5#6"()), sqrt), 1:10)

julia> collect(r)
10-element Array{Float64,1}:
 0.06844850095205879
 4.345937348027204
 0.4260161672012266
 0.9027785360988985
 2.404324563944117
 2.2100818176085286
 2.2332541535358383
 1.6701557218576157
 1.7094957979035412
 3.336007963531211

julia> collect(r)
10-element Array{Float64,1}:
 1.65992410815293
 1.8507072361488428
 2.714179646583497
 1.8856240294357847
 3.247269903712069
 0.38141318206481634
 3.097531577391594
 3.818148250021317
 3.1237990423455737
 3.06097069796954

I'm not sure this is guaranteed behavior, though; I can imagine some optimizations might want to cache it.
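The same effect can be reproduced without Soss or Distributions. This is a minimal sketch, assuming a hypothetical `draw` function as a stand-in for `rand ∘ d.f ∘ d.θ.f`; it shows that each traversal re-runs the function, and two ways to pin a sample down if that is what you want.

```julia
using Random

# Hypothetical stand-in for `rand ∘ d.f ∘ d.θ.f`: a draw from Normal(sqrt(p), 1).
draw(p) = sqrt(p) + randn()

r = Base.Generator(draw, 1:10)   # nothing has been sampled yet

a = collect(r)   # first traversal: 10 draws
b = collect(r)   # second traversal: 10 fresh draws
@assert length(a) == length(b) == 10
@assert a != b   # continuous draws coincide with probability zero

# Option 1: materialize once and reuse the Array.
fixed = collect(r)

# Option 2: reseed the RNG before each traversal to reproduce the same draws.
Random.seed!(1); c = collect(r)
Random.seed!(1); e = collect(r)
@assert c == e
```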

DilumAluthge · 2020-08-18T00:58:29Z

Is the idea that every time you call collect, you get a new sample?

Anyway, I like the last option. Because you can get the Array if you really want, by calling collect. But you don't need to allocate until you are ready.
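As a rough illustration of "don't allocate until you are ready", the sketch below consumes a Generator in three ways. The Generator `g` is a hypothetical stand-in for what the last `rand` variant would return, built from Base only.

```julia
# Hypothetical stand-in for the Generator returned by the last `rand` variant.
g = Base.Generator(p -> sqrt(p) + randn(), 1:1_000)

total      = sum(g)                          # reduces element by element, no result Array
firsts     = collect(Iterators.take(g, 3))   # materialize only the first three draws
everything = collect(g)                      # allocate the full Array when it is needed

@assert total isa Float64
@assert length(firsts) == 3
@assert everything isa Vector{Float64}
@assert length(everything) == 1_000
```

Note that each of the three lines traverses `g` separately, so they see different draws.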

cscherrer · 2020-08-18T01:47:46Z

> Is the idea that every time you call collect, you get a new sample?

Right. The Generator doesn't allocate (or at least not much; I haven't checked), so each traversal has to re-evaluate the function.

> Anyway, I like the last option. Because you can get the Array if you really want, by calling collect. But you don't need to allocate until you are ready.

Right, I like that too. I'm wondering if a sensible approach is that the type of rand(::For) is related to the type of the indices you give it.
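One way this could look is sketched below, with dispatch on the type of the indices. `MiniFor`, `draw`, and `sample` are all hypothetical names made up for this illustration (using `sample` rather than extending `rand`); this is not the Soss implementation.

```julia
# `MiniFor`, `draw`, and `sample` are hypothetical, not Soss types or functions.
struct MiniFor{F,T}
    f::F   # maps an index to the mean of a unit-variance normal
    θ::T   # the indices: an Array, a Generator, ...
end

draw(μ) = μ + randn()

# Array of indices in, Array of samples out (eager).
sample(d::MiniFor{F,T}) where {F,T<:AbstractArray} = map(draw ∘ d.f, d.θ)

# Generator of indices in, Generator of samples out (lazy).
sample(d::MiniFor{F,T}) where {F,T<:Base.Generator} =
    Base.Generator(draw ∘ d.f ∘ d.θ.f, d.θ.iter)

eager = sample(MiniFor(identity, [1.0, 2.0, 3.0]))
lazy  = sample(MiniFor(identity, (sqrt(p) for p in 1:3)))

@assert eager isa Vector{Float64}
@assert lazy isa Base.Generator
@assert length(collect(lazy)) == 3
```

The caller then picks eager or lazy sampling simply by choosing what to index over.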

It's worth noting that arrays can sometimes be faster: despite the allocation, you get cache friendliness. But then again, since Soss generates code, I think down the line we can pull out any compiler trick in the book. So speed should get there, and maybe we should focus first on the semantics we want.

DilumAluthge · 2020-08-18T02:00:43Z

> I'm wondering if a sensible approach is that the type of rand(::For) is related to the type of the indices you give it.

Hmmm. That might make sense. Although I wonder if it would make the API more confusing for new users to learn?

> So speed should get there, and maybe we should focus first on the semantics we want.

For sure

DilumAluthge · 2020-08-18T02:01:28Z

I think the immediate next step is to make a new release off of master so that the bugfix can be deployed.

That buys us some time, I think, to make a decision on the semantics.

cscherrer · 2020-08-18T03:04:28Z

Good call
