Feature/block offsets #550

sjoerd-bouma · 2023-07-10T14:43:24Z

(Finally) - added a module that can add or remove 'block offsets' as seen in the voltage traces recorded by RNO-G. The fit is done by assuming a repeating rect shape, and fitting this in the Fourier domain, outside the band where the antennas/amplifiers are expected to be sensitive to real signals.

I also added an additional channelParameter that stores the added/removed block offsets; maybe this can be used for the RNOG data reader in the future to ensure that the correction remains reversible. We can also consider adding (some version of) this module in the RNO-G data reader, while we're still waiting for the integration of the equivalent code on the DAQ side, although it's probably worth making sure first that the performance impact of the fitting and/or Fourier transforms is not too high.

…block_offsets

cg-laser

looks good to me but someone with RNO-G data expertise should also have a look and check/test it.

shallmann

This is great! I think it can be merged, since it is standalone... not sure why the doc checks fail.

NuRadioReco/modules/RNO_G/channelBlockOffsetFitter.py

…block_offsets

NuRadioReco/modules/RNO_G/channelBlockOffsetFitter.py

fschlueter · 2023-08-30T11:35:59Z

NuRadioReco/modules/RNO_G/channelBlockOffsetFitter.py

+            - 'fit' (default): fit the block offsets with a minimizer
+            - 'approximate' : use the first guess from the out-of-band component,
+              without any fitting (slightly faster)
+            - 'stored': use the block offsets already stored in the 


If you simulate the block offsets would you not simulated it on a sim_station? Hence, the station would not have this parameter

The 'stored' option is probably superfluous, but I thought maybe one might want to undo the block offsets that have already been removed by e.g. the external calibration, or allow for some other fit to compute the block offsets.

In terms of simulation, right now we don't simulate the block offsets anyway, and because e.g. the timings are different I'm not sure how straightforward it would be to simulate them for the sim_station (rather than for the station, which is what I've been doing so far). Even if we end up doing this for the sim_station, I'd be inclined to leave it to the user to retrieve the simulated parameter from the sim_station to keep a cleaner separation between the two.

fschlueter · 2023-08-30T11:45:28Z

NuRadioReco/modules/RNO_G/channelBlockOffsetFitter.py

+    filtered_trace = fft.freq2time(filtered_trace_fft, sampling_rate)
+
+    # obtain guesses for block offsets
+    a_guess = np.array([


Probably not relevant but using np.split seems to be a bit faster

In [17]: %timeit np.mean(np.split(filtered_trace, n_blocks), axis=1) 19 µs ± 849 ns per loop (mean ± std. dev. of 7 runs, 10,000 loops each) In [18]: %timeit a_guess = np.array([np.mean(filtered_trace[i*block_size:(i+1)*block_size]) for i in range(n_blocks)]) 32.4 µs ± 1.17 µs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)

Did you ever check what the difference is when you use the unfiltered trace?

Thanks, implemented (though I agree I don't think this was the bottleneck). Using the unfiltered trace is a little bit faster but gives a ~4 times larger RMS difference between true and fitted block offset

fschlueter · 2023-09-13T20:52:36Z

Damn, I did not know that I had to submit my comments/review. Sorry Sjoerd that those come so late (they were written a week ago)

…e offsets

sjoerd-bouma · 2023-09-25T14:41:40Z

I fixed (hopefully) Felix's comments, and additionally added the fitter to the mattak datareader. A quick comparison of how fast the different options are:

from NuRadioReco.modules.io.RNO_G.readRNOGDataMattak import readRNOGData
import time
reader = readRNOGData()
results = dict()
for mode in ['none', 'median', 'approximate', 'fit']:
    reader.begin('/home/sb/Python/scratch/data/inbox/station11/run1100/', apply_baseline_correction=mode)
    t0 = time.time()
    n_evts = 0
    max_n_evts = 1e4 if mode != 'fit' else 25

    for evt in reader.run():
        n_evts += 1
        if n_evts >= max_n_evts:
            break

    results[mode] = (time.time() - t0) / n_evts
    
for mode, t in results.items():
    print(f"{mode:12s} : {1e3*t:-9.3f} ms / event")

Giving

none         :     1.888 ms / event
median       :     4.409 ms / event
approximate  :     4.824 ms / event
fit          :   768.660 ms / event

Both 'approximate' and 'median' (= the previous implementation in the reader) are probably good enough for most purposes.

fschlueter · 2023-09-25T16:30:24Z

What happens if you relax the tol argument or introduce a max number of iterations within curve_fit? I assume the fit should converge stabil and fast

sjoerd-bouma · 2023-09-26T15:15:29Z

What happens if you relax the tol argument or introduce a max number of iterations within curve_fit? I assume the fit should converge stabil and fast

I'm definitely spending too much time on this now.... but of course it's a good idea, so I checked using toy Gaussian offsets of 10% on top of a (bandpass-filtered) Gaussian trace.

Basically, after 2 iterations the block offsets are already reduced by 2 orders of magnitude (RMS is normalized to the offset size), which is probably enough in most cases unless the offsets are huge, so I've made this the default maxiter.

I've also removed/deprecated the old baseline correction, and made the bandpass-filtered 'approximate' mode the default, seeing as it was about a factor 4 better without a noticeable decrease in performance. Let me know if you disagree.

fschlueter · 2023-09-26T17:14:54Z

NuRadioReco/modules/io/RNO_G/readRNOGDataMattak.py

    return wfs - baseline_traces

+blockoffsetfitter = channelBlockOffsets()


That that here in between functions

I've made it a private attribute of the readRNOGData class, having readRNOGDataMattak.blockoffsetfitter showing up in IDE autocompletion is probably unnecessary clutter.

fschlueter · 2023-09-26T17:20:32Z

What happens if you relax the tol argument or introduce a max number of iterations within curve_fit? I assume the fit should converge stabil and fast

I'm definitely spending too much time on this now.... but of course it's a good idea, so I checked using toy Gaussian offsets of 10% on top of a (bandpass-filtered) Gaussian trace. Basically, after 2 iterations the block offsets are already reduced by 2 orders of magnitude (RMS is normalized to the offset size), which is probably enough in most cases unless the offsets are huge, so I've made this the default maxiter.

I've also removed/deprecated the old baseline correction, and made the bandpass-filtered 'approximate' mode the default, seeing as it was about a factor 4 better without a noticeable decrease in performance. Let me know if you disagree.

Hi, you tested it with 10% amplitudes of what, a baseline ADC of ~ 1800? I am a bit worried that maxiter=2 is a bit low. Not that an accuracy of 1e-2 is not sufficient but what happens if a fit does not converge as fast as in your test? Looking at your plot: Why don't we set it to ~5. It still gives us a time boost of over one order of magnitude right?

fschlueter · 2023-09-26T17:22:21Z

Deprecating the old code is fine for me. However, I am still wondering a bit why it is not significant faster than the bandpass-filtered 'approximate' mode. After all this includes running an fft which should be quite time consuming compared to the other mathematical operations ?

fschlueter · 2023-09-26T17:23:02Z

PS: An improvement with a factor of 100 in performance is never a waist of time :)

sjoerd-bouma · 2023-09-29T17:17:41Z

Thanks Felix,
I've increased the number of iterations to 5 as suggested - the 99th percentile is now ~2% (was 4%)

The times are a bit different compared to the previous plot, I realize showing the time per channel after earlier providing times per event (= 24 channels) was a bit confusing. So the improvement is not a factor 100 but closer to a factor 10 in speed, unfortunately.

…block_offsets

sjoerd-bouma · 2023-12-01T10:33:21Z

@fschlueter Sorry, created a merge conflict for myself by removing all trailing whitespaces in #557, I've now turned that feature off again in my IDE. Nothing else should have changed.

fschlueter · 2023-12-07T10:45:12Z

I am using a VS code plugin which should just remove ws in lines which I have modified anyway (its not working as promised but if you find one which works let me know)

fschlueter · 2023-12-07T10:46:02Z

I just merged it. Did not feel like waiting longer ..

anelles · 2023-12-07T10:49:18Z

Hmm, reading the history here, @fschlueter may just have avoided (but just barely :) ) the three strikes rule of only merging after 24 hours after final approval. It has been taking long enough, I concur.

fschlueter · 2023-12-07T10:56:19Z

Hmm, reading the history here, @fschlueter may just have avoided (but just barely :) ) the three strikes rule of only merging after 24 hours after final approval. It has been taking long enough, I concur.

😇

sjoerd-bouma added 9 commits August 11, 2022 16:01

added block offset fitter for RNO-G

5847398

fix docstrings

27babda

changed block offset fit function to standalone

37cd070

fixed bug (dt instead of sampling rate)

9bc2e3f

Merge branch 'develop' of github.com:nu-radio/NuRadioMC into feature/…

607dc8f

…block_offsets

change to smarter minimization

c005726

store fitted block offsets as a channelParameter, simplify interface

4f423f7

update changelog

256f21f

Merge branch 'develop' of github.com:nu-radio/NuRadioMC into feature/…

14e393f

…block_offsets

cg-laser reviewed Aug 1, 2023

View reviewed changes

forgot to pass through optional kwarg

57f5a44

shallmann approved these changes Sep 6, 2023

View reviewed changes

shallmann reviewed Sep 6, 2023

View reviewed changes

NuRadioReco/modules/RNO_G/channelBlockOffsetFitter.py Outdated Show resolved Hide resolved

shallmann previously approved these changes Sep 6, 2023

View reviewed changes

sjoerd-bouma added 2 commits September 12, 2023 16:15

remove commented import

162f014

Merge branch 'develop' of github.com:nu-radio/NuRadioMC into feature/…

8c9bc68

…block_offsets

sjoerd-bouma dismissed shallmann’s stale review via 8c9bc68 September 12, 2023 14:15

fschlueter reviewed Sep 13, 2023

View reviewed changes

sjoerd-bouma added 5 commits September 25, 2023 13:04

treat offsets given as dict correctly

3d58f05

save the microseconds

98f51e4

add different block offsets fit options to mattak reader and store th…

8a77811

…e offsets

change *= to prevent error due to Type change from int to float

87a505d

store offsets only if they are actually computed

2384323

fschlueter previously approved these changes Sep 25, 2023

View reviewed changes

sjoerd-bouma added 2 commits September 26, 2023 16:36

make blockoffsetfitter faster by setting maxiter=2 by default

ba3b1b2

deprecate old baseline correction in readRNOGDataMattak

80ef76d

sjoerd-bouma dismissed fschlueter’s stale review via 80ef76d September 26, 2023 15:01

remove inaccessible old baseline correction code

291433e

fschlueter reviewed Sep 26, 2023

View reviewed changes

sjoerd-bouma added 2 commits September 29, 2023 19:11

increase default number of iterations to 5

30726a0

make blockoffsetfitter a private attribute to reduce clutter in module

69f6fda

sjoerd-bouma requested a review from fschlueter October 16, 2023 08:23

fschlueter previously approved these changes Nov 27, 2023

View reviewed changes

Merge branch 'develop' of github.com:nu-radio/NuRadioMC into feature/…

c84a962

…block_offsets

sjoerd-bouma dismissed fschlueter’s stale review via c84a962 November 30, 2023 21:24

sjoerd-bouma requested a review from fschlueter December 1, 2023 10:31

fschlueter approved these changes Dec 7, 2023

View reviewed changes

fschlueter merged commit 778d5cd into develop Dec 7, 2023
9 checks passed

anelles deleted the feature/block_offsets branch February 14, 2024 07:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/block offsets #550

Feature/block offsets #550

sjoerd-bouma commented Jul 10, 2023

cg-laser left a comment

shallmann left a comment

fschlueter Aug 30, 2023

sjoerd-bouma Sep 25, 2023

fschlueter Aug 30, 2023

fschlueter Aug 30, 2023

sjoerd-bouma Sep 25, 2023

fschlueter commented Sep 13, 2023

sjoerd-bouma commented Sep 25, 2023

fschlueter commented Sep 25, 2023

sjoerd-bouma commented Sep 26, 2023

fschlueter Sep 26, 2023

sjoerd-bouma Sep 29, 2023

fschlueter commented Sep 26, 2023

fschlueter commented Sep 26, 2023

fschlueter commented Sep 26, 2023

sjoerd-bouma commented Sep 29, 2023

sjoerd-bouma commented Dec 1, 2023

fschlueter commented Dec 7, 2023

fschlueter commented Dec 7, 2023

anelles commented Dec 7, 2023

fschlueter commented Dec 7, 2023

		return wfs - baseline_traces

		blockoffsetfitter = channelBlockOffsets()

Feature/block offsets #550

Feature/block offsets #550

Conversation

sjoerd-bouma commented Jul 10, 2023

cg-laser left a comment

Choose a reason for hiding this comment

shallmann left a comment

Choose a reason for hiding this comment

fschlueter Aug 30, 2023

Choose a reason for hiding this comment

sjoerd-bouma Sep 25, 2023

Choose a reason for hiding this comment

fschlueter Aug 30, 2023

Choose a reason for hiding this comment

fschlueter Aug 30, 2023

Choose a reason for hiding this comment

sjoerd-bouma Sep 25, 2023

Choose a reason for hiding this comment

fschlueter commented Sep 13, 2023

sjoerd-bouma commented Sep 25, 2023

fschlueter commented Sep 25, 2023

sjoerd-bouma commented Sep 26, 2023

fschlueter Sep 26, 2023

Choose a reason for hiding this comment

sjoerd-bouma Sep 29, 2023

Choose a reason for hiding this comment

fschlueter commented Sep 26, 2023

fschlueter commented Sep 26, 2023

fschlueter commented Sep 26, 2023

sjoerd-bouma commented Sep 29, 2023

sjoerd-bouma commented Dec 1, 2023

fschlueter commented Dec 7, 2023

fschlueter commented Dec 7, 2023

anelles commented Dec 7, 2023

fschlueter commented Dec 7, 2023