Improving performance of dynamic characterization #57

TimoDiepers · 2024-07-17T06:12:25Z

The current implementation of the dynamic characterization does not scale well. The main issue is that a pd.Series is created repeatedly, at several points, while characterizing each row of the inventory DataFrame.

Instead of passing pd.Series and DataFrames around for every row, I switched to namedtuples. In an example case, this reduced the computing time from ~30s to <1s.
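To illustrate the idea, here is a minimal sketch of the two row-handling styles. It is not the code from this PR: the column names (`date`, `amount`, `flow`), the `CharacterizedRow` type, both `characterize_*` functions, and the constant `factor` are hypothetical stand-ins for the real characterization logic.

```python
from collections import namedtuple

import pandas as pd

# Hypothetical inventory layout; the column names are illustrative only.
inventory = pd.DataFrame(
    {
        "date": pd.to_datetime(["2024-01-01", "2025-01-01", "2026-01-01"]),
        "amount": [1.0, 2.0, 3.0],
        "flow": [10, 10, 11],
    }
)

# Lightweight record used in place of a per-row pd.Series.
CharacterizedRow = namedtuple("CharacterizedRow", ["date", "amount", "flow"])


def characterize_series(row, factor=2.0):
    """Series-based style: builds a new pd.Series for every row."""
    return pd.Series(
        {"date": row["date"], "amount": row["amount"] * factor, "flow": row["flow"]}
    )


def characterize_tuple(row, factor=2.0):
    """namedtuple-based style: plain attribute access, no pandas objects."""
    return CharacterizedRow(date=row.date, amount=row.amount * factor, flow=row.flow)


# iterrows() yields one pd.Series per row, and the function builds another one.
slow = pd.DataFrame([characterize_series(row) for _, row in inventory.iterrows()])

# itertuples() yields namedtuples; the DataFrame is assembled once at the end.
fast = pd.DataFrame(
    [characterize_tuple(row) for row in inventory.itertuples(index=False)]
)
```

Both variants produce the same values; the second avoids constructing pandas objects inside the loop, which is the kind of overhead described above. The actual speedup depends on the inventory size and on the real characterization functions.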

I've also noticed that the tests for dynamic characterization seem to be incomplete, although some preparation has been done. I can try to add them, so I'll keep this PR as a draft for now.

TimoDiepers added 4 commits July 17, 2024 07:15

switch from dataframe to namedtuple

ee8d23d

start (re-)writing tests

3ecbd1d

specifying date dtype and sort for unambiguity

d411d23

add test

30bca71

TimoDiepers marked this pull request as ready for review July 19, 2024 07:29
