pybii+dataframes #30
Replies: 6 comments
-
re: (1), i'd say it would fit in pybbi very well. One question is though - would we like pybbi to depend on bioframe or vice versa? I'd vote for the former, as bioframe is a pure python dependency, while pybbi requires compiling. |
Beta Was this translation helpful? Give feedback.
-
good suggestions! Re: (1), that was my thinking as well. any thoughts @nvictus ? |
Beta Was this translation helpful? Give feedback.
-
Discussed this with @gfudenberg and @GarrettNg. Conclusion was that this functionality would best start its life in the bioframe sandbox. It could mature into bioframe proper, or if we decide to keep bioframe lean, it may graduate to its own project. For several reasons, I think pybbi should try to remain a low-level access API over what UCSC's source code provides to interface to numpy/pandas. There are already a couple fileops implementations in bioframe that allow multiple bbi "engine" backends (pybbi and pybigwig) and I think this pattern is good because different engines have strengths and weaknesses (e.g. some UCSC ops just fail on ENCODE bigWig files). Neither of these libraries are hard dependencies in bioframe, and an error message indicates to the user that at least one of them needs to be installed -- this was modeled off the pyarrow/fastparquet soft dependence in pandas. |
Beta Was this translation helpful? Give feedback.
-
in |
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
@nvictus , could you move this discussion into bioframe? After some thoughts, we may need three different functions indeed.
|
Beta Was this translation helpful? Give feedback.
-
There are two patterns that come up a lot at the intersection of pybbi/bioframe:
interval_df
(then often visualized as a heatmap)interval_df
Is this something that would make sense in pybbi? in bioframe? elsewhere?
Example for (1) below:
Beta Was this translation helpful? Give feedback.
All reactions