Pandance provides advanced relational operations for pandas DataFrames, enabling powerful and efficient joins (also known as merges).
Pandance extends the set of standard join operations in pandas (inner, outer, cross, left, right) with:
- fuzzy joins: Match columns with a tolerance. Supports numerical and datetime values.
- inequality joins: Match rows where one column's values are less than or greater than the other column's values.
- theta joins: Specify arbitrary matching conditions on which to join.
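
To make the semantics of these three join types concrete, here is a minimal sketch in plain pandas. It does not use Pandance's API and says nothing about how Pandance implements these joins: it builds every pair of rows with a cross join and then filters, which is a naive reference approach that scales poorly. The column names `a` and `b`, the tolerance value, and the theta condition are all made up for illustration.

```python
import pandas as pd

# Hypothetical example data (not from the Pandance docs)
left = pd.DataFrame({"a": [1.0, 5.0, 9.0]})
right = pd.DataFrame({"b": [1.2, 4.0, 9.05]})

# Naive reference approach: form all 3 x 3 row pairs, then filter.
# Requires pandas >= 1.2 for how="cross".
pairs = left.merge(right, how="cross")

# Fuzzy join: keep pairs whose values match within a tolerance
tol = 0.25
fuzzy = pairs[(pairs["a"] - pairs["b"]).abs() <= tol].reset_index(drop=True)

# Inequality join: keep pairs where the left value is less than the right value
ineq = pairs[pairs["a"] < pairs["b"]].reset_index(drop=True)

# Theta join: keep pairs satisfying an arbitrary condition
# (here, an illustrative one: b is more than twice a)
theta = pairs[pairs["b"] > 2 * pairs["a"]].reset_index(drop=True)

print(fuzzy)
print(ineq)
print(theta)
```

Because the cross join materializes every combination of rows, this approach is only practical for small inputs; it is shown here purely to define what each join returns.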
Pandance is designed with performance in mind, aiming to provide fast implementations whenever possible.
pip install pandance
See the documentation