Linting in fastparquet? #939

Open
yohplala opened this issue Oct 29, 2024 · 3 comments

Comments

@yohplala

Hello,
Could we set up a linting tool (maybe through pre-commit) in fastparquet?
We discussed this topic in issue #720.
At that time, @martindurant, you mentioned:

It would be reasonable to require some standard linting (flake8, black, isort) via pre-commit or otherwise. Obviously, implementing this would change most of the function blocks and make all current PRs very hard to merge - so the timing of formatting introduction is very sensitive.

I raise this topic because I understand that you are working on "significant" changes in fastparquet through #931.
Could this milestone be identified as a "fastparquet 2.0"? (There have been several "2.0" releases recently, notably numpy and pandas.)
If so, do you think this could be an appropriate time to introduce linting?

@yohplala
Author

Side comment:

Maybe it would be worth having a "dev" branch to which contributors can submit pull requests without linting (this lowers the barrier for occasional contributors).

Then the linting step could be set up only when merging the "dev" branch into the "main" branch?

@martindurant
Member

On the prospect of fastparquet 2: I have indeed put significant work into the branch. However, I am now hoping that https://kylebarron.dev/arro3/ will solve the use case for us (kylebarron/arro3#195 - discussion). If this can be done in Rust with solid upstream implementations for all the parquet peculiarities, then maybe the Cython in this project becomes unnecessary and can be dropped. I could, instead, work on making a nicer API and convenience functions in arro3.

About linting: sure, it would be fine. There are not too many things in motion, so it could be done any time (the fastparquet-numpy branch, if it progresses, can apply it later). Ruff seems to have overtaken the other linters in the meantime.

@yohplala
Author

I didn't know about arro3, thanks for the info!
I will be watching it; let's see how it evolves.
