Central data and more #3

Open
wants to merge 17 commits into base: florian

FlorianJacta
Member

I ran the entire dataset through the augmentation process, and the result is now committed to the repo. You can run the app as is; there is no need to change the path or download data. I stored the file in Parquet, a compressed columnar format that is usually smaller and faster to load than CSV.

Here are the columns of movies now:
movieId,title,genres,description,directors,leadCast,keywords,Bagofwords,rating,Nb ratings

As you can see, this dataframe has new attributes. Having this central dataset simplified other parts of the application code (such as process_title).

The page now shows new elements such as the genres, the description, the lead cast, and the directors.

image

I parallelized two parts of the code: the one that generates the augmented_dataset and the one that makes recommendations for the search page.
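For readers who want a feel for this kind of parallelization, here is a minimal sketch using the standard library's `concurrent.futures`. It is not the code from this PR: the `augment` and `augment_all` functions, the sample records, and the way `Bagofwords` is built are all assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def augment(movie):
    """Hypothetical per-movie step: return the record with one extra field."""
    # Assumption: the bag of words is just the title plus the genres.
    words = [movie["title"], movie["genres"].replace("|", " ")]
    return {**movie, "Bagofwords": " ".join(words).lower()}


def augment_all(movies, max_workers=4):
    """Apply augment to every record in parallel, preserving input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(augment, movies))


# Hypothetical sample records.
movies = [
    {"movieId": 1, "title": "Movie A", "genres": "Comedy|Drama"},
    {"movieId": 2, "title": "Movie B", "genres": "Action"},
]
augmented = augment_all(movies)
print(augmented[0]["Bagofwords"])  # movie a comedy drama
```

Threads suit steps that mostly wait on I/O, such as fetching metadata. For CPU-bound work, `ProcessPoolExecutor` offers the same `map` interface but runs the function in separate processes.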

I'll let you look through the code to see what else has changed!

@FlorianJacta
Member Author

Of course, don't hesitate to give me feedback or other suggestions. These are just some suggestions that I implemented!
