The data is from Kaggle.
- unbalanced data.
- very few samples.
- the success parameter is vague; it is not clear which parameters affect it.
- very few features, each carrying little information.
- cleaning the data and keeping only the informative features.
- scraping the web for more info about the movies.
- using feature engineering to make the patterns easier to identify.
- basic modeling of the data.
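
As a rough illustration of two of the points above (keeping only informative features, and compensating for unbalanced data), here is a minimal sketch in plain Python. The column names, the toy rows, the `max_missing` threshold, and both helper functions are hypothetical and are not taken from the actual dataset or project code. The weighting formula `n / (k * count)` is the common "balanced" heuristic, used here as an assumption about how the imbalance could be handled.

```python
from collections import Counter


def drop_uninformative(rows, max_missing=0.5):
    """Keep only columns that are mostly present and take more than one value."""
    kept = []
    for col in rows[0].keys():
        values = [r[col] for r in rows]
        present = [v for v in values if v is not None]
        missing_ratio = 1 - len(present) / len(values)
        # Drop columns that are mostly empty or constant: they carry no signal.
        if missing_ratio <= max_missing and len(set(present)) > 1:
            kept.append(col)
    return [{c: r[c] for c in kept} for r in rows]


def balanced_class_weights(labels):
    """Weight each class inversely to its frequency: n / (k * count)."""
    counts = Counter(labels)
    n, k = len(labels), len(counts)
    return {cls: n / (k * cnt) for cls, cnt in counts.items()}


# Hypothetical toy rows; the real dataset's columns are not listed in these notes.
movies = [
    {"title": "A", "budget": 10, "country": "US", "tagline": None, "success": 0},
    {"title": "B", "budget": 50, "country": "US", "tagline": None, "success": 0},
    {"title": "C", "budget": 20, "country": "US", "tagline": "x", "success": 0},
    {"title": "D", "budget": 90, "country": "US", "tagline": None, "success": 1},
]

cleaned = drop_uninformative(movies)
weights = balanced_class_weights([m["success"] for m in movies])
```

In this toy example `country` is dropped because it is constant and `tagline` because it is mostly missing, while the rare positive class receives a larger weight than the common negative one. With very few samples, such weights are only a starting point and should be checked with cross-validation.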