¿Cuál es el precio del diamante 💎?

Primera competición dentro de Ironhack para ver quien se aproximaba más al valor real. La base de datos consistia en:

las Features:

id: only for test & sample submission files, id for prediction sample identification
price: price in USD
carat: weight of the diamond
cut: quality of the cut (Fair, Good, Very Good, Premium, Ideal)
color: diamond colour, from J (worst) to D (best)
clarity: a measurement of how clear the diamond is (I1 (worst), SI2, SI1, VS2, VS1, VVS2, VVS1, IF (best))
x: length in mm
y: width in mm
z: depth in mm
depth: total depth percentage = z / mean(x, y) = 2 * z / (x + y) Valores:(43--79)
table: width of top of diamond relative to widest point Valores:(43--95)

Resolución:

Aplicación del modelo random forest, tras el analisis inicial de la base de datos.

NOTA: Al eliminar los datos atípicos el modelo mejora las predicciones dentro del código pero al pasarlo a Kaggle empeora los mismos.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
dataset		dataset
README.md		README.md
main.ipynb		main.ipynb