For my regression model, I opted for a Random Forest model, chosen for its seamless integration of both numerical and categorical variables, including cut, clarity, and color.
To initiate the process, I undertook data preprocessing, transforming categorical variables (Clarity, Color, Cut) into numeric values using one-hot encoding. Simultaneously, I divided the dataset into training and validation sets.