Posts

Showing posts from December, 2025

Final Project

Image
  Problem Description: I wanted to explore a dataset that was rich enough to support several different visual analysis types yet simple enough to interpret. And so I selected the diamonds dataset built into R, which has over 53,000 observations and ten variables describing carat weight / cut / color / clarity / price. My research question in general was: How do physical and quality attributes of a diamond affect its price - and what patterns emerge when they are considered individually and together? From that question I sliced the analysis into several connected ideas. I wanted to know how the distribution of diamond prices and carat size varies across cut categories. I also wondered which cut categories command higher prices and if there were significant deviations from the average price across the dataset. Last but not least, I wanted to test whether price was strongly associated with carat weight and how multiple attributes like cut and color affect average price. Such small que...