I've modeled a lot of my tennis after Roger's -- the service motion, the attacking forehand, and even a valiant attempt at going one-handed on the backhand. So it probably doesn't come as a surprise that I wanted to analyze his match statistics, but I was struggling to find a pre-existing dataset. I used this as an opportunity to learn how to web-scrape, downloading his match data from the ATP website. The only insights I gleaned from the analysis was how dominant his forehand his (shocking I know), but learning web-scraping showed that I could find data everywhere. If you're curious, take a closer look here.
Tools/Concepts:
Python, web-scraping