I think it's nice when the code that scrapes data, the data itself, and any sort of web UI code that displays that data all live together in the same repo. And I like them to be deployed together. One example where this is working well for me is this data about California's Board of Parole Hearings results and the corresponding website. Simon W, who has a habit of naming the thing well having coined "git scraping", wrote about and called this pattern "baked data." In the post he lays out a bu...