Andrew Thompson was interested in what 10 topics a computer would identify in our daily news. He gathered over 140.000 new articles from the archives of 10 different sources, as you can see in the figure below. In Python, Andrew converted the text of all these articles into a manageable form (tf-idf document term matrix … Continue reading Summarizing our Daily News: Clustering 100.000+ Articles in Python
The Washinton Post is known for the lovely visualizations accompanying their stories. In a recent post, they visualized how long it would take you to get out of the downtown areas of various cities. They compared all the major U.S. cities and examined different leaving times. Unfortunately, I cannot copy the visualizations' text here, but … Continue reading Leaving town at rush hour? Here’s how far you’re likely to get from America’s largest cities.