Jared Wilber posted this great walkthrough where he codes a simple R data pipeline using purrr and tidyr to train a large variety of models and methods on the same base data, all in a non-repetitive, reproducible, clean, and thus tidy fashion. Really impressive workflow!
Peter Cottle built this great interactive Git tutorial that teaches you all vital branching skills right in your browser. It's interactive, beautiful, and very informative, introducing every concept and Git command in a step-by-step fashion. Have a look yourself: https://learngitbranching.js.org/ Here's the associated GitHub repository for those interested in forking. The tutorial includes many levels … Continue reading Learn Git Branching: An Interactive Tutorial
PyData is famous for it's great talks on machine learning topics. This 2019 London edition, Vincent Warmerdam again managed to give a super inspiring presentation. This year he covers what he dubs Artificial Stupidity™. You should definitely watch the talk, which includes some great visual aids, but here are my main takeaways: Vincent speaks of … Continue reading Artificial Stupidity – by Vincent Warmerdam @PyData 2019 London
The Bayesian method is the natural approach to inference, yet it is hidden from readers behind chapters of slow, mathematical analysis. Nevertheless, mathematical analysis is only one way to "think Bayes". With cheap computing power, we can now afford to take an alternate route via probabilistic programming. Cam Davidson-Pilon wrote the book Bayesian Methods for … Continue reading E-Book: Probabilistic Programming & Bayesian Methods for Hackers
Survival of the Best Fit is a webgame that simulates what happens when companies automate their recruitment and selection processes. You - playing as the CEO of a starting tech company - are asked to select your favorite candidates from a line-up, based on their resumés. As your simulated company grows, the time pressure increases, … Continue reading Survival of the Best Fit: A webgame on AI in recruitment
Claus Wilke wrote the Fundamentals of Data Visualization - a great resource that's definitely high on my list of recommended data visualization books. In a recent post, Claus shared the link to a GitHub repository where he hosts some of the R programming code with which Claus made the graphics for his dataviz book. The … Continue reading Recreating graphics from the Fundamentals of Data Visualization