R’s standard correlation functionality (base::cor) seems very impractical to the new programmer: it returns a matrix and has some pretty shitty defaults it seems. Simon Jackson thought the same so he wrote a tidyverse-compatible new package: corrr! Simon wrote some practical R code that has helped me out greatly before (e.g., color palette’s), but this new package is … Continue reading Simple Correlation Analysis in R using Tidyverse Principles→
It has been twenty years since the first Harry Potter novel, the sorcerer’s/philosopher’s stone, was published. To honour the series, I started a text analysis and visualization project, which my other-half wittily dubbed Harry Plotter. In several blogs, I intend to demonstrate how Hadley Wickham’s tidyverse and packages that build on its principles, such as tidytext (free book), have taken programming in R to an … Continue reading Harry Plotter: Celebrating the 20 year anniversary with tidytext and the tidyverse in R→
Hadley Wickham‘s tidyverse has improved the workflow of analysts / data scientists, makes coding errors less likely and code more transparent. You’ve got to love the figure below, representing a simplified workflow of the average analysis project. The tidyverse provides assistance in each of the stages. Various packages provide functionality to perform analytical tasks more effectively, in … Continue reading tidyverse 101: Simplifying life for useRs→
For those of you unfamiliar with the tidyverse, it is a collection of R packages that share common philosophies and are designed to work together. Most if not all, are created by R-god Hadley Wickham, one of the leads at RStudio. I was introduced to the tidyverse-packages such as ggplot2 and dplyr in my second … Continue reading tidyverse: Example: Trump Approval Rate→
Despite the pandemic, 2020 has been a great year for me. Professionally, I grew into my role as data science product owner. And next to this, I got more and more freelance side gigs. Mostly teaching, but also some consultancy projects. Unfortunately, all my start-up ideas failed miserably again this year, yet I’ll keep trying … Continue reading Reviewing year 4 of paulvanderlaken.com→
TLDR; You can use the corrtable package (see CRAN or Github)! In most (observational) research papers you read, you will probably run into a correlation matrix. Often it looks something like this: In Social Sciences, like Psychology, researchers like to denote the statistical significance levels of the correlation coefficients, often using asterisks (i.e., *). Then … Continue reading Create a publication-ready correlation matrix, with significance levels, in R→