Diamonds in a Sea of Silver; Handpicked Resources for Data Science

Resources for data science are anything but scarce. However, finding concise, to-the-point, useful, and thought-provoking resources isn’t as common as one might think. Below, I’ve curated a list of resources that I find succinct, straightforward, practical, and stimulating. The list follows, roughly, the order in which research might be conducted and covers a broad scope of data science topics, from hypothesis testing to NLP and machine learning.

This post will be updated periodically as I discover new gems.


Machine learning

Lantz, B. (2013). Machine learning with R: learn how to use R to apply powerful machine learning methods and gain an insight into real-world applications.
A great source to start with. Its explanation , for example about KNN, are very easy to understand.

Data Sources for Cross-Cultural Research on Threats

Big Data for Psychology

Sample Size Determination and Power Analysis

Random Variables and Probability Distributions

Software Tutorials

Data Preprocessing

Phylogenetic Non-Independence

Presenting the Results