📊 Awesome Data Resources For July 🚀
Here are awesome data resources just for you! 🎉
Greetings! I hope your are well! ☀️ This is the July 2020 version of the monthly email with great data science resources from Jeff Hale.
If this message was forwarded to you, you can go ahead and subscribe at dataawesome.com - and give a thank you to the sender! 😀
Everything below is designed to help you sharpen your data skills and everything has a lot of awesome! Let's get to it! 🚀
Awesome articles 💻
Technology Holy Wars are Coordination Problems by Gwern Branwen is an interesting look at some of the dynamics around arguments for open source software. His synopsis: Flamewars over platforms & upgrades are so bitter not because people are jerks but because the choice will influence entire ecosystems, benefiting one platform through network effects & avoiding ‘bitrot’ while subtly sabotaging the rest through ‘bitcreep’. ⚔️
What Productivity Looks Like to Me by Ali Spittel is full of excellent time management tips. Ali mentions the Flowtime Technique, which I like more than the better-known Pomodoro technique that I find too constrictive. By the way, Flowtime is based on ideas by Mihaly Csikszenthmihalyi, whose book Flow is a fabulous. ⏳
Awesome visualization website 🖼
Naftali Harris has an awesome interactive website for visualizing clustering algorithms. Websites like this are so great for learning new concepts! 😀 Here's a an example with k-means after a number of iterations.

Awesome keyboard shortcut ⌨️
Switch between open windows of the same program on a Mac with Cmd + ` (that's a backtick). I find this really valuable when working with multiple browser windows open. ⭐️
Awesome virtual conference 🖥
PyData LA is coming up Saturday, August 22. I've attended, given a lightening talk, and volunteered at a few in-person PyData conferences in the past and had great experiences. Lots of interesting content and good conversation. The conference is free, but you need to register here.
Awesome people to follow 📣
I mentioned Ali Spittel above. She teaches at General Assembly, writes about programming, and co-hosts the LadyBug Podcast. (Disclosure, I've also taught at GA.) She is a thought leader in the coding education space and shares lots of great stuff on Twitter.
Hadley Wickham is the name most closely associated with the R language. Hadley works at RStudio and is creator of the famous Tidy Data article, Tidyverse packages, and lots of other content. If you're on Twitter, follow him so you're in the know. In fact, I found the article on Holy Wars above because he retweeted it. 🐦
Speaking of Twitter, I'm trying to be more active there, so give me and let me know you found me here and I'll follow you back. 👍
What I've been working on 🛠
It's been a busy month! Here are a few resources I made that you might find useful:
In Sidetable Gives You the Pandas Methods You Didn't Know You Needed I show how to use the new Python library. You can read the article and run all the code in the article on DeepNote or read it on Medium. 🏓
In my research for The Most In-Demand Tech Skills for Machine Learning Engineers I found interesting differences in job listing technologies for different roles. In the article I explain differences between machine learning engineers and data scientists, data engineers, and data analysts. I wrote it because I wish I had found something like it when I was starting out. 📊
My Data Skills Book Bundle of 4 ebooks is available for a savings of over 60%! For $29 USD you get Memorable Python, Memorable Pandas, Memorable SQL, and Memorable Docker. You'll learn the foundations of Python programming, data manipulation with the pandas library, SQL with PostgreSQL, and how to use Docker containers. 🚀