Le Nguyen The Dat bio photo

Le Nguyen The Dat

~ Data Science & Engineering

Twitter Facebook LinkedIn Github

All Posts

2018

Moving to Medium

Hi there, if you are one of the 5 random people who have been reading my blog here, you probably realized that it has not been updated for over a year now. I...

2017

Data team workflows

Introduction Choosing the right tools and workflows that fit well with your team is a critical task but oftentimes get overlooked. This article demonstrat...

Engineering thought: how to move fast?

What is “moving fast”? Speed is relative. It is hard to measure, especially when there is hardly anything similar to compare with. In engineering, it doe...

2015

My OS X Setup

This is my OS X setup. There are many like it, but this one is mine. Alfred v2 - alfredapp.com I have been using Alfred for a good while as a Spotlight alte...

The data team.

What are they, where do they fit in? The data team is probably the most impactful team in your entire company. Their products either directly help the key...

Poker 101.

Note: I am not a pro poker player - I love statistics and calculating probabilities however. I also love playing Texas hold’em Poker with friends as well a...

Google Analytics Referral Spam.

/rant If you are using or planing to use Google Analytics for visitors tracking purpose on your website, you probably want to have another closer look int...

My favorite technical blogs.

Tech Companies / Startups Engineering: Instagram Airbnb Netflix Etsy Programming / Software Engineering: The Changelog Coding Horror Mach...

Docker cache and apt-get update.

If you ever run into such error below when building your Docker image… E: Failed to fetch http://archive.ubuntu.com/ubuntu/pool/main/l/linux/linux-libc-d...

SEA Games History.

In light of the current SEA Games 2015 in Singapore, below is a Tableau Public’s vizualization of all SEA Games Results since the first ever SEA Games 1959 (...

Google AdSense API with Python.

Following my last blog post on Youtube Analytics API, this post will be about Google AdSense (Management) API. Prerequisites: You will need python 2.7 and...

Youtube Analytics API with Python.

In the past few days, I’ve been working on retrieving data from various Google Products with their APIs, I figured it would be helpful (at least for myself!)...

Minimal Tmux Cheatsheet.

It’s important to use tmux to run time-consuming scripts on remote servers, especially when they are critical and/or my internet connection is not very relia...

Write your resume the programmer's way

Came across this gem - jsonresume.org while trying to update my resume and I decided to give it a try. Turned out, it was very easy to setup and to use on OS...

2014

Moving away from blogger

That’s it, I’ve had enough. Blogging with Blogger is simply inefficient and annoying as hell. I’m glad I made this decision. Below are the steps that I’...

Redcat

Started off as an experiment: 1-man job, single-node Redshift cluster, a bunch of tables for some big joins...Redcat is now expanding into a 6-node...

Use MySQL with SSH Tunnelling, AutoSSH

Just some notes for myself:Apparently SSH Tunnelling works better than SSL for secure connection with MySQL server...AutoSSH is to keep SSH session always al...

2013

Haskell, GHC, Cabal, and all that jazz

Update: for OS X, do have a look at http://ghcformacosx.github.io/As a (somewhat) newcomer to Haskell, it's super hard for me to fully understand about ...

Web scraping tool

A pretty neat tool.$ xidel https://www.google.com.sg/search?q=web+scraping+tools -f //a -e //title

Hi!

++++++++++[>+++++++>++++++++++>+++>+<<<<-] >++.>+.+++++++..+++.>++.<<+++++++++++++++.>.+++.------.--------.>+.&...