API · Big Data · Books · Data Mining · NLP · Python · Text Analytics · Text Mining

Mastering Social Media Mining with Python

Great news, my book on data mining for social media is finally out! The title is Mastering Social Media Mining with Python. I’ve been working with Packt Publishing over the past few months, and in July the book has been finalised and released. Links: ebook and paperback on Packt Publishing (the publisher) ebook and paperback… Continue reading Mastering Social Media Mining with Python

Big Data · Engineering · Python

Adding Slack Notifications to a Luigi Pipeline in Python

In a previous article, I’ve described how to build a data pipeline in Python using Luigi, a workflow manager written in Python and open sourced by Spotify. I also had the opportunity to give a short talk about Luigi at the local PyData London meetup (see slides). One of the nice features of Luigi is… Continue reading Adding Slack Notifications to a Luigi Pipeline in Python

Big Data · Data Mining · Engineering · Python

Building Data Pipelines with Python and Luigi

As a data scientist, the emphasis of the day-to-day job is often more on the R&D side rather than engineering. In the process of going from prototypes to production though, some of the early quick-and-dirty decisions turn out to be sub-optimal and require a decent amount of effort to be re-engineered. This usually slows down… Continue reading Building Data Pipelines with Python and Luigi