Category: Python

From Pandas to Apache Spark’s Dataframe

With the introduction of Window operations in Spark 1.4, you can finally port pretty much any relevant piece of Pandas’ Dataframe computation to the Apache Spark parallel computation framework using Spark SQL’s Dataframe. If you’re not yet familiar with Spark’s Dataframe, don’t hesitate to check out my last article RDDs are the new bytecode of Apache Spark and […]

Sharing PyPi/Maven dependency data

As time is always running out, I don’t think I’ll have time for a while to work again on the data I collected for the last three articles: Going offline with Maven, State of the Maven/Java dependency graph and State of the PyPi/Python dependency graph. So, as it took me a long time to build […]

PyPi dependency graph generated using Gephi

State of the Python/PyPi dependency graph

I usually work in a Java/Maven environment, so when I explain to people that Python also has a package manager (a bit less heavy than Maven) and that it works pretty well, I always have to answer the same question: “OK, but how does it solve the transitive dependency hell?” Also known […]

New Year’s Python Meme 2012

1. What is the coolest Python application, framework or library you have discovered in 2012? Mainly for APPARTINFO, but not only, I’ve been using every single part of Django, and this framework is still as awesome as usual. But as I must talk about what I discovered in 2012, I have to talk about some […]

Snow Leopard and Qt/PyQt 4.8.x won’t work

If you try to install the latest version of Qt, 4.8.x, even with Homebrew, you may end up having a surprise like this: ImportError: dlopen(/usr/local/lib/python/PyQt4/QtWebKit.so, 2): Symbol not found: _kCFWebServicesProviderDefaultDisplayNameKey Referenced from: /Library/Frameworks/QtWebKit.framework/Versions/4/QtWebKit Expected in: /System/Library/Frameworks/CoreFoundation.framework/Versions/A/CoreFoundation This comes precisely from a Qt issue that doesn’t seem likely to be resolved anytime soon, so […]

Handle Celery-dependent tests in Django and with django-jenkins

So one of these days you’re going to realize you need tests, and that “maybe” you also need to test components that depend on several Celery tasks. Well, to help make that day more productive and less painful, here are a few tips. First, to make it work with Django-celery, a pretty […]

Power and control

In software development, as in many martial arts, you can become strong fairly quickly. You can set yourself goals (a belt, a victory / mastering a technology or completing a personal project) and reach them quickly, depending on the language, the master and the commitment you put into it. Some languages, just like some martial arts, push you to […]

How to be a happy programmer (with Python)? 2/3

In this series on the Python “features” that make me happy, I began last time with two concepts, the with statement and list comprehensions. Now I’m going to talk about multiple assignments and import aliases. Multiple assignments: It’s a simple idea that lets you return a series of values and, on the other end, assign those […]
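The two features named in this excerpt can be sketched as follows. This is only an illustration, not the code from the original post: the helper `min_max` and the alias `col` are hypothetical names chosen for the example.

```python
import collections as col  # import alias: a shorter local name for the module


def min_max(values):
    """Return the smallest and the largest item as a tuple (hypothetical helper)."""
    return min(values), max(values)


# Multiple assignment: the returned tuple is unpacked into two names at once.
lowest, highest = min_max([3, 1, 4, 1, 5])

# The alias gives access to everything the module exposes under its full name.
counts = col.Counter("happy")
```

Unpacking works with any iterable of the right length, so the same pattern also covers swapping two values with `a, b = b, a`.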

Using TOR with Python

There are many occasions where you may be limited by using your own IP address. I will obviously only refer to “rightful” cases where you need to use different IP addresses within a very short lapse of time. Let’s say you want to test your website’s localization functionality, or just access it using many different IP addresses […]

How to be a happy programmer (with Python)? 1/3

I’ve just watched Hilary Mason’s talk at PyCon 2011: http://pycon.blip.tv/file/4878710/ And that got me thinking about all the Python constructs that make my day better, so I decided to make a list of them and their meaning. With: The with keyword is the equivalent of the whole try, catch, finally triplet in Java to […]
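As a rough sketch of the comparison made in this excerpt (not the example from the original post), here is the same file handling written twice: once with an explicit try/finally and once with the with statement. The temporary file path and the variable names are assumptions made for the illustration.

```python
import os
import tempfile

# A throwaway file path, used only for this illustration.
path = os.path.join(tempfile.mkdtemp(), "demo.txt")

# Explicit form: the finally clause guarantees the file is closed,
# even if the write raises an exception.
f = open(path, "w")
try:
    f.write("hello")
finally:
    f.close()

# with form: the file object is a context manager, so it is closed
# automatically when the block exits, normally or through an exception.
with open(path) as g:
    content = g.read()
```

Both forms release the file handle on every exit path; the with version simply keeps the cleanup logic inside the object being used instead of repeating it at each call site.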