4 private links
Many companies seems to go through a pattern of hiring a data science team only for the entire team to quit or be fired around 12 months later. Why is the failure rate so high?
These stories illustrate common problems that occur with the uncontrolled use of spreadsheets. In many cases, we identify the area of risk involved and then say how we think the problem might have been avoided.
Um corpus de aproximadamente 180 milhões de palavras em português europeu, criado pelo projecto Processamento Computacional do Português após a assinatura de um protocolo entre o Ministério da Ciência e da Tecnologia (MCT) português e o jornal PÚBLICO em Abril de 2000.
yay for brightnets!
"Unicode 5.0 encodes exactly 98,884 graphic characters on different planes. Here you can see them all."
a place to find and store code snippets
really useful tips for mysql/LAMP-based webapp development
DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web.
or why using databases for organising data isn't necessarily a good option - "Maybe the reason flat files work so well is that a file system IS a hierarchical database"
a small but powerful python web framework
geoname.py is a python wrapper around the geonames web service
The geonames.org geographical database is available for download free of charge under a creative commons attribution license.