A corpus of approximately 180 million words in European Portuguese, created by the Processamento Computacional do Português project after the signing of a protocol between the Portuguese Ministry of Science and Technology (MCT) and the newspaper PÚBLICO in April 2000.
Alphabet Soup is a project which attempts to determine a number of things about the shapes of letters in several different writing systems. First, it hypothesizes a set of basic building blocks that all letters are built up from. Second, it hypothesizes a set of rules, a grammar or syntax, which defines how those pieces combine to make different letters.
The egrep program is used to scan files for character strings (e.g. words). Its basic function is to go through a text file line by line and print every line that matches a search pattern or regular expression to standard output.
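The line-by-line matching described above can be sketched in a few lines of Python. This is only an illustration of the idea, not egrep itself: the helper name `grep_lines` is hypothetical, and Python's `re` syntax differs in places from the extended regular expressions egrep accepts.

```python
import re
from typing import Iterable, Iterator


def grep_lines(pattern: str, lines: Iterable[str]) -> Iterator[str]:
    """Yield every line that contains a match for the pattern.

    Hypothetical helper for illustration; uses Python's `re` dialect,
    which is an assumption and not identical to egrep's syntax.
    """
    regex = re.compile(pattern)
    for line in lines:
        # search() matches anywhere in the line, like grep's default behaviour
        if regex.search(line):
            yield line.rstrip("\n")


if __name__ == "__main__":
    sample = ["alpha beta\n", "gamma\n", "beta delta\n"]
    for hit in grep_lines(r"beta|delta", sample):
        print(hit)
```

Any iterable of lines works as input, so an open file object can be passed in place of the sample list.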
Chef is a programming language in which programs look like recipes.
Combining the cuteness of LOLCODE and the cuddliness of Python
DadaDodo is a program that analyses texts for word probabilities, and then generates random sentences based on that. Sometimes these sentences are nonsense; but sometimes they cut right through to the heart of the matter, and reveal hidden meanings.
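As a rough illustration of generating text from word probabilities, here is a minimal first-order Markov chain in Python. The description above does not say how DadaDodo works internally, so the technique, the function names `build_model` and `generate`, and the whitespace tokenisation are all assumptions made for this sketch.

```python
import random
from collections import defaultdict
from typing import Dict, List, Optional


def build_model(text: str) -> Dict[str, List[str]]:
    """Map each word to the list of words observed directly after it.

    Repeated followers stay in the list, so a random pick from it is
    weighted by how often each follower occurred (assumed approach).
    """
    words = text.split()
    model: Dict[str, List[str]] = defaultdict(list)
    for current, following in zip(words, words[1:]):
        model[current].append(following)
    return dict(model)


def generate(model: Dict[str, List[str]], start: str,
             max_words: int = 10, seed: Optional[int] = None) -> str:
    """Walk the model from `start`, choosing a random follower each step."""
    rng = random.Random(seed)
    word = start
    output = [word]
    for _ in range(max_words - 1):
        followers = model.get(word)
        if not followers:
            break  # no observed follower: stop the sentence here
        word = rng.choice(followers)
        output.append(word)
    return " ".join(output)


if __name__ == "__main__":
    model = build_model("the cat sat on the mat and the cat slept")
    print(generate(model, "the", max_words=8, seed=1))
```

Larger input texts give more varied output; with a tiny sample like the one here, the generator mostly replays fragments of the original.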
Can You Learn YAML in Five Minutes?
It's early days in what promises to be the long death of a galactic civilization, so it is going to be hard to discern how things might reshape themselves under this bombardment, or how art and the art-world might re-articulate new models of value.
Markup Syntax and Parser Component of Docutils
A Simple, Extensible Tool for Literate Programming
"Unicode 5.0 encodes exactly 98,884 graphic characters on different planes. Here you can see them all."