The Linguateca project (a distributed center for Portuguese language resources, 2000-2008) has SINTEF ICT as the main node and links around 20 researchers in 6 different locations (Lisbon, Odense, Porto, Braga, São Carlos and Coimbra). Its philosophy is described as the IRE model (Information - Resources - Evaluation).
The main site is www.linguateca.pt, which has received more than 6 million visits since its creation and offers Web access to several Portuguese resources. Linguateca has organized several evaluation contests to foster research and development of tools dealing with Portuguese, such as the Morfolimpíadas (morphological analysis), the Portuguese part of CLEF (cross-lingual information retrieval and question answering) and HAREM (named entity recognition).
The project has produced a large number of publications (ca. 250) and presentations (ca. 100), available from www.linguateca.pt/documentos/.
Research in the scope of Linguateca encompasses ontology discovery from text, terminologically-aware IR, human factors in Web search and resource creation, example-based MT and issues in comparable and parallel corpora, as well as evaluation metrics and tasks.