À propos
Before, I was the CEO and…
Activité
-
FLUX.1-Krea-dev, the new best open image generation model? The answer is........ Run your own experiments! I've just done a quick experiment, and…
FLUX.1-Krea-dev, the new best open image generation model? The answer is........ Run your own experiments! I've just done a quick experiment, and…
Aimé par Daniel Vila Suero
-
FLUX.1-Krea-dev, the new best open image generation model? The answer is........ Run your own experiments! I've just done a quick experiment, and…
FLUX.1-Krea-dev, the new best open image generation model? The answer is........ Run your own experiments! I've just done a quick experiment, and…
Partagé par Daniel Vila Suero
-
hf jobs: synthetic data pipelines as a service, at lightning speed ⚡ Here's a real example: > Using Kimi-K2-Instruct to generate a dataset of…
hf jobs: synthetic data pipelines as a service, at lightning speed ⚡ Here's a real example: > Using Kimi-K2-Instruct to generate a dataset of…
Partagé par Daniel Vila Suero
Expérience et formation
Publications
-
datos.bne.es: A library linked dataset
Semantic Web Journal
We describe the datos.bne.es library dataset. The dataset makes available the authority and bibliography catalogue from the Biblioteca Nacional de España (BNE, National Library of Spain) as Linked Data. The catalogue contains around 7 million authority and bibliographic records. The records in MARC 21 format were transformed to RDF and modelled using IFLA (International Federation of Library Associations) ontologies and other well-established vocabularies such as RDA (Resource Description and…
We describe the datos.bne.es library dataset. The dataset makes available the authority and bibliography catalogue from the Biblioteca Nacional de España (BNE, National Library of Spain) as Linked Data. The catalogue contains around 7 million authority and bibliographic records. The records in MARC 21 format were transformed to RDF and modelled using IFLA (International Federation of Library Associations) ontologies and other well-established vocabularies such as RDA (Resource Description and Access) or the Dublin Core Metadata Element Set. A tool named MARiMbA automatized the RDF generation process and the data linkage to DBpedia and other library linked data resources such as VIAF (Virtual International Authority File) or GND (Gemeinsame Normdatei, the authority dataset from the German National Library).
Autres auteursVoir la publication -
Guidelines for Multilingual Linked Data
WIMS '13 Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics
In this article, we argue that there is a growing number of linked datasets in different natural languages, and that there is a need for guidelines and mechanisms to ensure the quality and organic growth of this emerging multilingual data network. However, we have little knowledge regarding the actual state of this data network, its current practices, and the open challenges that it poses. Questions regarding the distribution of natural languages, the links that are established across data in…
In this article, we argue that there is a growing number of linked datasets in different natural languages, and that there is a need for guidelines and mechanisms to ensure the quality and organic growth of this emerging multilingual data network. However, we have little knowledge regarding the actual state of this data network, its current practices, and the open challenges that it poses. Questions regarding the distribution of natural languages, the links that are established across data in different languages, or how linguistic features are represented, remain mostly unanswered. Addressing these and other language-related issues can help to identify existing problems, propose new mechanisms and guidelines or adapt the ones in use for publishing linked data including language-related features, and, ultimately, provide metrics to evaluate quality aspects. In this article we review, discuss, and extend current guidelines for publishing linked data by focusing on those methods, techniques and tools that can help RDF publishers to cope with language barriers. Whenever possible, we will illustrate and discuss each of these guidelines, methods, and tools on the basis of practical examples that we have encountered in the publication of the datos.bne.es dataset.
Autres auteurs -
-
Grupo Incubador de Datos Vinculados de Bibliotecas: Casos de uso
World Wide Web Consortium
En este documento se describe una selección de casos de uso de la comunidad bibliotecaria y de otros sectores afines. El Grupo Incubador de Datos Vinculados de Bibliotecas del W3C los ha reunido y analizado a partir de los informes de distintas organizaciones e individuos. Los casos se han agrupados en ocho grupos temáticos que se describen a continuación. Así mismo, en cada agrupación se presenta un resumen de los casos de uso seleccionados.
-
W3C Library Linked Data Incubator Group: Use Cases
World Wide Web Consortium
Selected use cases and case studies from the library community and related sectors are described in this document. These were gathered and analyzed by the W3C Library Linked Data Incubator Group, based on submissions from different organizations and individuals. Cases have been grouped into eight topical clusters, which are described below. Selected use cases from each cluster have also been summarized.
Projets
-
datos.bne.es
datos.bne.es is an open initiative aimed at enriching the Web of Data with library data from the Spanish National Library. The RDF generation from MARC 21 records was done using the tool MARiMbA, which allows non-technical users to work on the mappings from MARC21 metadata to RDF using different RDFS/OWL vocabularies.
Autres créateursVoir le projet -
Museumsportal Berlin
-
The first shared website between Berlin museums, it enables tourists and those with an interest in culture to gain a unique overview of the vast and diverse museum-landscape within the capital. Information about the services, tours and events at nearly 200 museums, memorials, castles and collections can be found here. Instead of having to trawl through every individual website, Museumsportal Berlin offers comprehensive information about all current and forthcoming exhibitions in Berlin
Langues
-
Español
Bilingue ou langue natale
-
Inglés
Capacité professionnelle complète
-
Alemán
Capacité professionnelle complète
-
Francés
Compétence professionnelle limitée
Plus d’activités de Daniel
-
How to 4x the ROI of a dataset: → Enter Dataset Repurposing Many teams limit themselves by using datasets for a single task ↳ Missing out on…
How to 4x the ROI of a dataset: → Enter Dataset Repurposing Many teams limit themselves by using datasets for a single task ↳ Missing out on…
Aimé par Daniel Vila Suero
-
New research reveals the hidden cultural biases in AI evaluation 🌍 An interesting study by Singh et al. just dropped some eye-opening findings…
New research reveals the hidden cultural biases in AI evaluation 🌍 An interesting study by Singh et al. just dropped some eye-opening findings…
Aimé par Daniel Vila Suero
-
1M repos on Xet, and more on the way! 🚀 📈
1M repos on Xet, and more on the way! 🚀 📈
Aimé par Daniel Vila Suero
-
A key step towards automated AI R&D just landed on Hugging Face: HF Jobs! > Step 1: Use an agent to pick the most promising (model, dataset) pair on…
A key step towards automated AI R&D just landed on Hugging Face: HF Jobs! > Step 1: Use an agent to pick the most promising (model, dataset) pair on…
Aimé par Daniel Vila Suero