Some changes to our daily open data "Green domains" snapshots - introducing a 12 month cut off, and where to get bespoke help using the data| Green Web Foundation
Earlier this week, we published an update to our datasets website, that contains our latest daily snapshot of our Green Domains dataset, along with the recent official 1.0 release of the Real Time Cloud Metadata dataset that we work on with the Green Software Foundation, in their corresponding working groups. This post explains how you can use them.| Green Web Foundation
In this project, I will describe in detail the process of making a deepfake video where I swap my head with the head of the Norwegian actor, Aksel Hennie. The post Project Aksel: Deepfake Head Replacement first appeared on Hegnes.| Hegnes
In this post, we’re excited to introduce the Chamber Ensemble Generator, a system for generating realistic chamber ensemble performances, and the corresponding CocoChorales Dataset, which contains over 1,400 hours of audio mixes with corresponding source data and MIDI, multi-f0, and per-note performance annotations. 🎵Audio Examples📝arXiv Paper📂Dataset Download InstructionsGithub Code Data is the bedrock that all machine learning systems are built upon. Historically, researchers app...| Magenta
It’s three years since my 2021 post summarizing what I knew about estimating software tasks. While no major new public datasets have appeared (there have been smaller finds), I have talked to lots of developers/managers about the findings from the 2019/2021 data avalanche, and some data dots have been connected.| The Shape of Code
An independent team of researchers has created a dataset of wordplay puzzles that require users to add or subtract letters from words to identify a phrase. It contains 333 puzzles from 13 categories, such as major cities and food. Researchers can use the dataset to improve multimodal AI systems that| Center for Data Innovation
Researchers at Pohang University of Science and Technology, Seoul National University, and Yonsei University in South Korea have created a dataset of video clips of laughter. It contains nearly 900 clips of audiences laughing during TED talks and sitcom shows, as well as annotations explaining why t| Center for Data Innovation
A compilation of celestial data files - This project provides several datasets in GeoJSON and GeoPackage format of celestial objects...| One world | Projects, maps and coding
A database with geocodes - Complete database of countries and territories, their different country codes under common standards (ISO-3166, GEC...| One world | Projects, maps and coding