Connect ClickHouse to AWS's new S3 Table Buckets using Iceberg, run real-time SQL on Parquet data, and build data pipelines in Altinity.Cloud. The post Real-time Queries on AWS S3 Table Buckets in ClickHouse® appeared first on Altinity | Run open source ClickHouse® better.| Altinity
ART CHAT #30 Aoraki/Mount Cook, New Zealand 28 May 2023 It’s been a long time since I’ve shared any new additions to my New Zealand trip sketchbook. The last time was Oamaru & Lake Tekapo on April 30th. There’s been... The post ART CHAT #30 – Aoraki/Mount Cook, NZ first appeared on Write of the Middle.| Write of the Middle
lakeFS Enterprise offers a fully standards-compliant implementation of the Apache Iceberg REST Catalog, enabling Git-style version control for structured data at scale. This integration allows teams to use Iceberg-compatible tools like Spark, Trino, and PyIceberg without any vendor lock-in or proprietary formats. By treating Iceberg tables as versioned entities within lakeFS repositories and branches, users […] The post Versioned Data with Apache Iceberg Using lakeFS Iceberg REST Catalog ap...| Git for Data – lakeFS
A behind-the-scenes look at the design decisions, architecture, and lessons learned while bringing the Apache Iceberg REST Catalog to lakeFS. When we first announced our native lakeFS Iceberg REST Catalog, we focused on what it means for data teams: seamless, Git-like version control for structured and unstructured data, at any scale. But how did we […] The post How We Built Our lakeFS Iceberg Catalog appeared first on Git for Data - lakeFS.| Git for Data – lakeFS
Wikimedia Commons now uses Structured Data on Commons (SDC) to make media information multilingual and machine-readable. A core part of SDC is the ‘depicts’ statement (P180), which identifies items clearly visible in a file. Depicts statements are crucial for MediaSearch, enabling it to find relevant results in any language by using Wikidata labels, as well…| addshore
Real-time data lakes just got real. Altinity.Cloud for ClickHouse® now supports managed Iceberg & compute swarms for fast, scalable analytics. The post Altinity.Cloud Launches Managed Iceberg with Antalya Compute Swarms appeared first on Altinity | Run open source ClickHouse® better.| Altinity
Recent developments in ClickHouse® and Project Antalya elevate Iceberg support to a new level - matching and in some cases outperforming MergeTree, while enabling cheap object storage and scalable, shared data systems. The post The Future Has Arrived: Parquet on Iceberg Finally Outperforms MergeTree appeared first on Altinity | Run open source ClickHouse® better.| Altinity
Perfect for ClickHouse®, Altinity Ice is an open source tool that makes it easy to set up Iceberg REST catalogs and load data with just two commands.| Altinity | Run open source ClickHouse® better
DuckDB has gained a new feature in preview that allows querying Iceberg data in AWS S3 Tables. Setting up an S3 Table: There are multiple steps which need to be performed to set up an S3 Table that can then be queried with tools like DuckDB. As the ...| tobilg.com
TL;DR: Shared a notebook showing the results of Iceberg metadata conversion to Delta in OneLake. I’ve been following the evolution of Iceberg shortcuts to OneLake and I’m genuinely impressed with how much energy the engineering team has invested in making it more robust. It is a good idea to read the documentation. Essentially, XTable … Continue reading "Stress Testing Iceberg shortcut in Onelake"| Small Data And self service
This is more or less the industry consensus on how a Lakehouse architecture should look in 2025. By now, it’s become clear that Parquet is the de facto standard for storing data, and using an object store to separate storage from compute makes a lot of sense. Another interesting development is how vendors want to … Continue reading "An Excel User’s Perspective on Lakehouse Architecture"| Small Data And self service
Discover Project Antalya: Experience ClickHouse® analytics on Iceberg storage, cutting costs by 90% and delivering up to 100x faster queries.| Altinity | Run open source ClickHouse® better
This blog will talk about Iceberg table support and why it both matters and doesn't.| dbt Developer Hub Blog
Time to get back to my New Zealand trip posts! There are still a few to cover yet and I'd really like to get them completed this year, so let's pick up at Day 16. We had arrived at Lake Tekapo from Dunedin the day before. Today's post is very photo heavy so grab a cuppa| Write of the Middle
Select Apache Iceberg or Delta Lake’s UniForm based on business goals. The right infrastructure is vital for efficient data management and analysis.| Dremio