Streamlining access to tabular datasets stored in Amazon S3 Tables with DuckDB | AWS Storage Blog

Streamlining access to tabular datasets stored in Amazon S3 Tables with DuckDB | AWS Storage Blog
Apache Iceberg + Virtual Knowledge Graphs
A hands-on tutorial here: https://ontopic.ai/en/tech-notes/create-virtual-knowledge-graphs-on-apache-iceberg/
Revolutionizing Data Pipelines: Apache Iceberg Streaming Writes and Batch Reads in C++
The latest PR from Timeplus introduces a groundbreaking MVP-level support for Apache Iceberg, enabling seamless streaming writes and batch reads directly in C++. This development not only enhances dat...
Apache iceberg the Hadoop of the modern-data-stack? — https://blog.det.life/apache-iceberg-the-hadoop-of-the-modern-data-stack-c83f63a4ebb9
#HackerNews #ApacheIceberg #ModernDataStack #Hadoop #DataEngineering #BigData
Exploring Alternatives for Cloud Scale Logging: A Developer's Dilemma
As organizations grapple with the rising costs and complexities of cloud logging solutions, developers are on the hunt for viable alternatives that can seamlessly integrate with modern architectures. ...
https://news.lavx.hu/article/exploring-alternatives-for-cloud-scale-logging-a-developer-s-dilemma
The house at the lake, Teil 3 - The Dashboard Diaries: https://blog.sogeo.services/blog/2025/01/26/house-at-the-lake-03.html #Trino #SQL #datalake #datalakehouse #lakehouse #duckdb #apacheiceberg
Revolutionizing Data Management: The Fusion of DuckDB and Apache Iceberg
Bauplan's innovative approach to integrating DuckDB with Apache Iceberg is reshaping the landscape of data lakehouses. This serverless architecture not only simplifies data management but also enhance...
https://news.lavx.hu/article/revolutionizing-data-management-the-fusion-of-duckdb-and-apache-iceberg
I thought that DLL hell in #dotnet with #nuget was bad. After trying to match #scala versions with #ApacheIceberg and AWS SDK packages, I'm now realizing how good I had it. No documentation on which versions match what. Maven doesn't show dependencies. What a waste of time.
Seriously, this is all trial and error running a test program to see if I get a NoClassDefFoundError.
The house at the lake, Teil 2 - Start your engines: https://blog.sogeo.services/blog/2025/01/12/house-at-the-lake-02.html #Spark #ApacheIceberg #SQL #Datalake #Lakehouse #DuckDB
Snowflakeを凌駕する新星?Apache Icebergで変わるデータ戦略
https://qiita.com/Chuanwei/items/a1062a777de1f6534c41?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items
The house at the lake, Teil 1- Iceberg ahead. Data Lakehouse baby steps. https://blog.sogeo.services/blog/2025/01/05/house-at-the-lake-01.html #ApacheIceberg #Spark #Pyspark #Datalake #Lakehouse
When an opportunity arose to support earlier-stage #openSource projects like #apacheIceberg and #apachePolaris (incubating) while building on the incredible work already underway, I couldn’t pass it up…
I’m beyond excited to announce that I’ll be working as a Lead Developer Advocate for #openSource at Snowflake where I'll be focussing on #apacheIceberg and #apachePolaris (incubating)!
Why this role? Great question, read on
The October issue of #CheckpointChronicle is now out
It covers Ververica's Fluss, #ApacheFlink 2.0, Iggy.rs, Strimzi's support for #ApacheKafka 4.0, tons of OTF material from @vanlightly, Christian Hollinger's write up of ngrok's data platform, nice detail of how SmartNews use #ApacheIceberg with Flink and #ApacheSpark, a good writeup from Sudhendu Pandey on #ApachePolaris, notes from Kir Titievsky on Kafka's Avro serialisers, and much more!
Apache Iceberg Live Crash Course, Register for free!
Register Here: https://bit.ly/am-2024-iceberg-live-crash-course-1
HANDS-ON WITH APACHE ICEBERG FROM YOUR LAPTOP
GET HANDS ON WITH KAFKA CONNECT
New Blog about using Kanfa Connect to ingest into Nessie/Apache Iceberg
Useful writeup from Alex Merced: A Deep Dive into the Concept and World of #ApacheIceberg Catalogs https://medium.com/data-engineering-with-dremio/a-deep-dive-into-the-concept-and-world-of-apache-iceberg-catalogs-0697e8d18a8b
My demo for **#kafkasummit** next week is coming together. **Michael Drogalis**' **ShadowTraffic** is the perfect tool for easily generating realistic data into **#apachekafka**, from where I'm reading it with **#apacheflink** and on into **#ApacheIceberg** :)
All with SQL, and not a line of Java in sight!
Hope to see you there
** Here be Dragons^H^H Stacktraces — Flink SQL for Non-Java Developers**
*Tue Mar 19*
* 3:30 PM*
* Breakout room 2*
First **#ApacheIceberg** summit announced:
https://tabular.io/blog/announcing-first-iceberg-summit/
CfP is open now, until April 12th https://sessionize.com/Iceberg-Summit-2024
**#dataEngineering** **#openTableFormat**