Keynote:
A series of fortunate events,or is it?
Marcin Zukowski (Dijkstra Fellow) is a computer scientist and entrepreneur, known for co-founding the high-performance analytical database company, Snowflake, and for his work on vectorized query execution during his PhD at CWI. He is in town because the day before this meetup he will receive the Dijkstra Fellowship.
"In this keynote Marcin will talk about his professional journey, starting with MonetDB, through Vectorwise, to Snowflake. He'll describe some of the key events on the way, the main lessons and biggest mistakes, and discuss how his path was shaped by hard work, amazing people, and sheer luck."
Panel: What are important data systems problems, ignored by research?
Viktor Leis (Technical
University Munich)
Andy Pavlo is an Associate Professor with Indefinite Tenure of Databaseology in the Computer Science Department at Carnegie Mellon University. He knows some pile about databases, and is well known for leading research on self-driving databases and his engaging online lectures.
Allison Lee has spent the last two decades building commercial database systems. She is a founding engineer at Snowflake and contributed to many of its foundational components, including the query optimizer, metadata management, language extensibility, and data protection. She currently leads the database engineering team.
Hannes Mühleisen is a creator of the DuckDB database system and Co-founder and CEO of DuckDB Labs. Hannes is also Professor of Data Engineering at Radboud Universiteit Nijmegen and a senior researcher at the CWI Database Architectures Group. His main interest are - shockingly - analytical data management systems.
Viktor Leis is a Professor at the Technical University of Munich (TUM), and he has worked on systems such as Hyper, Umbra and LeanStore. His research focuses on designing cost-efficient data systems for the cloud.
talk#1:
Follow your rows, wherever they may go
Bart Samwel (Databricks Amsterdam)
is a Principal Engineer at Databricks in Amsterdam. He takes at least partial blame for many recent Delta Lake innovations including Deletion Vectors, Liquid Clustering, and Row-Level Concurrency.
"This talk is about tracking row identities and changes in Delta Lake and Apache Iceberg tables. How can we assign dense, unique, and monotonically increasing identity sequence numbers while using optimistic concurrency? And how can we use this to efficiently derive the differences between arbitrary snapshot versions of a table?"
talk#2:
My Database Can Do This
Alexey Milovidov (ClickHouse) has 16 years of experience designing, developing, and operating data-intensive applications. He started work on ClickHouse in 2009 and initiated its release in open-source in 2016. Prior to ClickHouse he developed the data processing engine of the world's 2nd largest web analytics system. His area of interests is data processing algorithms and technologies. He loves data and high performance.
"Relational databases have existed for 50 years. Still, there are constant attempts to re-introduce specialized databases for specific use cases, such as stream processing, time series data, web applications, geospatial analysis, graphs, and vector search. I will provide examples of specialized data models and applications and how they fit (or not fit) into my favorite analytical database, ClickHouse."
talk#3:
60fps of UX Joy with DuckDB+Cloud
Boaz Leskes (MotherDuck Amsterdam)
is part of MotherDuck’s founding team and leads its database group. In past life, he spent some years on distributed systems, (Elastic)search and cloud platforms. Will happily talk to any of these, or speed skating, kite surfing, rowing, or any other thing of interest.
"In the age of ever more powerful hardware, where your laptop can do more than your typical Datacenter server, MotherDuck leverages DuckDB’s state of the art analytical prowess to drive compute down to your laptop as well as making the most of the Cloud. Combine DuckDB’s versatility to run everywhere (including your browser), augment it with a server-less CDW, and you get (interactive) analytical sessions delivering results in unprecedented speed. So fast it updates your dashboard in 60fps."