Show HN: OrioleDB Beta12 Features and Benchmarks

orioledb.com

55 points by akorotkov 6 days ago

Hey HN, I'm the creator of OrioleDB, an extension for PostgreSQL that serves as a drop-in replacement for the default Heap storage engine. It is designed to address scalability bottlenecks in PostgreSQL's buffer manager and reduce the WAL, enabling better utilization of modern multi-core CPUs and high‑performance storage systems.

We are getting closer to GA. This release includes:

- An index bridge to support all indexes that Heap supports

- Support for rewinding recent changes in the database.

- Tablespaces support

- Fillfactor support

- An orioledb_tree_stat() function for space utilization statistics

- Support for tables with more than 32 columns.

We also show several performance improvements using the TPC-C benchmarks. Overall, OrioleDB is much faster than Heap, also outperforming other Postgres providers.

We would love more people testing OrioleDB. The fastest way to do that is to use the docker image provided:

    docker run -d --name orioledb -p 5432:5432 orioledb/orioledb
Read the full release here:

https://www.orioledb.com/blog/orioledb-beta12-benchmarks

akorotkov 6 days ago

Additionally, OrioleDB beta12 features a new fastpath tree search, which can accelerate workloads with intensive key-value lookups by up to 20%. Stay tuned for a new blog post about this later this week.

atombender 2 days ago

Have you gotten any feedback from the core Postgres team about whether any of the improvements might make it into the mainline, or do you think it will remain an extension for ever?

I haven't studied the design super closely, so there might be shortcomings or design compromises that aren't immediately obvious. But from where I'm sitting the performance improvements seem so extreme that it's hard to see why anyone would not want to replace the current MVCC heap with this.

An undo log-based system is essentially the way Oracle structures its core row storage, with each row mutated in place and containing a header listing the chain of transactions currently holding different versions of it. This means you avoid the bloat that comes with Postgres-style MVCC, by sacrificing some read performance whenever someone needs to read a row that has been mutated but not committed, as the reader has to follow the chain to find the old committed version in the undo log. That's always seemed to me to be a better design. Moving the dead tuple bloat away from the main heap and into a separate undo log makes a lot of sense.

But I wonder if the core team, which is famously quite conservative about introducing big changes, sees this as too different to adopt as a replacement for what's been the state of the art since forever.

  • kiwicopple a day ago

    > But I wonder if the core team, which is famously quite conservative about introducing big changes, sees this as too different to adopt as a replacement

    we respect the level of conservatism in Postgres and expect that this could take a while to upstream. we're committed to any timeline and we'll develop it for self-hosters and on the supabase platform first to iron out all the bugs

    that said, even if it remains an extension that's fine: as long as the TAM patches land upstream this enables everyone to create storage engines using an extension approach - a very "postgres" way of doing things

    you can follow the progress of these patches here:

    https://www.orioledb.com/docs#patch-set

    • atombender a day ago

      I know it can exist just fine as an independent extension. I was asking more about whether you had gotten any signal that the core team was interested at all.

      With software projects like these, you often have a somewhat insular core team who isn't particularly amenable to innovations that may be perceived as disruptive or "not invented here" or going against the grain, or for other reasons. With 30+ years fighting the downsides of MVCC vacuums and heap bloat you would think and hope they would be jumping at the opportunity.

      The extension system is great. But extensions will always suffer from not being part of the core. For example, if you use a cloud provider like AWS and GCP, you're limited by what extensions are included, as well as their release cadence.

      • kiwicopple a day ago

        We have made our intentions clear with the core team but also haven’t pushed the agenda much - it’s too early.

        The conversations we have had so far are promising, we just need to make sure we approach it the right way. Projects like Oriole have started and sputtered out several times (zheap, for example). The burden first lies on our side to prove that we can see it through to GA, with meaningful usage. They (correctly) shouldn’t need to entertain the maintenance burden until they know the juice is worth the squeeze.

        If it’s not accepted into core, we will work with cloud providers to add it as a supported extension

ksec 2 days ago

Hopefully this will be part of the official Postgres soon(ish), along with what Planetscale is doing we could finally have an Open Source SQL DB system that is powerful without too much customisation.

ololobus 3 days ago

Does it require core patches or I can install it into the standard upstream Postgres? Asking because, afaik, it did, but it might that something has changed already.

znpy 2 days ago

Great! How does it fit into the landscape?

Can i used it in conjunction with other extensions, like the recently announced pgactive from amazon?

rickette 2 days ago

Would Postgres FTS (ts_vector) benefit from this storage engine?

kabes 3 days ago

Awesome. How far away would you say are you from a stable orioledb as postgres extension?

jfbaro 3 days ago

Great work!!