#run-time #traits #async #abstraction #extensible #back-end #multiple

fusio

Fusio provides lean, minimal cost abstraction and extensible Read / Write trait to multiple storage on multiple poll-based / completion-based async runtime

9 releases

new 0.3.4 Dec 28, 2024
0.3.3 Nov 10, 2024
0.3.1 Oct 26, 2024
0.2.1 Oct 17, 2024
0.1.1 Oct 2, 2024

#1450 in Network programming

Download history 112/week @ 2024-09-25 244/week @ 2024-10-02 167/week @ 2024-10-09 233/week @ 2024-10-16 280/week @ 2024-10-23 77/week @ 2024-10-30 368/week @ 2024-11-06 71/week @ 2024-11-13 20/week @ 2024-11-20 9/week @ 2024-11-27 12/week @ 2024-12-04 8/week @ 2024-12-11 113/week @ 2024-12-25

133 downloads per month
Used in 5 crates

Apache-2.0

215KB
5.5K SLoC

Fusio

crates.io docs.rs

fusio provides random read and sequential write traits to operate on multiple storage backends (e.g., local disk, Amazon S3) across various asynchronous runtimes—both poll-based (tokio) and completion-based (tokio-uring, monoio)—with:

  • lean: binary size is at least 14× smaller than others.
  • minimal-cost abstraction: compared to bare storage backends, trait definitions allow dispatching file operations without extra overhead.
  • extensible: exposes traits to support implementing storage backends as third-party crates.

fusio is now at preview version, please join our community to attend its development and semantics / behaviors discussion.

Why do we need fusio?

In developing Tonbo, we needed a flexible and efficient way to handle file and file system operations across multiple storage backends—such as memory, local disk, and remote object storage. We also required compatibility with various asynchronous runtimes, including both completion-based runtimes and event loops in languages like Python and JavaScript.

fusio addresses these needs by providing:

  • offers traits that allow dispatch of file and file system operations to multiple storage backends.
  • usable in diverse async runtimes, not only disk but also network I/O.
  • ideal for embedded libs like Tonbo.
  • can be extended via third-party crates, enabling custom asynchronous file and file system implementations.

For more context, please check apache/arrow-rs#6051.

How to use it?

Installation

fusio = { version = "*", features = ["tokio"] }

Examples

Runtime agnostic

fusio supports switching the async runtime at compile time. Middleware libraries can build runtime-agnostic implementations, allowing the top-level application to choose the runtime.

Object safety

fusio provides two sets of traits:

  • Read / Write / Seek / Fs are not object-safe.
  • DynRead / DynWrite / DynSeek / DynFs are object-safe.

You can freely transmute between them.

File system traits

fusio has an optional Fs trait (use default-features = false to disable it). It dispatches common file system operations (open, remove, list, etc.) to specific storage backends (local disk, Amazon S3).

S3 support

fusio has optional Amazon S3 support (enable it with features = ["tokio-http", "aws"]); the behavior of S3 operations and credentials does not depend on tokio.

When to choose fusio?

Overall, fusio carefully selects a subset of semantics and behaviors from multiple storage backends and async runtimes to ensure native performance in most scenarios. For example, fusio adopts a completion-based API (inspired by monoio) so that file operations on tokio and tokio-uring have the same performance as they would without fusio.

compare with object_store

object_store is locked to tokio and also depends on bytes. fusio uses IoBuf / IoBufMut to allow &[u8] and Vec<u8> to avoid potential runtime costs. If you do not need to consider other async runtimes, try object_store; as the official implementation, it integrates well with Apache Arrow and Parquet.

compare with opendal

fusio does not aim to be a full data access layer like opendal. fusio keeps features lean, and you are able to enable features and their dependencies one by one. The default binary size of fusio is 16KB, which is smaller than opendal (439KB). If you need a full ecosystem of DAL (tracing, logging, metrics, retry, etc.), try opendal.

Also, compared with opendal::Operator, fusio exposes core traits and allows them to be implemented in third-party crates.

Roadmap

  • abstractions
    • file operations
    • (partial) file system operations
  • storage backend implementations
    • disk
      • tokio
      • tokio-uring
      • monoio
    • network
      • HTTP client trait
      • network storage runtime support
        • tokio (over reqwest)
        • monoio (over hyper-tls)
        • tokio-uring (over hyper-tls)
      • Amazon S3
      • Azure Blob Storage
      • Cloudflare R2
    • in-memory
  • conditional operations
  • extensions
    • parquet support
    • object_store support

Credits

  • monoio: all core traits—buffer, read, and write—are highly inspired by it.
  • futures: its design of abstractions and organization of several crates (core, util, etc.) to avoid coupling have influenced fusio's design.
  • opendal: Compile-time poll-based/completion-based runtime switching inspires fusio.
  • object_store: fusio adopts S3 credential and path behaviors from it.

Dependencies

~3–18MB
~276K SLoC