Skip to content

Commit

Permalink
updating docs
Browse files Browse the repository at this point in the history
  • Loading branch information
muhamadazmy committed Sep 22, 2023
1 parent 8df4b97 commit 5cdd2b0
Show file tree
Hide file tree
Showing 2 changed files with 123 additions and 42 deletions.
156 changes: 120 additions & 36 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,45 +1,129 @@

# Introduction
This is a drop in replacement for [0-fs](https://github.com/threefoldtech/0-fs) written from scratch in rust.

## Motive
Improve stability and resources needed to run a 0-fs instance.
`rfs` is the main tool to create, mount and extract FungiStore lists (FungiList)`fl` for short. An `fl` is a simple format
to keep information about an entire filesystem in a compact form. It does not hold the data itself but enough information to
retrieve this data back from a `store`.

## Building rfs

## Features
- [x] Mount an flist
- [x] Query file `stat`
- [x] Traverse the mount tree
- [x] Download files (chunks) from hub
- [x] Opening files for reading
- [x] Reading files
- [x] Command line interface
- [x] ACL and ownership
To build rfs make sure you have rust installed then run the following commands:

## Difference between the 2 implementations
- Rust fuse library only provide the inode api (no higher level API). This is the reason we can't merge flists in runtime, because `inodes` are
generated from `rawid` of the sqlite database.
```bash
# this is needed to be run once to make sure the musl target is installed
rustup target add x86_64-unknown-linux-musl

## Basic usage
# build the binary
cargo build --features build-binary --release --target=x86_64-unknown-linux-musl
```
Mount Flists 0.1
USAGE:
rfs [FLAGS] [OPTIONS] <TARGET> --meta <META>
FLAGS:
-d, --daemon daemonize process
--debug enable debug logging
-h, --help Prints help information
-V, --version Prints version information
OPTIONS:
--cache <cache> cache directory [default: /tmp/cache]
--storage-url <hub> storage url to retrieve files from [default: redis://hub.grid.tf:9900]
--log <log> log file only in daemon mode
--meta <META> metadata file, can be a .flist file, a .sqlite3 file or a directory with a
`flistdb.sqlite3` inside
ARGS:
<TARGET>

the binary will be available under `./target/x86_64-unknown-linux-musl/release/rfs` you can copy that binary then to `/usr/bin/`
to be able to use from anywhere on your system.

## Stores

A store in where the actual data lives. A store can be as simple as a `directory` on your local machine in that case the files on the `fl` are only 'accessible' on your local machine. A store can also be a `zdb` running remotely or a cluster of `zdb`. Right now only `dir` and `zdb` stores are supported but this will change in the future to support even more stores.

## Usage

### Creating an `fl`

```bash
rfs pack -m output.fl -s <store-specs> <directory>
```

This tells rfs to create an `fl` named `output.fl` using the store defined by the url `<store-specs>` and upload all the files under directory recursively.

The simplest form of `<store-specs>` is a `url`. the store `url` defines the store to use. Any `url`` has a schema that defines the store type. Right now we have support only for:

- `dir`: dir is a very simple store that is mostly used for testing. A dir store will store the fs blobs in another location defined by the url path. An example of a valid dir url is `dir:///tmp/store`
- `zdb`: [zdb](https://github.com/threefoldtech/0-db) is a append-only key value store and provides a redis like API. An example zdb url can be something like `zdb://<hostname>[:port][/namespace]`

`<store-specs>` can also be of the form `<start>-<end>=<url>` where `start` and `end` are a hex bytes for partitioning of blob keys. rfs will then store a set of blobs on the defined store if they blob key falls in the `[start:end]` range (inclusive).

If the `start-end` range is not provided a `00-FF` range is assume basically a catch all range for the blob keys. In other words, all blobs will be written to that store.

This is only useful because `rfs` can accept multiple stores on the command line with different and/or overlapping ranges.

For example `-s 00-80=dir:///tmp/store0 -s 81-ff=dir://tmp/store1` means all keys that has prefix byte in range `[00-80]` will be written to /tmp/store0 all other keys `00-ff` will be written to store1.

The same range can appear multiple times, which means the blob will be replicated to all the stores that matches its key prefix.

To quickly test this operation

```bash
rfs pack -m output.fl -s 00-80=dir:///tmp/store0 -s 81-ff=dir:///tmp/store1 ~/Documents
```

this command will effectively create the `output.fl` and store (and shard) the blobs across the 2 locations /tmp/store0 and /tmp/store1.

```bash
#rfs pack --help

create an FL and upload blocks to provided storage

Usage: rfs pack [OPTIONS] --meta <META> <TARGET>

Arguments:
<TARGET> target directory to upload

Options:
-m, --meta <META> path to metadata file (flist)
-s, --store <STORE> store url in the format [xx-xx=]<url>. the range xx-xx is optional and used for sharding. the URL is per store type, please check docs for more information
-h, --help Print help
```

# Mounting an flist

Once the `fl` is created it can be distributes to other people. Then they can mount the `fl` which will allow them then to traverse the packed filesystem and also access (read-only) the files.

To mount an `fl` only the `fl` is needed since all information regarding the `stores` is already stored in the `fl`. This also means you can only share the `fl` if the other user can actually reach the store used to crate the `fl`. So a `dir` store is not sharable, also a `zdb` instance that is running on localhost :no_good:

```bash
sudo rfs mount -m output.fl <target>
```

The `<target>` is the mount location, usually `/mnt` but can be anywhere. In another terminal you can now `cd <target>` and walk the filesystem tree. Opening the files will trigger a file download from the store only on read access.

full command help

```bash
# rfs mount --help

mount an FL

Usage: rfs mount [OPTIONS] --meta <META> <TARGET>

Arguments:
<TARGET> target mountpoint

Options:
-m, --meta <META> path to metadata file (flist)
-c, --cache <CACHE> directory used as cache for downloaded file chuncks [default: /tmp/cache]
-d, --daemon run in the background
-l, --log <LOG> log file only used with daemon mode
-h, --help Print help
```

# Unpack and fl

Similar to `mount` rfs provides an `unpack` subcommand that downloads the entire content (extract) of an `fl` to a provided directory.

```bash
fs unpack --help
unpack (downloads) content of an FL the provided location

Usage: rfs unpack [OPTIONS] --meta <META> <TARGET>

Arguments:
<TARGET> target directory to upload

Options:
-m, --meta <META> path to metadata file (flist)
-c, --cache <CACHE> directory used as cache for downloaded file chuncks [default: /tmp/cache]
-h, --help Print help
```

# Specifications

Please check [docs](docs)
9 changes: 3 additions & 6 deletions src/main.rs
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,6 @@ enum Commands {
Mount(MountOptions),
/// create an FL and upload blocks to provided storage
Pack(PackOptions),

/// unpack (downloads) content of an FL the provided location
Unpack(UnpackOptions),
}
Expand All @@ -45,17 +44,14 @@ struct MountOptions {
#[clap(short, long, default_value_t = String::from("/tmp/cache"))]
cache: String,

/// run in the background.
#[clap(short, long)]
daemon: bool,

/// log file only used with daemon mode
#[clap(short, long)]
log: Option<String>,

/// hidden value
#[clap(long = "ro", hide = true)]
ro: bool,

/// target mountpoint
target: String,
}
Expand All @@ -66,7 +62,8 @@ struct PackOptions {
#[clap(short, long)]
meta: String,

/// store url
/// store url in the format [xx-xx=]<url>. the range xx-xx is optional and used for
/// sharding. the URL is per store type, please check docs for more information
#[clap(short, long, action=ArgAction::Append)]
store: Vec<String>,

Expand Down

0 comments on commit 5cdd2b0

Please sign in to comment.