This repository has been archived by the owner on Jun 28, 2024. It is now read-only.

Changed split ratios
noaahh committed Jun 6, 2024
1 parent 402258d commit 84bb577
Showing 4 changed files with 3 additions and 3 deletions.
Binary file modified data/partitions/labelled_dev.parquet
Binary file not shown.
Binary file modified data/partitions/unlabelled_dev.parquet
Binary file not shown.
Binary file modified data/partitions/validation_set.parquet
Binary file not shown.
6 changes: 3 additions & 3 deletions src/prep_datasets.py
@@ -81,11 +81,11 @@ def parse_fractions_string(fractions_str):

 @app.command()
 def prepare_dataset(
-    dev_set_fraction: float = typer.Option(1 / 600,
+    dev_set_fraction: float = typer.Option(1 / 1440,
         help="Fraction of the full dataset to be used as the development set"),
-    val_set_fraction: float = typer.Option(1 / 600,
+    val_set_fraction: float = typer.Option(1 / 1440,
         help="Fraction of the test dataset to be used as the validation set"),
-    labelled_fraction: float = typer.Option(1 / 6, help="Fraction of the development set that should be labelled"),
+    labelled_fraction: float = typer.Option(1 / 10, help="Fraction of the development set that should be labelled"),
     output_dir: str = typer.Option(os.getenv("DATA_DIR"), help="Directory to save the parquet files. Defaults to "
         "the DATA_DIR environment variable."),
     verbose: bool = typer.Option(False, help="Enable verbose logging")
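
To make the effect of the new defaults concrete, here is a rough sketch of the row counts the old and new fractions would imply. The dataset sizes (`N_FULL`, `N_TEST`), the `split_sizes` helper, and the truncating rounding are all assumptions made for illustration; they are not taken from `src/prep_datasets.py`, which may sample or round differently.

```python
from fractions import Fraction


def split_sizes(n_full, n_test, dev_frac, val_frac, labelled_frac):
    """Return the row counts implied by the three fractions (hypothetical helper)."""
    n_dev = int(n_full * dev_frac)          # development set drawn from the full dataset
    n_labelled = int(n_dev * labelled_frac)  # labelled part of the development set
    n_val = int(n_test * val_frac)           # validation set drawn from the test dataset
    return {
        "dev": n_dev,
        "labelled_dev": n_labelled,
        "unlabelled_dev": n_dev - n_labelled,
        "validation": n_val,
    }


# Hypothetical sizes, chosen only so the arithmetic comes out in whole numbers.
N_FULL = 1_440_000
N_TEST = 144_000

# Defaults before this commit: 1/600, 1/600, 1/6.
old = split_sizes(N_FULL, N_TEST, Fraction(1, 600), Fraction(1, 600), Fraction(1, 6))
# Defaults after this commit: 1/1440, 1/1440, 1/10.
new = split_sizes(N_FULL, N_TEST, Fraction(1, 1440), Fraction(1, 1440), Fraction(1, 10))

print("old:", old)
print("new:", new)
```

Under these assumed sizes the development set shrinks from 2,400 to 1,000 rows, and the labelled share of it drops from 400 to 100 rows, which is consistent with all three parquet partitions being regenerated in this commit.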