Adds script for AWQ-quantizing model #101

ru5h16h · 2024-03-04T00:21:14Z

Pull Request Checklist

Reference Issue

ref: castorini/ura-projects#4

Checklist Items

Before submitting your pull request, please review these items:

[Yes] Have you followed the contributing guidelines?
[Yes] Have you verified that there are no existing Pull Requests for the same update/change?
[Yes] Have you updated any relevant documentation or added new tests where needed?

PR Type

What kind of change does this PR introduce?

Bugfix
Feature
Code style update (formatting, local variables)
Refactoring (no functional changes, no api changes)
Documentation content changes
Other...
- Description: Adds model quantization using AWQ

ronakice · 2024-03-04T05:39:02Z

save_awq.py

This should not be in the main repo folder, probably src/rank_llm/scripts

sahel-sh

Thank you for the PR!

sahel-sh · 2024-03-06T01:41:39Z

save_awq.py

+    parser.add_argument(
+        "--dataset",
+        type=str,
+        default="msp_open_ai_ada2_random_s5000_gpt4_da0_mr20_sampled_mix.jsonl",


is it possible to have rank zephy's training data or a subset of it as the default value of the calibration dataset?

This would probably requires changes to the load dataset logic too

This is the file that @ronakice shared. Wasn't this one used for training?

No, this is not the data that we used for finetuning rankzephyr, but I leave it to Ronak to decide if we want to you the training dataset or the one that we shared with you.

@ronakice PTAL!

/u3/rpradeep/RankVicuna/data/msp_open_ai_ada2_random_s5000_gpt4_da0_mr20_sampled_mix.jsonl

This is the file I used to train RankZephyr @sahel-sh?

either way, something is off in AWQ quantizing, i will advice against merging until this is properly sorted

save_awq.py

sahel-sh

sorry for being slow on this, LGTM!

ru5h16h · 2024-04-05T20:09:15Z

Here are the details outlining the insights gathered and other experimental information: https://docs.google.com/document/d/1BHpN9lDVGjtjIAFMxjUxNuu1K4KJOgIaOZXkWF1_K8c/edit

ru5h16h added 2 commits March 4, 2024 00:10

Adds script for AWQ-quantizing model

166f3ed

Refactored changes

eeff317

ru5h16h marked this pull request as draft March 4, 2024 00:23

ronakice reviewed Mar 4, 2024

View reviewed changes

sahel-sh reviewed Mar 6, 2024

View reviewed changes

sahel-sh approved these changes Mar 12, 2024

View reviewed changes

Moved file to scripts folder

b5d92f4

ru5h16h marked this pull request as ready for review April 5, 2024 20:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds script for AWQ-quantizing model #101

Adds script for AWQ-quantizing model #101

ru5h16h commented Mar 4, 2024 •

edited

Loading

ronakice Mar 4, 2024

ru5h16h Mar 6, 2024

sahel-sh left a comment

sahel-sh Mar 6, 2024

sahel-sh Mar 6, 2024

ru5h16h Mar 6, 2024

sahel-sh Mar 12, 2024

sahel-sh Mar 12, 2024

ronakice Mar 12, 2024

ronakice Mar 12, 2024

sahel-sh left a comment

ru5h16h commented Apr 5, 2024

Adds script for AWQ-quantizing model #101

Are you sure you want to change the base?

Adds script for AWQ-quantizing model #101

Conversation

ru5h16h commented Mar 4, 2024 • edited Loading

Pull Request Checklist

Reference Issue

Checklist Items

PR Type

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sahel-sh left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sahel-sh left a comment

Choose a reason for hiding this comment

ru5h16h commented Apr 5, 2024

ru5h16h commented Mar 4, 2024 •

edited

Loading