
Script to figure out model sequence length limits #1579

Merged
merged 7 commits into from
Jun 28, 2023

Conversation

JosselinSomervilleRoberts
Contributor

This script is used to find the limits required by the window service.

@percyliang
Contributor

Can we make this a scenario that we run officially (kind of like a unit test) rather than a one-off script?

@JosselinSomervilleRoberts
Contributor Author

Sure, I could look into that. For now, since @yifanmai needs this (I think) to fix the canary runs, should we keep it as a script? I could then make it into a scenario in a different PR.

@yifanmai
Collaborator

This can't be a scenario because scenarios cannot access the tokenizer. synthetic_efficiency has the same issue; we have to manually generate a static file for each tokenizer.

@yifanmai yifanmai changed the title Script to figure out limits Script to figure out model sequence length limits May 23, 2023
Collaborator

@yifanmai yifanmai left a comment


@teetone could you also take a look at this?


# model_name, tokenizer_name, prefix and suffix are passed as arguments
parser = argparse.ArgumentParser()
parser.add_argument("--model_name", type=str, default="writer/palmyra-base")
Collaborator


Optional suggestion: helm-run and helm-summarize use hyphens for flags; you could consider doing that here for consistency. Note that argparse will automatically convert hyphenated flag names to underscores, e.g. args.model_name.
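A minimal sketch of the argparse behavior described above (the flag name and default mirror the script's; the no-argument parse call is just for illustration):

```python
import argparse

# argparse converts hyphens in a long option name to underscores
# when deriving the attribute name on the parsed namespace.
parser = argparse.ArgumentParser()
parser.add_argument("--model-name", type=str, default="writer/palmyra-base")

args = parser.parse_args([])  # parse no CLI args, so the default applies
print(args.model_name)        # the hyphenated flag is exposed as model_name
```

So switching the flag to `--model-name` would not require any changes to code that reads `args.model_name`.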

Contributor Author


Not sure I understand this; I will just skip it for now.

return lower_bound + max_prompt_length


def check_limits(
Collaborator


Is this just for double-checking? I.e., if the measurement was performed correctly, should this method always return that the limits are correct?

Contributor Author


Normally it should. It was just for me to check that my implementation was correct and that I did not have an off-by-one problem in my binary search.
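The binary search and the double-check discussed above can be sketched as follows. This is not the PR's actual implementation; `can_fit` is a hypothetical predicate standing in for a real tokenize-and-send-request probe against the model:

```python
def find_max_prompt_length(can_fit, lower_bound: int, upper_bound: int) -> int:
    """Return the largest n in [lower_bound, upper_bound] with can_fit(n)."""
    assert can_fit(lower_bound), "lower_bound itself must be accepted"
    lo, hi = lower_bound, upper_bound
    while lo < hi:
        mid = (lo + hi + 1) // 2  # round up so the loop always makes progress
        if can_fit(mid):
            lo = mid       # mid is accepted; the answer is at least mid
        else:
            hi = mid - 1   # mid is rejected; the answer is below mid
    return lo


def check_limits(can_fit, max_prompt_length: int) -> bool:
    """Double-check the result: the limit fits, and limit + 1 does not."""
    return can_fit(max_prompt_length) and not can_fit(max_prompt_length + 1)
```

For example, with a true limit of 2048 tokens, `find_max_prompt_length(lambda n: n <= 2048, 1, 10000)` returns 2048, and `check_limits` then confirms that 2048 fits while 2049 does not, which is exactly the kind of off-by-one verification described above.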


@yifanmai yifanmai requested a review from teetone May 23, 2023 20:30
@yifanmai yifanmai merged commit 259a355 into main Jun 28, 2023
@yifanmai yifanmai deleted the josselin-script-limits branch June 28, 2023 18:18
3 participants