Models table upd #539

Merged
merged 77 commits into from
Aug 6, 2023

Conversation

nstsj
Collaborator

@nstsj nstsj commented Jul 28, 2023

This PR updates MODELS.md:

  • added information about each model's licensing and commercial use
  • merged the model link and model name columns to improve readability and reduce redundancy

AndriiHura and others added 30 commits January 24, 2022 11:22
@nstsj nstsj requested a review from smilni July 28, 2023 15:25
@smilni
Contributor

smilni commented Jul 28, 2023

What happened to the commit history here? It's better not to do this.

@nstsj
Collaborator Author

nstsj commented Jul 30, 2023

I pulled the latest dev and merged it into this branch, which is how the many commits from dev ended up here.
I used a merge because, when rebasing, I had to resolve file conflicts in every individual commit, so I sometimes had to check the same file several times.

What is our best practice for pulling in dev? I'd like to follow the team's approach in the future.

@smilni
Contributor

smilni commented Jul 31, 2023

If I understood correctly what you mean, this apparently happened by accident. Here, for example, is a Stack Overflow answer about it: https://stackoverflow.com/questions/16306012/github-pull-request-showing-commits-that-are-already-in-target-branch

In any case, the best practice is that a PR contains nothing other than what you yourself did and are proposing.

MODELS.md Outdated
| Russian XGLM 4.5B | transformers-lm-ruxglm | unavailable (private weights) | no | 4.5B | 15GB | 2,048 tokens | A private large language model for the Russian language which was fine-tuned for instruction following by Dmitry Kosenko in Summer 2023. This model is up and running on our servers and can be used for free. |
| ruGPT-3.5-13B | transformers-lm-rugpt35 | [link](https://huggingface.co/ai-forever/ruGPT-3.5-13B) | yes | 13B | 35GB (half-precision) | 2,048 tokens | A large language model for the Russian language which was used for trainig GigaChat. This model is up and running on our servers and can be used for free. |
|model name + link|container name|open-source?|size (billion parameters) |GPU usage|max tokens (prompt + response) |licence|description|
|--|--|--|--|--|--|--|--|
Contributor

As Dilya said, please fix the file's layout so that it is readable as raw Markdown too.

MODELS.md Outdated
|[GPT-4 32K](https://platform.openai.com/docs/models/gpt-4)|openai-api-gpt4-32k|no|supposedly, 175B| - (cannot be run locally) |32,768 tokens|available under subscription plan, commercial use allowed|A multilingual instruction-based large language model which is capable of code generation and other complex tasks. Same capabilities as the base gpt-4 mode but with 4x the context length. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.|
|[GPT-JT 6B](https://huggingface.co/togethercomputer/GPT-JT-6B-v1)|transformers-lm-gptjt|yes|6B|26GB|2,048 tokens|Apache 2.0 , commercial use is allowed|An open-source English-only large language model which was fine-tuned for instruction following but is NOT capable of code generation. NB: free of charge. This model is up and running on our servers and can be used for free.|
|[ChatGPT 16k](https://platform.openai.com/docs/models/gpt-3-5)|openai-api-chatgpt-16k|no|supposedly, 175B| - (cannot be run locally) |16,384 tokens|available under subscription plan, commercial use allowed|Same capabilities as the standard gpt-3.5-turbo model but with 4 times the context. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.|
|[Anthropic Claude-v1](https://docs.anthropic.com/claude/reference/complete_post)|anthropic-api-claude-v1|no|supposedly, 52B| - (cannot be run locally) |9,000 tokens|available under subscription plan, commercial use allowed|
Contributor

Where did its description and the three models after it go? Was this removed on purpose or lost by accident?

Collaborator Author

Oops, it was lost by accident. I'll fix it.
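
A row that loses its description cell, as happened here, ends up with fewer cells than the header. The following is a minimal sketch of how such rows could be caught before review. It assumes plain pipe-delimited Markdown rows with no escaped `\|` inside cells; `split_row` and `find_ragged_rows` are hypothetical helper names and are not part of this repository.

```python
from __future__ import annotations


def split_row(line: str) -> list[str]:
    """Split one Markdown table row into stripped cell strings."""
    cells = line.strip().split("|")
    # A leading or trailing pipe yields an empty string at that end; drop it.
    if cells and cells[0] == "":
        cells = cells[1:]
    if cells and cells[-1] == "":
        cells = cells[:-1]
    return [cell.strip() for cell in cells]


def find_ragged_rows(table: str) -> list[tuple[int, int]]:
    """Return (line number, cell count) for rows that differ from the header.

    Line numbers are 1-based positions within `table`; blank lines are skipped.
    The first non-blank line is treated as the header (an assumption).
    """
    numbered = [
        (number, split_row(line))
        for number, line in enumerate(table.splitlines(), start=1)
        if line.strip()
    ]
    if not numbered:
        return []
    expected = len(numbered[0][1])
    return [(n, len(cells)) for n, cells in numbered if len(cells) != expected]


if __name__ == "__main__":
    # Illustrative data only: the last row is missing its description cell.
    sample = (
        "|model|container|licence|description|\n"
        "|--|--|--|--|\n"
        "|Model A|container-a|Apache 2.0|\n"
    )
    print(find_ragged_rows(sample))
```

A check like this only counts cells, so it would not notice whole rows that were dropped, such as the three missing models mentioned above.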

Collaborator

@dilyararimovna dilyararimovna left a comment

Please rework it into a readable table, as it was before (with padded columns).
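
The padded layout requested above can be produced mechanically. This is a rough sketch, assuming a simple pipe-delimited table with no escaped `\|` in cells; `pad_table` and its helpers are hypothetical names, and nothing in this thread says the final table was formatted this way. Column width is measured with `len`, so it counts characters in the raw Markdown (including link URLs), not rendered width.

```python
from __future__ import annotations


def split_row(line: str) -> list[str]:
    """Split one Markdown table row into stripped cell strings."""
    cells = line.strip().split("|")
    if cells and cells[0] == "":
        cells = cells[1:]
    if cells and cells[-1] == "":
        cells = cells[:-1]
    return [cell.strip() for cell in cells]


def is_separator(cells: list[str]) -> bool:
    """True for the header separator row, e.g. ['--', ':-:']."""
    return bool(cells) and all(c and set(c) <= set("-:") for c in cells)


def pad_table(table: str) -> str:
    """Rewrite a Markdown table so that every column has a uniform width."""
    rows = [split_row(line) for line in table.splitlines() if line.strip()]
    if not rows:
        return ""
    # Detect separator rows before short rows are padded with empty cells.
    seps = [is_separator(row) for row in rows]
    ncols = max(len(row) for row in rows)
    rows = [row + [""] * (ncols - len(row)) for row in rows]
    # Each column is as wide as its longest content cell, and at least 3.
    widths = [
        max([3] + [len(row[i]) for row, sep in zip(rows, seps) if not sep])
        for i in range(ncols)
    ]
    out = []
    for row, sep in zip(rows, seps):
        if sep:
            # Alignment colons are not preserved in this sketch.
            cells = ["-" * width for width in widths]
        else:
            cells = [cell.ljust(width) for cell, width in zip(row, widths)]
        out.append("| " + " | ".join(cells) + " |")
    return "\n".join(out)


if __name__ == "__main__":
    print(pad_table("|model|size|\n|--|--|\n|GPT-JT 6B|6B|"))
```

Rows with very long description cells make every row equally long, so padding helps most when descriptions are kept short.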

@dilyararimovna dilyararimovna merged commit b63bba1 into dev Aug 6, 2023
@dilyararimovna dilyararimovna deleted the models-table-upd branch August 6, 2023 07:49
smilni pushed a commit that referenced this pull request Aug 8, 2023
* Fix requirements.txt (#84)

* fix itsdangerous requirements

* pin itsdangerous requirements for all flask==1.1.1 servers

* updated MODELS.md table: added info about models' licensing and commercial use + merged link+name cols to improve overall readability and decrease redundancy

* Update MODELS.md

fixed "is" for better consistency

* fix: format table and add new models back

* fix: sizes of models on gpu

* updated table

---------

Co-authored-by: Andrii.Hura <54397922+AndriiHura@users.noreply.github.com>
Co-authored-by: mtalimanchuk <mtalimanchuk@gmail.com>
Co-authored-by: Dilyara Baymurzina <dilyara.rimovna@gmail.com>
dilyararimovna added a commit that referenced this pull request Aug 9, 2023
* robot first commit (no cards)

* feat: do not use sentence ranker url from env (#535)

* Feat/ruxglm prompted dist (#528)

* feat: ignore env secret ru

* feat: add access token

* feat: distribution ruxglm

* fix: ruxglm cards

* fix: use use_auth_token

* fix: eos tokens type

* fix: stats cpu ram

* fix: skills cards

* fix: components cards

* fix: eos tokens

* fix: eos tokens

* fix: eos tokens

* fix: eos tokens

* fix: cards and table

* fix: ADDITIONAL_EOS_TOKENS

* fix: ADDITIONAL_EOS_TOKENS

* fix: codestyle

* fix: universal

* fix: dream persona ru skill name

* fix: prompt selector ru

* fix: replacement

* fix: prompt selecrto

* fix: use params

* fix: timeout and history

* fix: trye very short persona

* fix: increase timeout

* fix: sub replacement tokens correctly

* fix: sub replacement tokens correctly

* fix: use stopping criteria

* fix: typing

* fix: revert long persona

* fix: duplicate spaces

* fix: correct components for russian distribution

* fix: proxy for russian distribution

* feat: universal distr for ru

* feat: universal distr for ru

* fix: remove extra

* fix: working configs

* fix: configs

* feat: ruxglm prompted dists

* fix: component cards

* fix: container name

* fix: remove extra space after new line

* fix: remove extra space after new line

* feat: tests for dream ruxglm

* fix: proxy and ru lang

* fix: change port of universal ru

* fix: rights on file

* fix: tests skills

* fix: test for resp selector

* fix: tests for proxied components

* fix: remove do sample true

* fix: generative params

* feat: used sentence ranker url

* feat: utilized default llm

* Feat/ru prompted dists (#532)

* feat: ignore env secret ru

* feat: add access token

* feat: distribution ruxglm

* fix: ruxglm cards

* fix: use use_auth_token

* fix: eos tokens type

* fix: stats cpu ram

* fix: skills cards

* fix: components cards

* fix: eos tokens

* fix: eos tokens

* fix: eos tokens

* fix: eos tokens

* fix: cards and table

* fix: ADDITIONAL_EOS_TOKENS

* fix: ADDITIONAL_EOS_TOKENS

* fix: codestyle

* fix: universal

* fix: dream persona ru skill name

* fix: prompt selector ru

* fix: replacement

* fix: prompt selecrto

* fix: use params

* fix: timeout and history

* fix: trye very short persona

* fix: increase timeout

* fix: sub replacement tokens correctly

* fix: sub replacement tokens correctly

* fix: use stopping criteria

* fix: typing

* fix: revert long persona

* fix: duplicate spaces

* fix: correct components for russian distribution

* fix: proxy for russian distribution

* feat: universal distr for ru

* feat: universal distr for ru

* fix: remove extra

* fix: working configs

* fix: configs

* feat: ruxglm prompted dists

* fix: component cards

* fix: container name

* first dist (no cards)

* fix: remove extra space after new line

* fix: remove extra space after new line

* feat: tests for dream ruxglm

* fix: proxy and ru lang

* fix: change port of universal ru

* fix: rights on file

* fix: tests skills

* fix: test for resp selector

* multiskill_ru_assistant

* fix: tests for proxied components

* fairytale and action stories dists

* journalist helper dist

* fairytale fixes

* one more fix

* action stories cards

* add quotation marks

* fairytale cards

* storyteller cards

* journalist helper cards

* multiskill ru cards

* agent services cards

* minor fixes

* fix: utilize sentence ranker url

---------

Co-authored-by: dilyararimovna <dilyara.rimovna@gmail.com>

* update components.tsv (#537)

* update components.tsv

* tabulation

* Feat/rugpt 3.5 distribution (#534)

* feat: ignore env secret ru

* feat: add access token

* feat: distribution ruxglm

* fix: ruxglm cards

* fix: use use_auth_token

* fix: eos tokens type

* fix: stats cpu ram

* fix: skills cards

* fix: components cards

* fix: eos tokens

* fix: eos tokens

* fix: eos tokens

* fix: eos tokens

* fix: cards and table

* fix: ADDITIONAL_EOS_TOKENS

* fix: ADDITIONAL_EOS_TOKENS

* fix: codestyle

* fix: universal

* fix: dream persona ru skill name

* fix: prompt selector ru

* fix: replacement

* fix: prompt selecrto

* fix: use params

* fix: timeout and history

* fix: trye very short persona

* fix: increase timeout

* fix: sub replacement tokens correctly

* fix: sub replacement tokens correctly

* fix: use stopping criteria

* fix: typing

* fix: revert long persona

* fix: duplicate spaces

* fix: correct components for russian distribution

* fix: proxy for russian distribution

* feat: universal distr for ru

* feat: universal distr for ru

* fix: remove extra

* fix: working configs

* fix: configs

* feat: ruxglm prompted dists

* fix: component cards

* fix: container name

* fix: remove extra space after new line

* fix: remove extra space after new line

* feat: tests for dream ruxglm

* fix: proxy and ru lang

* fix: change port of universal ru

* fix: rights on file

* fix: tests skills

* fix: test for resp selector

* fix: tests for proxied components

* feat: rugpt-3.5 by sber in universal russian distribution

* fix; wait for it

* fix: models card

* fix: models card

* fix: add to list

* fix: change port

* fix: change port

* fix: change size to correct

* feat: instruction how to add a new model

* fix ru prompt selector, remove unused component (#538)

* feat: replace oasst12b with gptjt (#541)

* Feat/utilize rugpt35 (#540)

* feat: utilize rugpt35

* feat: tests for jounrlist rugpt35

* feat: tests for jounrlist rugpt35

* fix: rights for tfile

* feat: names

* fix: ru_dists_names_and_prompts (#543)

* rename ruxglm to u

* more renaming

* tabs

* tabs

* some more renaming

* short prompt

* many cards and name changes

* fix typo

* fixes for Dilya

* tiny fix

* tiny fix

* huge name check

* names

* typo prompt

* fix: no tests for non existing skills

---------

Co-authored-by: dilyararimovna <dilyara.rimovna@gmail.com>

* fix: cards for ru dists (#544)

* fix: rugpt35 config and envs (#546)

* Summarization models (#393)

* Added abstractive summarization model for English texts

* Added abstractive summarization model for Russian texts

* Added summarization annotator

* Moved rut5 summarizer to dream_russian

* Changed endpoint

* Added model path to Dockerfile

* Updated test

* Updated summarization annotator input

* Updated test

* Changed summarization service url

* Changed test

* Increased timeout

* Updated ram_usage

* Updated ports

* Updated models cards

* Added more info messages

* Fixed path error

* Added summarization output to bot attributes

* Added timeout param to dockerfile

* Updated model cards and ports

* Fixed problem with incorrect batch processing

* Updated summarization save format

* Updated dialog summarization model

* Updated tests

* Minor formatting changes

* Fixed black and flake8 codestyle

* Fixed black codestyle

* Updated models ports

* Small fixes

* Models table upd (#539)

* Fix requirements.txt (#84)

* fix itsdangerous requirements

* pin itsdangerous requirements for all flask==1.1.1 servers

* updated MODELS.md table: added info about models' licensing and commercial use + merged link+name cols to improve overall readability and decrease redundancy

* Update MODELS.md

fixed "is" for better consistency

* fix: format table and add new models back

* fix: sizes of models on gpu

* updated table

---------

Co-authored-by: Andrii.Hura <54397922+AndriiHura@users.noreply.github.com>
Co-authored-by: mtalimanchuk <mtalimanchuk@gmail.com>
Co-authored-by: Dilyara Baymurzina <dilyara.rimovna@gmail.com>

* fix: anthropic model params (#547)

* fix summarization annotator card (#549)

* add cards for prompted robot

* ports and n_utt

* port

* increase WAIT_HOSTS_TIMEOUT in cards

---------

Co-authored-by: Dilyara Zharikova (Baymurzina) <dilyara.rimovna@gmail.com>
Co-authored-by: Maxim Talimanchuk <mtalimanchuk@gmail.com>
Co-authored-by: Nikolay <99244955+Kolpnick@users.noreply.github.com>
Co-authored-by: Anastásis <43078815+nstsj@users.noreply.github.com>
Co-authored-by: Andrii.Hura <54397922+AndriiHura@users.noreply.github.com>
oserikov pushed a commit that referenced this pull request Nov 21, 2024
oserikov pushed a commit that referenced this pull request Nov 21, 2024
5 participants