Merge pull request #14 from premAI-io/misc-updates
casperdcl authored Aug 18, 2023
2 parents 2a16d74 + a317c77 commit 2f3608d
Showing 10 changed files with 80 additions and 36 deletions.
6 changes: 6 additions & 0 deletions _toc.yml
@@ -3,8 +3,14 @@ format: jb-book
root: index
chapters:
- file: licenses
- file: eval-datasets
- file: models
- file: uncensored-models
- file: fine-tuning
- file: model-formats
- file: mlops-engines
- file: vector-stores
- file: sdk
- file: desktop-apps
- file: hardware
- file: references
7 changes: 7 additions & 0 deletions desktop-apps.md
@@ -0,0 +1,7 @@
# Desktop Apps

## LMStudio

## GPT4All UI

{{ comments }}
11 changes: 11 additions & 0 deletions eval-datasets.md
@@ -0,0 +1,11 @@
# Evaluation & Datasets

OpenLLM Leaderboard, etc.

## Perplexity

probably put short definition in [](glossary)?

## Reinforcement Learning for a Chat-based LLM

{{ comments }}
2 changes: 2 additions & 0 deletions fine-tuning.md
@@ -16,6 +16,8 @@ Falcon |

## How Fine-tuning Works

https://github.com/h2oai, etc.

## LLMs

### Gorilla
11 changes: 11 additions & 0 deletions hardware.md
@@ -0,0 +1,11 @@
# Hardware

## NVIDIA GPUs

## Apple

### Mac

### iPhone

{{ comments }}
58 changes: 22 additions & 36 deletions index.md
@@ -6,7 +6,7 @@ Site: {{ baseurl }}

As a data scientist/developer with a 9 to 5 job, it's difficult to keep track of all the innovations. There's been enormous progress in the field in the last year.

The guide covers all the most important categories in the Open Source AI space, from model evaluations to deployment. It includes a [glossary](glossary) for you to quickly check definitions of new frameworks & tools.
The guide covers all the most important categories in the Open Source AI space, from model evaluations to deployment. It includes a [](glossary) for you to quickly check definitions of new frameworks & tools.

A quick TL;DR overview is included at the top of each section. We outline the pros/cons and general context/background for each topic. Then we dive a bit deeper. Examples include the data that models were trained on, and deployment implementations.

@@ -28,31 +28,17 @@ We identified the main categories for what concerns open-source tooling, models,

Chapter | Examples
---|---
[Licenses](licenses) | LLaMA, HuggingFace, Apache-2.0
Evaluation & Datasets | OpenLLM Leaderboard, Datasets
Models | LLaMA 1 vs 2, Falcon, Stable Diffusion, DALL-E
Uncensored Models | FraudGPT, PoisonGPT
[Fine-tuning Frameworks](fine-tuning) | h2o, ...
[Model Formats](model-formats) | ONNX, Apache TVM, GGML
[MLOps Engines](mlops-engines) | BentoML, llama.cpp, ray
[Vector Stores](vector-stores) | weaviate, qdrant, milvus, redis, chroma
Developer SDK | langchain, haystack, llama index
Desktop Apps | LMStudio, GPT4All UI
Hardware | NVIDIA GPUs, Mac, iPhone

```{admonition} Chapter WiP
:class: dropdown
- Evaluation & Datasets
+ What is Perplexity?
+ What's the Role of Reinforcement Learning for a Chat-based LLM?
- Models
- Uncensored Models
- Developer SDK
+ LangChain
+ Llama Index
- Desktop Apps
- Hardware
```
[](licenses) | LLaMA, HuggingFace, Apache-2.0
[](eval-datasets) | OpenLLM Leaderboard, Datasets
[](models) | LLaMA 1 vs 2, Falcon, Stable Diffusion, DALL-E
[](uncensored-models) | FraudGPT, PoisonGPT
[](fine-tuning) | h2o, ...
[](model-formats) | ONNX, Apache TVM, GGML
[](mlops-engines) | BentoML, llama.cpp, ray
[](vector-stores) | weaviate, qdrant, milvus, redis, chroma
[](sdk) | langchain, haystack, llama index
[](desktop-apps) | LMStudio, GPT4All UI
[](hardware) | NVIDIA GPUs, Mac, iPhone

## Contributing

@@ -83,11 +69,11 @@ We understand that the current open source ecosystem is moving at light-speed. T

- [Quickstart](https://jupyterbook.org/en/stable/reference/cheatsheet.html)
- [Full reference](https://jupyterbook.org/en/stable/content/myst.html)
- adding a new chapter
+ create `some-file.md` containing `# Some File` heading and `{{ comments }}` ending
- Create a new chapter:
+ create `some-file.md` (containing `# Some File` heading and `{{ comments }}` footer)
+ add `- file: some-file` to `_toc.yml`
+ add `[Some File](some-file) | summary` to [ToC](toc)
- figures:
+ add `[](some-file) | summary` to [ToC](toc)
- Figures:

```{figure-md} fig-ref
:class: margin-caption
@@ -100,7 +86,7 @@ We understand that the current open source ecosystem is moving at light-speed. T
- numbered ref: {numref}`fig-ref`
- custom ref: {numref}`Figure {number} with caption "{name}" <fig-ref>`

- glossary term: {term}`GPU`
- Glossary term: {term}`GPU`
- `references.bib` citation: {cite}`python`
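
The "create a new chapter" steps above can be sketched as a small script. This is only an illustration, not part of the commit: `add_chapter` is a hypothetical helper, and it assumes that `_toc.yml` and `index.md` sit in the same directory, that the `chapters:` list is the last block in `_toc.yml`, and that ToC rows in `index.md` start with `[](`.

```python
from pathlib import Path


def add_chapter(root: Path, slug: str, title: str, summary: str) -> None:
    """Scaffold a chapter following the three steps listed above (hypothetical helper)."""
    root = Path(root)

    # Step 1: chapter file with a heading and the `{{ comments }}` footer.
    (root / f"{slug}.md").write_text(
        f"# {title}\n\n{{{{ comments }}}}\n", encoding="utf-8"
    )

    # Step 2: append the entry to `_toc.yml`.
    # Assumption: the `chapters:` list is the last block in the file.
    toc = root / "_toc.yml"
    toc_text = toc.read_text(encoding="utf-8")
    if not toc_text.endswith("\n"):
        toc_text += "\n"
    toc.write_text(toc_text + f"- file: {slug}\n", encoding="utf-8")

    # Step 3: add a row after the last `[](...) | ...` row in index.md.
    index = root / "index.md"
    lines = index.read_text(encoding="utf-8").splitlines()
    rows = [
        i for i, line in enumerate(lines)
        if line.startswith("[](") and " | " in line
    ]
    if not rows:
        raise ValueError("no ToC table rows found in index.md")
    lines.insert(rows[-1] + 1, f"[]({slug}) | {summary}")
    index.write_text("\n".join(lines) + "\n", encoding="utf-8")
```

Calling `add_chapter(Path("."), "some-file", "Some File", "summary")` from the book root would append the new entry at the end of both lists, so the ordering (for example, keeping `references` last) may still need a manual adjustment.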

## Conclusion
@@ -109,11 +95,11 @@ Open Source AI represents the future of privacy and ownership of data. On the ot

At Prem we are on a journey to make this possible, with a focus on developer experience and deployment for all kinds of developers, from web developers with zero knowledge of AI to experienced data scientists who want to quickly deploy and try these new models and technologies in their existing infrastructure without compromising privacy.

## Join our community
## Join our Community

- Be part of the community [by joining our Discord](https://discord.com/invite/kpKk6vYVAn).
- To stay in touch [follow us on Twitter](https://twitter.com/premai_io).
- To report bugs or ask for support [open an issue on the Github repository](https://github.com/premAI-io/prem-app).
- Ask for support on [our Discord server](https://discord.com/invite/kpKk6vYVAn).
- To keep up-to-date, [follow us on Twitter](https://twitter.com/premai_io).
- Report bugs or request features at https://github.com/premAI-io/prem-app.

(glossary)=

@@ -128,7 +114,7 @@ Tokens
[GPU](https://en.wikipedia.org/wiki/Graphics_processing_unit)
Graphics Processing Unit: hardware originally designed to accelerate computer image processing, but now often repurposed for [embarrassingly parallel](https://en.wikipedia.org/wiki/Embarrassingly_parallel) computational tasks in machine learning.
A100, V100 H100
A100, V100, H100
Vector
Embedding
Vector Embeddings
2 changes: 2 additions & 0 deletions licenses.md
@@ -1,3 +1,5 @@
# Licenses

LLaMA, HuggingFace, Apache-2.0

{{ comments }}
5 changes: 5 additions & 0 deletions models.md
@@ -0,0 +1,5 @@
# Models

LLaMA 1 vs 2, Falcon, Stable Diffusion, DALL-E

{{ comments }}
9 changes: 9 additions & 0 deletions sdk.md
@@ -0,0 +1,9 @@
# Software Development Toolkits

## LangChain

## LLaMA Index

## Haystack

{{ comments }}
5 changes: 5 additions & 0 deletions uncensored-models.md
@@ -0,0 +1,5 @@
# Uncensored Models

FraudGPT, PoisonGPT

{{ comments }}
