Merge pull request #14 from premAI-io/misc-updates

premAI-io · Aug 18, 2023 · 2f3608d · 2f3608d
2 parents 2a16d74 + a317c77
commit 2f3608d
Show file tree

Hide file tree

Showing 10 changed files with 80 additions and 36 deletions.
diff --git a/_toc.yml b/_toc.yml
@@ -3,8 +3,14 @@ format: jb-book
 root: index
 chapters:
 - file: licenses
+- file: eval-datasets
+- file: models
+- file: uncensored-models
 - file: fine-tuning
 - file: model-formats
 - file: mlops-engines
 - file: vector-stores
+- file: sdk
+- file: desktop-apps
+- file: hardware
 - file: references
diff --git a/desktop-apps.md b/desktop-apps.md
@@ -0,0 +1,7 @@
+# Desktop Apps
+
+## LMStudio
+
+## GPT4All UI
+
+{{ comments }}
diff --git a/eval-datasets.md b/eval-datasets.md
@@ -0,0 +1,11 @@
+# Evaluation & Datasets
+
+OpenLLM Leaderboard, etc.
+
+## Perplexity
+
+probably put short definition in [](glossary)?
+
+## Reinforcement Learning for a Chat-based LLM
+
+{{ comments }}
diff --git a/fine-tuning.md b/fine-tuning.md
@@ -16,6 +16,8 @@ Falcon |
 
 ## How Fine-tuning Works
 
+https://github.com/h2oai, etc.
+
 ## LLMs
 
 ### Gorilla

diff --git a/hardware.md b/hardware.md
@@ -0,0 +1,11 @@
+# Hardware
+
+## NVIDIA GPUs
+
+## Apple
+
+### Mac
+
+### iPhone
+
+{{ comments }}
diff --git a/index.md b/index.md
@@ -6,7 +6,7 @@ Site: {{ baseurl }}
 
 As a data scientist/developer with a 9 to 5 job, it's difficult to keep track of all the innovations. There's been enormous progress in the field in the last year.
 
-The guide covers all the most important categories in the Open Source AI space, from model evaluations to deployment. It includes a [glossary](glossary) for you to quickly check definitions of new frameworks & tools.
+The guide covers all the most important categories in the Open Source AI space, from model evaluations to deployment. It includes a [](glossary) for you to quickly check definitions of new frameworks & tools.
 
 A quick TL;DR overview is included at the top of each section. We outline the pros/cons and general context/background for each topic. Then we dive a bit deeper. Examples include data models were trained on, and deployment implementations.
 
@@ -28,31 +28,17 @@ We identified the main categories for what concerns open-source tooling, models,
 
 Chapter | Examples
 ---|---
-[Licenses](licenses) | LLaMA, HuggingFace, Apache-2.0
-Evaluation & Datasets | OpenLLM Leaderboard, Datasets
-Models | LLaMA 1 vs 2, Falcon, Stable Diffusion, DALL-E
-Uncensored Models | FraudGPT, PoisonGPT
-[Fine-tuning Frameworks](fine-tuning) | h20, ...
-[Model Formats](model-formats) | ONNX, Apache TVM, GGML
-[MLOps Engines](mlops-engines) | BentoML, llama.cpp, ray
-[Vector Stores](vector-stores) | weaviate, qdrant, milvus, redis, chroma
-Developer SDK | langchain, haystack, llama index
-Desktop Apps | LMStudio, GPT4All UI
-Hardware | NVIDIA GPUs, Mac, iPhone
-
-```{admonition} Chapter WiP
-:class: dropdown
-- Evaluation & Datasets
-  + What is Perplexity?
-  + What's the Role of Reinforcement Learning for a Chat-based LLM?
-- Models
-- Uncensored Models
-- Developer SDK
-  + LangChain
-  + Llama Index
-- Desktop Apps
-- Hardware
-```
+[](licenses) | LLaMA, HuggingFace, Apache-2.0
+[](eval-datasets) | OpenLLM Leaderboard, Datasets
+[](models) | LLaMA 1 vs 2, Falcon, Stable Diffusion, DALL-E
+[](uncensored-models) | FraudGPT, PoisonGPT
+[](fine-tuning) | h20, ...
+[](model-formats) | ONNX, Apache TVM, GGML
+[](mlops-engines) | BentoML, llama.cpp, ray
+[](vector-stores) | weaviate, qdrant, milvus, redis, chroma
+[](sdk) | langchain, haystack, llama index
+[](desktop-apps) | LMStudio, GPT4All UI
+[](hardware) | NVIDIA GPUs, Mac, iPhone
 
 ## Contributing
 
@@ -83,11 +69,11 @@ We understand that the current open source ecosystem is moving at light-speed. T
 
 - [Quickstart](https://jupyterbook.org/en/stable/reference/cheatsheet.html)
 - [Full reference](https://jupyterbook.org/en/stable/content/myst.html)
-- adding a new chapter
-  + create `some-file.md` containing `# Some File` heading and `{{ comments }}` ending
+- Create a new chapter:
+  + create `some-file.md` (containing `# Some File` heading and `{{ comments }}` footer)
   + add `- file: some-file` to `_toc.yml`
-  + add `[Some File](some-file) | summary` to [ToC](toc)
-- figures:
+  + add `[](some-file) | summary` to [ToC](toc)
+- Figures:
 
   ```{figure-md} fig-ref
   :class: margin-caption
@@ -100,7 +86,7 @@ We understand that the current open source ecosystem is moving at light-speed. T
   - numbered ref: {numref}`fig-ref`
   - custom ref: {numref}`Figure {number} with caption "{name}" <fig-ref>`
 
-- glossary term: {term}`GPU`
+- Glossary term: {term}`GPU`
 - `references.bib` citation: {cite}`python`
 
 ## Conclusion
@@ -109,11 +95,11 @@ Open Source AI represents the future of privacy and ownership of data. On the ot
 
 At Prem we are on a journey to make this possible, with a focus on developer experience and deployment for any sort of developers, from Web Developers with zero knowledge about AI to affirmed Data Scientist who wants to quickly deploy and try these new models and technologies in their existing infra without compromising privacy.
 
-## Join our community
+## Join our Community
 
-- Be part of the community [by joining our Discord](https://discord.com/invite/kpKk6vYVAn).
-- To stay in touch [follow us on Twitter](https://twitter.com/premai_io).
-- To report bugs or ask for support [open an issue on the Github repository](https://github.com/premAI-io/prem-app).
+- Ask for support on [our Discord server](https://discord.com/invite/kpKk6vYVAn).
+- To keep up-to-date, [follow us on Twitter](https://twitter.com/premai_io).
+- Report bugs or request features at https://github.com/premAI-io/prem-app.
 
 (glossary)=
 
@@ -128,7 +114,7 @@ Tokens
 [GPU](https://en.wikipedia.org/wiki/Graphics_processing_unit)
     Graphics Processing Unit: hardware originally designed to accelerate computer image processing, but now often repurposed for [embarrassingly parallel](https://en.wikipedia.org/wiki/Embarrassingly_parallel) computational tasks in machine learning.
 
-A100, V100 H100
+A100, V100, H100
 Vector
 Embedding
 Vector Embeddings

diff --git a/licenses.md b/licenses.md
@@ -1,3 +1,5 @@
 # Licenses
 
+LLaMA, HuggingFace, Apache-2.0
+
 {{ comments }}
diff --git a/models.md b/models.md
@@ -0,0 +1,5 @@
+# Models
+
+LLaMA 1 vs 2, Falcon, Stable Diffusion, DALL-E
+
+{{ comments }}
diff --git a/sdk.md b/sdk.md
@@ -0,0 +1,9 @@
+# Sofware Development Toolkits
+
+## LangChain
+
+## LLaMA Index
+
+## Haystack
+
+{{ comments }}
diff --git a/uncensored-models.md b/uncensored-models.md
@@ -0,0 +1,5 @@
+# Uncensored Models
+
+FraudGPT, PoisonGPT
+
+{{ comments }}
-Original file line number
+Diff line change
@@ Expand Up / @@ -16,6 +16,8 @@ Falcon | @@
     ## How Fine-tuning Works
+    https://github.com/h2oai, etc.
     ## LLMs
     ### Gorilla
@@ Expand Down @@