This repository has been archived by the owner on Aug 13, 2024. It is now read-only.
Semantic functions #23
ConsciousCode
started this conversation in
General
Replies: 1 comment
-
This is similar to an idea I cooked up much earlier this year, but it was around finetuning models to provide individual "cognitive functions." My original idea was to finetune lightweight models to provide basic cognitive capabilities like brainstorming or choosing individual options. This is a good direction to go in. By creating a reusable toolbox, you make it easier to build a "plug and play" architecture. Thanks |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm excited to see the demos for the ACE framework and where it leads. This sort of high-level systemic model makes it a lot easier to conceptualize and try out my own experiments using it.
I have a feeling my servitor library, or at least its concept, might be helpful in building the different layers. It's based on Microsoft's Semantic Kernels, but stripped down to what I perceive to be a potential new programming abstraction - "semantic functions" - which is simply a function defined in natural language with discrete inputs and outputs. Thus, rather than treat LLMs as conversational agents which require careful curation of input context and allowable responses, they're used as simple language models with predictable outputs. For instance, using my library's API:
(Note that the library includes additional configurable internal prompting and per-annotation switching of hyperparameters and model).
It's meant for interoperability with the normal programming environment, which may be useful for the more technical layers, but is flexible enough to receive and output ordinary natural language. The LLM is given the function signature it's meant to execute, so you could eg create a dummy class
NaturalLanguage
or just leave it unannotated and omit the postprocessing steps, simply returning the string generation. If for some reason this isn't helpful, I'd also like to know why too since that may provide insights which can inform my own experiments.Beta Was this translation helpful? Give feedback.
All reactions