Model/Dataset Capabilities #13
-
There are a few service calls I'd like to see implemented if possible:
Out of curiosity, what is the basic process for implementing new capabilities? Is it a matter of generating new dataset "piles" for the additional services and retraining, or are additional code changes required elsewhere on your end?
-
Just getting my feet wet with this, but it appears that the Home-3B model is not good at comprehending calendar entities and associated items. I have had exactly 0% success with around a dozen variations/iterations of each of the below (and received plenty of false "I did what you asked" responses, to boot!)
To be fair, GPT-4 also fails the same tests spectacularly. Also, exposing zero calendar entities to the assistant results in hallucinations, when it should result in "I have no information about that".
-
Another currently missing skill / future capability to train on: being able to report the currently selected item and set the correct item for an input_select. Home-3B appears to be able to use the input_select:next service call only intermittently, maybe 10% of the time, and I have had 0% success picking a specific option. Additionally, the model remembers setting the input_select to something (even though the service call actually failed), then will sometimes incorrectly report that as the value of that item in future query/response pairs. A better behavior for this, and really for all interactions, would be to recheck the current value of the entity after the model thinks it has accomplished something ("check its own work before claiming it's done"), but I have no idea if the conversation pipeline even allows for this...?
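A rough sketch of what that "check its own work" step might look like, in case it helps the discussion. This is only an illustration under assumptions: `call_service` and `get_state` are hypothetical callables standing in for whatever the integration really uses to reach Home Assistant, and `select_and_verify` is a made-up helper name, not something from this project.

```python
from typing import Callable, Optional


def select_and_verify(
    entity_id: str,
    option: str,
    call_service: Callable[[str, str, dict], None],  # hypothetical hook
    get_state: Callable[[str], Optional[str]],       # hypothetical hook
) -> str:
    """Request an option change, then re-read the entity before reporting."""
    # Ask for the change; how the real pipeline dispatches this is an assumption.
    call_service(
        "input_select",
        "select_option",
        {"entity_id": entity_id, "option": option},
    )

    # Re-read the state instead of trusting that the call worked.
    actual = get_state(entity_id)
    if actual == option:
        return f"{entity_id} is now set to {option}."
    return f"I tried to set {entity_id} to {option}, but it is still {actual}."
```

The point is just that the reply is built from the re-read state rather than from what the model believes it did, so a failed call cannot turn into a false "done" or a stale value in later turns.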
-
I was playing with llama3:instruct recently, and I have to say I am very impressed with the 8b model.
My custom system prompt:
-
Hey, please use this area to discuss future capabilities for the model, to avoid creating a ton of noise in the issues section.