feat(core): llm integration for cli #236

andrewrust-virtru · 2024-07-23T19:26:10Z

SEE FULL STATUS on OpenTDF Issue #455 and/or otdfctl issue #12

This PR adds support for using a local LLM (e.g., via ollama) as a troubleshooting aid for DSP and otdfctl, by injecting relevant documentation into the prompt.

Architecture Overview

System Requirements:

Local-first LLM
CLI access, simple TUI interface
Basic stats on request time & tokens
Exception handling & error processing (no hanging prompts)
~~Documentation retrieval and injection~~
Wrapping prompts with sanitization ~~and relevant docs~~

Future Wishlist

Perhaps link to relevant docs via a RAG implementation
Instead of ollama/gemma/etc., use a personalized LLM fine-tuned on Virtru's non-sensitive data (all agnostic code and docs)

Open Bugs/Issues:

Bad UI when waiting for initial response
No graceful exit if the model is not yet running on the device
Need to flesh out sanitization/entry prompt

andrewrust-virtru · 2024-07-25T15:46:21Z

Thinking in the open about strategy for documentation injection: Could use 'fuzzy-finder'-esque wrapper to determine concepts or words that are relevant to a user's query. Might have to start with dict of common associations. Ex:

['DSP', 'Data Security Platform' ...] --> ['$def_of_dsp', $opt_link_to_readme_section]
['PEP', 'Policy Enforcement Points' ...] --> ['$def_of_peps', $opt_link_to_readme_section]
...

This, along with the basic sanitization wrapping in sanitizer.go, would be the basis for the entire prompt. An open item is adjusting parameters based on the model's token_lens/context_size when weighing speed tradeoffs... default is 2K, 8K IIUC.
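
The association-dict idea above could be sketched roughly as follows. This is a simplified stand-in under stated assumptions: it uses exact whole-word matching rather than true fuzzy matching, a character budget instead of a real token count, and placeholder definitions. `docEntry`, `RelevantDocs`, and `BuildPrompt` are hypothetical names.

```go
package main

import (
	"fmt"
	"strings"
	"unicode"
)

// docEntry pairs a set of aliases with a definition and an optional README link.
// The values below are placeholders, not real documentation text.
type docEntry struct {
	Aliases    []string
	Definition string
	Link       string // optional
}

var associations = []docEntry{
	{Aliases: []string{"DSP", "Data Security Platform"}, Definition: "$def_of_dsp", Link: "$opt_link_to_readme_section"},
	{Aliases: []string{"PEP", "Policy Enforcement Points"}, Definition: "$def_of_peps", Link: "$opt_link_to_readme_section"},
}

// normalize lowercases s, drops punctuation, and pads with spaces so that
// substring checks only match whole words or whole phrases.
func normalize(s string) string {
	words := strings.FieldsFunc(strings.ToLower(s), func(r rune) bool {
		return !unicode.IsLetter(r) && !unicode.IsDigit(r)
	})
	return " " + strings.Join(words, " ") + " "
}

// RelevantDocs returns every entry with at least one alias present in the query.
func RelevantDocs(query string) []docEntry {
	q := normalize(query)
	var hits []docEntry
	for _, e := range associations {
		for _, a := range e.Aliases {
			if strings.Contains(q, normalize(a)) {
				hits = append(hits, e)
				break
			}
		}
	}
	return hits
}

// BuildPrompt prepends matched definitions to the query, skipping any that would
// push the injected context past maxContextChars (a crude proxy for a token budget).
func BuildPrompt(query string, maxContextChars int) string {
	var b strings.Builder
	used := 0
	for _, e := range RelevantDocs(query) {
		line := e.Aliases[0] + ": " + e.Definition + "\n"
		if used+len(line) > maxContextChars {
			continue
		}
		b.WriteString(line)
		used += len(line)
	}
	if b.Len() > 0 {
		b.WriteString("\n")
	}
	b.WriteString(query)
	return b.String()
}

func main() {
	fmt.Println(BuildPrompt("How do PEP and the Data Security Platform relate?", 200))
}
```

Tying the budget to the model's actual context size would need the model's tokenizer; the character count here only shows where that check would sit.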

jrschumacher · 2024-07-25T15:59:52Z

@andrewrust-virtru when considering reducing the context, have you looked at prompt engineering with llama2? https://medium.com/@eboraks/llama-2-prompt-engineering-extracting-information-from-articles-examples-45158ff9bd23

For instance, if we prepend a <<SYS>>...<</SYS>> block to the user's prompt, we can constrain the response to what we want.

<s>[INST] <<SYS>>
You are a helpful, respectful and honest assistant. Always answer as helpfully as possible, while being safe.  
Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. 
Please ensure that your responses are socially unbiased and positive in nature.
If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. 
If you don't know the answer to a question, please don't share false information.
<</SYS>>

Tell me how to choose a favorite color [/INST]
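
A wrapper for the Llama 2 format above might look like this in Go. It is a sketch only: `WrapLlama2` is a hypothetical helper, and the system text in `main` is illustrative rather than the prompt used in this PR.

```go
package main

import (
	"fmt"
	"strings"
)

// WrapLlama2 builds a single-turn Llama 2 chat prompt: the system text goes
// inside <<SYS>> ... <</SYS>>, followed by the user's message, all within one
// [INST] ... [/INST] pair.
func WrapLlama2(system, user string) string {
	return "<s>[INST] <<SYS>>\n" + strings.TrimSpace(system) + "\n<</SYS>>\n\n" +
		strings.TrimSpace(user) + " [/INST]"
}

func main() {
	// Illustrative system text; the real restrictions would live with the sanitizer.
	system := "Only answer questions about OpenTDF and otdfctl. If you don't know the answer, say so."
	fmt.Println(WrapLlama2(system, "How do I create an attribute?"))
}
```

Some runtimes apply the model's chat template themselves, in which case passing the system text separately may be enough; whether that applies here depends on the runtime and model and is not confirmed in this thread.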

andrewrust-virtru · 2024-08-06T23:07:55Z

UPDATE(08/06/24): https://github.com/virtru-corp/data-security-platform/issues/455#issuecomment-2272393715

In preparation for the intern presentation on Thursday, I'm ending 'new feature' commits; from here on, only the bug fixes needed for a working demo. Updating documentation for OpenTDF issue #455 on base issue for otdfctl: issue #12.

Related to archived Atlassian goal and abandoned otdfctl branch #40

andrewrust-virtru added 8 commits July 23, 2024 15:22

setting foundation for entry for llm integration

fba04b6

quick pass on structure of llm intake code in go with dynamic models

7f69cae

chat TUI starts and exits, but model is not accessible

c31f55c

removing old code comments

9d2487c

llama3 model works using 'chat' cmd

00f3453

cleaned up code function declarations

1c58ba9

added basic token and time tracker per req

8c55341

cleaning up unused go modules

dd7628b

added sanitization wrapper to input prompt

d7defda

andrewrust-virtru added 2 commits July 25, 2024 12:12

slightly more nuanced and complete intro prompt

2ab33dd

renamed to llm_sanitizer.go and prompt

2d0e94c

jrschumacher linked an issue Jul 25, 2024 that may be closed by this pull request

Spike/PoC utilizing local LLM to enable better understanding OpenTDF #239

Open

andrewrust-virtru added 16 commits July 30, 2024 07:06

added loading animation for better ui during idle periods

67ca351

moved model configs to chat_config.json file temporarily

42df4fa

temporarily removed chat_config.json out of .gitignore

e739fac

moved chat files into dedicated directory, fixing imports

22e4598

chat commands now moved to /pkg/chat

fbab4aa

removed bad chatCmd import and managed chat via cmd/chat.go

a622c19

adding log/ directory to gitignore

f3a3479

changed configurations from JSON to YAML

4e21627

added more exit criteria and graceful endings

a918ed5

minor formatting changes to yaml and example file linking

b851348

graceful and verbose entry now working

f801a58

added token limit to configurations

c2a19ec

added verbosity to control output

2887003

added time before first token statistics

d66c5f6

moved statistics code into performance.go file

e443324

added --ask invocation capability

607d751

andrewrust-virtru added 12 commits August 4, 2024 00:55

cleaning up and adding comments

166a517

updated prompts.go to ref_questions

a5682e0

updated Q&As

6935fac

added GPU toggle, perhaps already using GPU? not sure

a24d1f6

keyword extractor running synchronously with full LLM call

6a3b1d0

adjusted verbosity and changed all functions to PascalCase

a1810bb

ui animation fix and default config loading fix

dcd0631

fixing bug with default config file

1636542

re-simplified config file getting

1b7bc91

minor UI improvement

719ce77

updated useGPU to num_GPU per ollama specs

9c375b8

switched config file to dev example file to pass unit tests

4285e34

andrewrust-virtru marked this pull request as ready for review August 7, 2024 00:28

andrewrust-virtru requested a review from a team as a code owner August 7, 2024 00:28

jrschumacher marked this pull request as draft September 6, 2024 18:01
