[Task]: Spotting commands in the stream from coding assistants like cline #844

therealnb · 2025-01-30T10:46:37Z

Description

We have done some work to spot suspicious commands in #34. The task here is to write this code into codegate. This involves

Creating the model from the code in https://github.com/stacklok/research/blob/command-detection/command_detection/command_models.ipynb. The should result in a function which returns good or bad when fed a command.
In a platform neutral way (cline, copilot edits, etc) spot when a command is returned and categorise it and log this if the command is bad.

Extensions for the future

Have more than two categories - e.g. safe, risky, and block
Block commands in the 'block' category
Have the block behaviour configurable
Have more options around context - e.g. files and dirs that are writable
Have the NN learn from feedback from the user (i.e. retrain the NN from feedback in the codegate UI)

We will probably have to intercept the commands at

snippets = extract_snippets(current_content)

and write the comment back at

async def _snippet_comment(self, snippet: CodeSnippet, context: PipelineContext) -> str:

As a baseline we decided to use the hybrid-all-MiniLM-L6-v2 with post-processing by a small ANN. We didn't want the extra cost of codebert, but the local ANN seems to produce some benefit.

Additional Context

We need to decide which model to use for the embeddings. all-minilm-L6-v2 works well, especially with a post ANN process step. It is already in codegate, so we get it for free. microsoft/codebert-base works better as expected, but at a cost of 476 MB.
The ANNs are much smaller
ls -lh | grep hybrid
-rw-r--r-- 1 nigel staff 228K 29 Jan 18:21 hybrid-all-MiniLM-L6-v2.model
-rw-r--r-- 1 nigel staff 420K 29 Jan 18:21 hybrid-microsoft-codebert-base.model

The text was updated successfully, but these errors were encountered:

therealnb · 2025-02-04T15:51:15Z

Initial implementation inhttps://github.com//pull/917

Note that:

There has been little optimisation of the ANN
We need to work on getting better data
Performance may not be good enough - testing required

Accuracy: 0.88
Precision: 0.8823529411764706
Recall: 0.7894736842105263
F1 Score: 0.8333333333333333

therealnb · 2025-02-05T12:50:47Z

This was reverted in #930.

It was causing the runner to run out of space. See the slack discussion

therealnb · 2025-02-05T12:57:14Z

Another PR created here to fix the build space problem #931

therealnb · 2025-02-12T14:35:35Z

This should be closed by #931

therealnb · 2025-03-04T11:28:15Z

Reopened this (again) and disabled the suspicious commands (in #1204). We need more context restriction on where this is run.

therealnb · 2025-03-04T12:42:20Z

On discussion we need this merged first https://github.com/jhrozek/codegate-open/blob/51dfd5e50f50e2a9b5deb61afcc52297872520bc/src/codegate/pipeline/functions/output.py#L53 (and possibly gather some of the tool semantics)

We will reconvene when that is done. Added @jhrozek as an assignee to flag when this is ready.

We also need to take a look at the top N MCP servers, to see what tool parameters they support.

We need to support 'experimental' flags - this is not specific to this case, but this would allow curious folk to switch features on and off.

This does not block the accuracy work, which can proceed in parallel.

CC @lukehinds , @poppysec , @blkt

github-actions bot added the needs-triage label Jan 30, 2025

lukehinds removed the needs-triage label Jan 30, 2025

therealnb self-assigned this Jan 31, 2025

therealnb mentioned this issue Feb 4, 2025

Initial suspicious commands #917

Merged

therealnb closed this as completed in #917 Feb 5, 2025

therealnb reopened this Feb 5, 2025

github-actions bot added the needs-triage label Feb 5, 2025

therealnb removed the needs-triage label Feb 5, 2025

therealnb closed this as completed Feb 12, 2025

therealnb reopened this Feb 20, 2025

therealnb closed this as completed Feb 20, 2025

github-actions bot added the needs-triage label Feb 20, 2025

therealnb reopened this Mar 4, 2025

therealnb assigned jhrozek Mar 4, 2025

lukehinds removed the needs-triage label Mar 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Task]: Spotting commands in the stream from coding assistants like cline #844

[Task]: Spotting commands in the stream from coding assistants like cline #844

therealnb commented Jan 30, 2025 •

edited

Loading

therealnb commented Feb 4, 2025

therealnb commented Feb 5, 2025

therealnb commented Feb 5, 2025

therealnb commented Feb 12, 2025

therealnb commented Mar 4, 2025

therealnb commented Mar 4, 2025

[Task]: Spotting commands in the stream from coding assistants like cline #844

[Task]: Spotting commands in the stream from coding assistants like cline #844

Comments

therealnb commented Jan 30, 2025 • edited Loading

Description

Additional Context

therealnb commented Feb 4, 2025

therealnb commented Feb 5, 2025

therealnb commented Feb 5, 2025

therealnb commented Feb 12, 2025

therealnb commented Mar 4, 2025

therealnb commented Mar 4, 2025

therealnb commented Jan 30, 2025 •

edited

Loading