
Post-Mortem: Changes to Prepare for Next Week #126

Open
mdr223 opened this issue Oct 26, 2023 · 4 comments · Fixed by #141
mdr223 (Collaborator) commented Oct 26, 2023

Summarizing the steps to begin hardening our system for next week. Please add anything I may have missed from our discussion.

  1. Merge PR Feature/simplify services #119 to move the Data Manager into the chat & cleo services
  2. Merge the PR adding archi code to sources in meta archi (#109) to finish the updates to A2rchi Meta
  3. Create a PR to move the lock off of guarding the OpenAI calls so that it only guards update(s) to the vector store
  4. Merge that PR to main
  5. Write a simple script with a for loop that submits N API calls to our Flask app, then measure and plot a histogram of latencies for N = [10, 100, 1000, 10000]
  • Call A2rchi at t3desk19.mit.edu:7683 ~5 times using the PSET 7 question to get a sense of the latency distribution
  • Simulate N calls to the Flask app by deploying the DumbLLM in dev but on t3desk19, with time.sleep(np.random.normal(mean, std)) where mean and std are guesstimates of the latency parameters based on the calls to t3desk19.mit.edu
    • Running the experiment on t3desk is important because the parallelism will differ from submit06
  • Record and store the results
  6. Depending on the performance results from 5., we may have to consider load-balancing with multiple containers
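A minimal sketch of the step-5 benchmark script. The `fake_call` stub, its mean/std, and the loop bounds are placeholders, not measured values; in the real run, `fake_call` would be replaced by a POST to the Flask app:

```python
import random
import statistics
import time

def measure_latencies(call_fn, n):
    """Submit n sequential calls and return a list of per-call latencies (seconds)."""
    latencies = []
    for _ in range(n):
        start = time.perf_counter()
        call_fn()
        latencies.append(time.perf_counter() - start)
    return latencies

def fake_call():
    # Stand-in for the DumbLLM: sleep for a normally distributed delay, as the
    # issue suggests with time.sleep(np.random.normal(mean, std)). The mean/std
    # here are guesses; clamp at zero so a sampled negative delay doesn't raise.
    time.sleep(max(0.0, random.gauss(0.002, 0.0005)))

for n in [10, 100]:  # extend to 1000, 10000 for the real experiment
    lat = measure_latencies(fake_call, n)
    print(f"N={n}: mean={statistics.mean(lat):.4f}s max={max(lat):.4f}s")
```

For the histogram, the recorded latency lists can be fed to e.g. matplotlib's `plt.hist`; keeping the raw lists also covers the "record and store the results" step.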

And separately, but also importantly:
  7. Remove raise e from inside our main try-except block in ChatWrapper
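A hypothetical sketch of why step 7 matters (ChatWrapper's real code differs; `handle_message` and the fallback text are invented for illustration): re-raising inside the except block defeats the handler, so the exception still propagates and can take down the request.

```python
def handle_message_bad(llm_call, msg):
    try:
        return llm_call(msg)
    except Exception as e:
        print(f"chain failed: {e}")
        raise e  # step 7: removing this line lets the app degrade gracefully

def handle_message(llm_call, msg):
    try:
        return llm_call(msg)
    except Exception as e:
        # Log and return a fallback response instead of propagating the error.
        print(f"chain failed: {e}")
        return "Sorry, something went wrong. Please try again."
```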

julius-heitkoetter (Collaborator)

One edit here: Ludo raised an issue (#115) that the DumbLLM is no longer compatible. We need to resolve that issue with a PR before we get to step 5.

mdr223 (Collaborator, Author) commented Oct 26, 2023

One more addition we will likely want is merging in my work on the db backend once it's ready. Without it we would also need to lock the conversations.json file, which is plausible but could hurt performance.
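If the conversations.json lock does become necessary before the db backend lands, a minimal sketch (the function name and record shape are assumptions) could serialize writers with a process-local threading.Lock:

```python
import json
import threading
from pathlib import Path

_conversations_lock = threading.Lock()

def append_conversation(path, record):
    # Hold the lock across the whole read-modify-write so concurrent Flask
    # worker threads cannot interleave and corrupt the JSON file.
    with _conversations_lock:
        p = Path(path)
        data = json.loads(p.read_text()) if p.exists() else []
        data.append(record)
        p.write_text(json.dumps(data))
```

Note that a threading.Lock only protects threads within one process; multiple containers (step 6) would need file- or db-level locking, which is part of why the db backend is preferable.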

ludomori99 (Collaborator)

I believe we also discussed introducing timeouts in the requests in the app. Not sure if we want to open a separate issue for that.

mdr223 (Collaborator, Author) commented Nov 7, 2023

With the new PR for request timeouts (#141), I believe we can close this issue once that PR is merged.

mdr223 linked a pull request Nov 7, 2023 that will close this issue