Partial context updates #93

pseusys · 2023-03-13T21:31:20Z

Description

Context storages are updated partially now instead of reading and writing whole data at once.

Checklist

I have covered the code with tests
I have added comments to my code to help others understand it
I have updated the documentation to reflect the changes
I have performed a self-review of the changes
Consider extending UpdateScheme from BaseModel
Decide how we want to use clear method.

dff/context_storages/database.py

dff/context_storages/update_scheme.py

dff/context_storages/json.py

dff/context_storages/update_scheme.py

dff/context_storages/mongo.py

chatsky/core/ctx_dict.py

RLKRo · 2025-01-30T19:45:40Z

chatsky/context_storages/sql.py

+        async with self.engine.begin() as conn:
+            await conn.execute(update_stmt)


It'd be better to define a method for saving context in DBContextStorage (i.e. ContextStorage method that accepts Context and saves it).

That way we can send all changes in one commit because there'd be access to engine from ContextStorage.

Right now we basically do

COMMIT main_info COMMIT labels_added_modified COMMIT labels_removed COMMIT requests_added_modified COMMIT requests_removed

Instead of a single

COMMIT main_info, labels_added_modified, labels_removed, requests_added_modified, requests_removed

Why this might be relevant: in case one of the methods fails we don't want others to succeed (this will lead to a broken context). And we can also have one place where we can process the errors.

I think we should improve db context storage api as a patch after release.

Yes, we could. However, I am not sure that even for SQL databases executing many operations with one "connection" would protect us from breaking context structure (i.e. I don't think that writes will become atomic), however we can try.

RLKRo · 2025-02-06T15:33:38Z

chatsky/core/ctx_dict.py

+            del self._items[key]
+
+    def __iter__(self) -> Sequence[int]:
+        return iter(self.keys() if self._storage is not None else self._items.keys())


Why not yield from self.keys()?
I.e.:

Why not use self.keys() regardless of the _storage?

Why not use yield from?

I just forgot yield from construction exists at all, it is not widely used...

RLKRo · 2025-02-06T17:04:23Z

chatsky/core/context.py

        node = self.framework_data.current_node
        if node is None:
            raise ContextError("Current node is not set.")
        return node
+
+    async def turns(self, key: Union[int, slice]) -> Iterable[Tuple[AbsoluteNodeLabel, Message, Message]]:


Make turns a property that returns an instance of a Turns class (needs to be defined) with async __getitem__.
So that the usage of turns is await ctx.turns[1:3] instead of await ctx.turns(slice(1, 3)).

Also, make it so that turns[1] is a single tuple while turns[1:2] is an iterable of tuples.

RLKRo · 2025-02-10T16:18:19Z

chatsky/core/ctx_dict.py

+        if isinstance(key, slice):
+            return [self._items[k] for k in self.keys()[key]]
+        else:
+            return self._items[key]


What I don't like about the way slices are processed now is that single key refers to a specific turn while a slice refers to a range of available turns.
So if there are only turns 1, 3, 5 available in ctx_dict, ctx_dict[1:2] == ctx_dict[3], ctx_dict[5], which is very confusing.

I think that there's little use case to getting ctx_dict items by their position in the keys.
So my proposal is:
Allow only

ctx_dict[non-neg-int] -- return element on turn non-neg-int (turn number is a non-negative integer);

ctx_dict[non-neg-int1:non-neg-int2:non-neg-int3] -- return elements starting from turn non-neg-int1 up to non-neg-int2 with a step of non-neg-int3.

(just in case: this proposal is for all the methods)

RLKRo · 2025-02-10T16:23:07Z

chatsky/core/ctx_dict.py

+        """
+
+        try:
+            return await self[key]


This will always fail if key is Iterable.
This should be rewritten as

await self.load_items(keys if isinstance(keys, Iterable) else [keys]) if isinstance(key, Iterable): return [self._items.get(k, default) for k in key] else: return self._items.get(k, default)

Tests for using get with iterable key should be added.

Fair enough

pseusys self-assigned this Mar 13, 2023

pseusys requested review from kudep and RLKRo April 7, 2023 01:43

pseusys added the enhancement New feature or request label Apr 7, 2023

pseusys marked this pull request as ready for review April 7, 2023 01:43

kudep marked this pull request as draft April 24, 2023 16:41

kudep requested changes May 10, 2023

View reviewed changes

dff/context_storages/database.py Outdated Show resolved Hide resolved

dff/context_storages/update_scheme.py Outdated Show resolved Hide resolved

kudep requested changes May 10, 2023

View reviewed changes

This comment was marked as outdated.

Sign in to view

pseusys requested review from ruthenian8, kudep and RLKRo June 14, 2023 23:36

ruthenian8 reviewed Jun 15, 2023

View reviewed changes

dff/context_storages/mongo.py Outdated Show resolved Hide resolved

ruthenian8 reviewed Jun 15, 2023

View reviewed changes

dff/context_storages/mongo.py Outdated Show resolved Hide resolved

This comment was marked as outdated.

Sign in to view

pseusys added 5 commits July 19, 2023 05:45

file-based

39d0da7

with_stem removed

1ca66ed

ydb ??? again??

9bb3eb7

len and prune

a8c6497

redis delete number of args changed

2bbf6e4

This comment was marked as outdated.

Sign in to view

pseusys added 6 commits July 21, 2023 06:32

Update community.rst, revert some changes

7aefa5b

one line reverted

6fa0542

double serialization removed

fa9359f

no_dependencies_tests_fixed

9fdf5bd

serializer changed

c70157a

serializer unchanged (example)

05f0d94

RLKRo mentioned this pull request Jul 26, 2023

pydantic2.0 #168

Merged

4 tasks

partial tutorials started

95ba296

lint fixed

0cfebbe

RLKRo reviewed Jan 30, 2025

View reviewed changes

chatsky/core/ctx_dict.py Show resolved Hide resolved

RLKRo reviewed Jan 30, 2025

View reviewed changes

pseusys and others added 3 commits January 31, 2025 04:39

coverage increased further

e01e739

context guide tutorial update prototype

0ef6d97

formatting fixed

10ae3b1

RLKRo reviewed Feb 6, 2025

View reviewed changes

context storing API updated

6ae396a

RLKRo reviewed Feb 10, 2025

View reviewed changes

pseusys added 21 commits February 10, 2025 22:38

legacy API removed from tests

c67dd72

context updating fixture updated

817cb67

old API remains removed

1e529b7

some more tests updated

502bb15

field validation returned

e110c06

field name fixed

7baf4a9

sql length updated

68b2d1e

mongo session introduced

aa02ef4

motor asyncio transaction

25675be

mongo and postgres updated AGAIN

10619a4

mongo transacrion made optional

50b0ce5

import fixed

0820450

few more tests fixed

87ec2a0

one more small redis update

3646271

SQL column binary defaults added

191303b

empty SQL queries not sent

25a0b7d

different field updates collapsed

1a6efa7

assignment changed

f4eb249

lint applied

9677882

sql updates separated

ff6661c

docs updated and type updated

9148970

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Partial context updates #93

Partial context updates #93

pseusys commented Mar 13, 2023 •

edited

Loading

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

RLKRo Jan 30, 2025

pseusys Jan 30, 2025

RLKRo Feb 6, 2025 •

edited

Loading

pseusys Feb 10, 2025

RLKRo Feb 6, 2025 •

edited

Loading

RLKRo Feb 10, 2025

pseusys Feb 10, 2025

RLKRo Feb 10, 2025

pseusys Feb 10, 2025

		async with self.engine.begin() as conn:
		await conn.execute(update_stmt)

Partial context updates #93

Are you sure you want to change the base?

Partial context updates #93

Conversation

pseusys commented Mar 13, 2023 • edited Loading

Description

Checklist

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

RLKRo Jan 30, 2025

Choose a reason for hiding this comment

pseusys Jan 30, 2025

Choose a reason for hiding this comment

RLKRo Feb 6, 2025 • edited Loading

Choose a reason for hiding this comment

pseusys Feb 10, 2025

Choose a reason for hiding this comment

RLKRo Feb 6, 2025 • edited Loading

Choose a reason for hiding this comment

RLKRo Feb 10, 2025

Choose a reason for hiding this comment

pseusys Feb 10, 2025

Choose a reason for hiding this comment

RLKRo Feb 10, 2025

Choose a reason for hiding this comment

pseusys Feb 10, 2025

Choose a reason for hiding this comment

pseusys commented Mar 13, 2023 •

edited

Loading

RLKRo Feb 6, 2025 •

edited

Loading

RLKRo Feb 6, 2025 •

edited

Loading