feat(indexer/server): implement a task buffer to improve efficiency in managing backpressure issues #519

FrankLi123 · 2024-09-03T18:17:34Z

Summary

Implement an adaptive "Task Buffer" that efficiently manages task flow between the producer (data-source) and the consumer (worker).

Task Buffer
- Uses a Go slice to effectively manage tasks.
- Uses a mutex and conditional variables to coordinate interactions with the task buffer between goroutines ; producer goroutines pause when the buffer is full, while consumer goroutines wait when the buffer is empty.
- The buffer is configured to handle up to 1,000 tasks by default.
Performance
- the Task Buffer Increases performance by allowing producers to add tasks into the buffer queue while the buffer is not full, reducing the producer blocking time.
- use of mutex and conditional variable may increase complexity and lead to more time spent in context switching of Task Buffer.

Checklist

The commit message follows Angular Contributing guidelines;
Tests for the changes have been added (for bug fixes / features);

Does this PR introduce a breaking change?

Yes
No

Other information

…ta transmission

brucexc · 2024-09-03T22:13:24Z

internal/engine/task_buffer.go

+)
+
+// TaskBuffer represents a fixed-size buffer, it operates as a FIFO buffer
+type TaskBuffer struct {


Suggested change

type TaskBuffer struct {

This is a new data structure, please add sufficient test cases.

Comprehensive test cases have been added in task_buffer_test.go under directory/engine

brucexc · 2024-09-03T22:18:04Z

internal/engine/task_buffer.go

+	}
+
+	task := sw.tasks[0]
+	sw.tasks = sw.tasks[1:]


Suggested change

sw.tasks = sw.tasks[1:]

The slice operation sw.tasks = sw.tasks[1:] changes the starting position of the slice, but the underlying array remains unchanged. If this operation is performed frequently (e.g., frequent task retrieval), the underlying array of the slice may grow indefinitely without releasing the used memory, potentially causing a memory leak.

So i think adding sw.tasks[0] = nil would be safer.

I agree with your point, and I have made the change to address it.

brucexc · 2024-09-03T22:34:33Z

internal/node/indexer/server.go

+				return fmt.Errorf("an error occurred in the source: %w", err)
+			}
+
+			return nil


Suggested change

return nil

The return nil statement immediately after the if err != nil check is unnecessary because if err != nil is true, the function will return early.

brucexc · 2024-09-03T22:37:24Z

internal/node/indexer/server.go

-				if err := s.handleTasks(ctx, tasks); err != nil {
-					return fmt.Errorf("handle tasks: %w", err)
+				if err := s.handleTasks(ctx, task); err != nil {
+					errorChan <- fmt.Errorf("handle tasks error: %w", err)


Suggested change

errorChan <- fmt.Errorf("handle tasks error: %w", err)

I think the function retryableFunc should return the error instead of sending it to the errorChan. This approach can make the retry mechanism clearer and more straightforward

…nncessary operations on error channel in Run()

FrankLi123 added 2 commits September 4, 2024 02:02

feat: Implement and apply the task buffer to improve efficiency of da…

19fc82a

…ta transmission

fix: apply TaskBuffer for the ethereum task case

fe1b42c

FrankLi123 requested review from polebug and pseudoyu as code owners September 3, 2024 18:17

FrankLi123 changed the title ~~feat(indexer/server): implement task buffer to improve efficency during Backpressure issues~~ feat(indexer/server): implement a task buffer to improve efficiency in managing backpressure issues Sep 3, 2024

fix: fix typo in comments

afd91cf

FrankLi123 linked an issue Sep 3, 2024 that may be closed by this pull request

Resolve Backpressure Issues about the Task Channel in Node #515

Open

1 task

FrankLi123 self-assigned this Sep 3, 2024

FrankLi123 requested a review from brucexc September 3, 2024 18:26

brucexc requested changes Sep 3, 2024

View reviewed changes

FrankLi123 added 6 commits September 4, 2024 22:05

fix: modified the slice operation within the task buffer and remove u…

0352334

…nncessary operations on error channel in Run()

feat: implement test cases for task buffer

7b4bfde

feat: implement performance test cases for task buffer

dc8f439

fix: reset the test parameter value

d8a387f

fix: fix param value to run the performance test multiple rounds

17a69dc

fix: remove TestTaskBuffer_MemoryLeak

64911e9

FrankLi123 marked this pull request as draft October 18, 2024 10:14

FrankLi123 closed this Nov 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(indexer/server): implement a task buffer to improve efficiency in managing backpressure issues #519

feat(indexer/server): implement a task buffer to improve efficiency in managing backpressure issues #519

FrankLi123 commented Sep 3, 2024 •

edited by brucexc

Loading

brucexc Sep 3, 2024

FrankLi123 Sep 5, 2024

brucexc Sep 3, 2024

FrankLi123 Sep 5, 2024

brucexc Sep 3, 2024

FrankLi123 Sep 5, 2024

brucexc Sep 3, 2024

FrankLi123 Sep 5, 2024

	type TaskBuffer struct {
	This is a new data structure, please add sufficient test cases.

	sw.tasks = sw.tasks[1:]
	The slice operation sw.tasks = sw.tasks[1:] changes the starting position of the slice, but the underlying array remains unchanged. If this operation is performed frequently (e.g., frequent task retrieval), the underlying array of the slice may grow indefinitely without releasing the used memory, potentially causing a memory leak.
	So i think adding sw.tasks[0] = nil would be safer.

	return nil
	The return nil statement immediately after the if err != nil check is unnecessary because if err != nil is true, the function will return early.

	errorChan <- fmt.Errorf("handle tasks error: %w", err)
	I think the function retryableFunc should return the error instead of sending it to the errorChan. This approach can make the retry mechanism clearer and more straightforward

feat(indexer/server): implement a task buffer to improve efficiency in managing backpressure issues #519

feat(indexer/server): implement a task buffer to improve efficiency in managing backpressure issues #519

Conversation

FrankLi123 commented Sep 3, 2024 • edited by brucexc Loading

Summary

Checklist

Does this PR introduce a breaking change?

Other information

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

FrankLi123 commented Sep 3, 2024 •

edited by brucexc

Loading