You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In #317 , pipeline logs are not changed and are stored at the supervisor.
We would like to implement proper per-pipeline logging, as well as ftp to send logs from supervisor to client
The text was updated successfully, but these errors were encountered:
This PR implements the modyn client part of #317.
Summary: One client instance starts one pipeline and polls for pipeline
status until exit.
1. Add a modynclient/ on the same level as modyn/. modynclient/ imports
grpc code from modyn/.
2. Add GetPipelineStatus message to supervisor grpc proto.
3. Add IPC queues to store status messages from PipelineExecutors.
Supervisor keeps three queues per pipeline: pipeline_status,
training_status, evaluation_status. Currently, status messages are added
sequentially. Supervisor keeps those queues separate as a preparation
for overlapping training and evaluation in future optimizations. Upon
one GetPipelineStatusRequest, Supervisor sends back all the status
messages in the queues of the requesting pipeline.
4. Add TrainingStatusReporter and EvaluationStatusReporter to
supervisor. Remove pipeline monitor from supervisor.
5. Move TrainingStatusTracker and EvaluationStatusTracker to client.
Implement enlighten progress counter and status bar at client.
6. Update the Dummy model: run one nn.Linear layer instead of
nn.Identity so that the optimizer (CrossEntropyLoss) accepts the model
output. Input non-batched tensor shape: (1,2).
7. Update supervisor and client integration tests. Both supervisor and
client tests run the same Dummy model pipeline, using a tiny dataset
instead of the images dataset. In client test, run the pipeline through
client entrypoint.
8. Update python to >= 3.11
TODOs
1. Implement ftp in a separate PR in the future. #342
2. Implement tensorboard in a separate PR in the future. #339
3. Rethink GRPCHandler wait_for_training_completion.
In #317 , pipeline logs are not changed and are stored at the supervisor.
We would like to implement proper per-pipeline logging, as well as ftp to send logs from supervisor to client
The text was updated successfully, but these errors were encountered: