-
-
Notifications
You must be signed in to change notification settings - Fork 274
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support intermediate pipe handlers to transform output from one command into input for another #206
Comments
Pretty much, since the underlying stdin/stdout/stderr streams are, in fact, streams. The pipe abstraction is just an inversion of the stream abstraction – where you normally read from a stream, you write to the pipe, and vice versa. Now, in theory, your suggestion can be implemented without buffering the entire input, but by a process of:
We already do 1 and 2 for existing string-based pipe targets. The rest, however, would require additional abstractions in addition to Perhaps an easier solution would be to implement an |
Although pulling in Rx is easier in the sense of requiring less code, it sure sounds like a whole bunch of overhead to overcome what is essentially the inability (or perhaps more accurately unwillingness, in design terms) of CliWrap to customize the copy loop from stream to stream. Using Rx to implement what is essentially a fully synchronous buffer (with hidden characteristics) is probably one of the worst ways to use it (although this is admittedly just me speaking off the cuff without having measured the actual performance impact, which is likely still dwarfed by the IPC itself). A stream abstraction over a delegate is still overhead, but at least the overhead there is easier to reduce to a minimum. Even so it feels clunky that it would be necessary at all, although I do understand the alternative would probably involve revising a bunch of internal plumbing, which isn't necessarily worth it either. I'll have to think about whether this is worth looking into further or if I'm better off just doing my own little wrapping, since the rest of my scenario isn't very exciting. |
It would help if you could also provide some code samples that illustrate the usage scenario more in-depth |
Simple enough, let me show you what I'm actually doing (well, more or less): while (inputProcess.StandardOutput.ReadLine() is string line) {
JsonDocument document = JsonDocument.Parse(line);
using (Utf8JsonWriter writer = new(outputProcess.StandardInput.BaseStream)) {
// Do a bunch of calls on `writer` using what's in `jsonDocument`.
// We only make forward motion here, no buffering
}
outputProcess.StandardOutput.WriteLine();
} The additional complication here is that I also need to read the This is all a lot of ceremony and boilerplate for what could be It's also notable that no asynchronous code is involved, mainly because for this particular scenario the added overhead isn't worth it -- it just makes the total time slower while we don't need the scalability benefits, if there are any. (Also, as I understand it, there is no portable asynchronous console I/O so you'll usually get async-over-sync anyway.) |
Okay, thanks for the example. Good thing I asked because in your scenario you don't actually need to encode/decode strings since both |
That's true, although the input does need to be split on newlines, which |
Yeah. I definitely think there's merit to the idea in general, just need to make sure not to overshoot with the design. Having some sort of However, if the use case primarily revolves around transforming input/output between two commands, then maybe it's instead more fitting to expand the functionality of |
You mean something like adding an overload that accepts a delegate to transform the output, and/or maybe a separator or splitter function of some sort? That could work. It would support a scenario like |
Yeah, something like that. A stream-to-stream transform at the base level.
Why exactly would that not work? var cmdA = PipeSource.FromCommand(Cli.Wrap("cmd1"), x => ...) | Cli.Wrap("cmd2");
var cmdB = PipeSource.FromCommand(cmdA, x => ...) | Cli.Wrap("cmd3"); |
Right. For some reason I was fixated on |
operator |
on Func<string, string>
(or on-demand input in general)
I'm still unsure how to proceed with this feature, but I'm leaning towards mentally filing this under the "future major version" category. On the surface, the simplest solution (i.e., what I described above) can be implemented like so: public static PipeSource FromCommand(
Command command,
Func<Stream, Stream, CancellationToken, Task> transformAsync) =>
Create(async (destination, cancellationToken) =>
await command
.WithStandardOutputPipe(
PipeTarget.Create(async (source, innerCancellationToken) =>
await transformAsync(source, destination, innerCancellationToken).ConfigureAwait(false))
)
.ExecuteAsync(cancellationToken)
.ConfigureAwait(false)
); This allows the user to manually copy data from the stdout of one command ( However, this is a pretty low-level API and the library should provide some higher level overloads as well, such as those accepting public static PipeSource FromCommand(
Command command,
Func<string, CancellationToken, Task<string>> transformAsync) =>
FromCommand(command, async (source, destination, cancellationToken) =>
{
using var reader = new StreamReader(source);
using var writer = new StreamWriter(destination);
await foreach (var line in reader.ReadAllLinesAsync(cancellationToken))
{
var transformedLine = await transformAsync(line, cancellationToken).ConfigureAwait(false);
await writer.WriteLineAsync(transformedLine).ConfigureAwait(false);
}
}); At this point, the delegates become unwieldy, and we're returning to the original suggestion of implementing some sort of I think it's best that we park this issue until I can gather enough comments from the community to understand how the design should be shaped around this feature. As an interim solution, I recommend using the code snippets I shared above. |
Details
You can write
cmd | Console.WriteLine
(for example) to pipe output to a delegate, andcmd1 | cmd2 | cmd3
to pipe between commands, but you can't writecmd1 | twiddle | cmd2
, withcmd
aFunc<string, string>
, to transform between output and input. Any chance this could be implemented?Syntax niceties aside, I haven't found any easy, clean way of expressing this logic with the existing interfaces either (assuming you don't want to buffer the input in its entirety). While handling output can be done with a delegate, there appears to be no dual for handling input (through an
IEnumerable<string>
or other iterator/observer mechanism). Only stream-based inputs are supported, and while you can write your very own stream type for that purpose, that seems very uncomfortable. As the existing infrastructure seems to be entirely based onStream
, that may also be the reason this isn't supported, I guess. :PSystem.IO.Pipelines
may be of some use there, though it makes my head hurt every time I look at it.Dropping down to
Process
and writing a good old-fashioned loop aroundOutput.ReadLine
andInput.WriteLine
works, of course, but you need to take care of reading the other streams to their end and cleaning up and all those other niggles thatCliWrap
is otherwise neat at handling, so it's not at all cool.The text was updated successfully, but these errors were encountered: