-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Application: Data Processing Workflows on IPFS #58
Comments
Data Processing Workflows on IPFSEssential Use Case
Use CasesConcrete
Groupings:
Other:
Foundational Features + Functionality
Existing Projects + Organizations Working in this AreaThere have been a number of people who have expressed desire to:
|
Read this very interesting architecture: https://www.cse.unsw.edu.au/~hpaik/thesis/showcases/16s2/scott_brisbane.pdf Planning on building the full scale services.. |
Thanks for adding the link to the full report |
Great, we're trying |
@scottybrisbane do you have a public repo with your ipfs/hdfs integration? |
@echarles not just yet, although I am planning to post my work. It's very much a POC, but could be a good starting point for anyone wanting to get something going. I'll update this thread when I post the code. |
Thx @scottybrisbane - POC is very fine. Once published, I expect contributors (like me) to try and let evolve the code. Without pushing you, any ideas on the timeline? (do we speak about days, weeks, months... before having something public?). Btw If you fear uncompleted feature, not-perfect code, no documentation... just push what you have and other will help, that's how opensource works. |
@echarles I'm hoping to have it up within a few weeks. |
Hey Folks, Any update on this topic? @scottybrisbane @echarles |
Hi, can I ask what sort of data processing people want to use IPFS for?
…On Sun, Sep 30, 2018, 05:02 bo-liu ***@***.***> wrote:
Hey Folks, Any update on this topic? @scottybrisbane
<https://github.com/scottybrisbane> @echarles
<https://github.com/echarles>
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<https://github.com/ipfs/ipfs/issues/248#issuecomment-425715959>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/AAAcndBIs-9jQbaw9-nNJW4aHiH7XTLGks5ugLLTgaJpZM4MhHjR>
.
|
Really depends on what kind of data(or which kind of data you are interested in) stored inside IPFS. |
Right, I'm looking to help make sure IPFS is a good fit for the kind of data processing people want to do. Getting specific examples helps me ensure we're putting effort in the right places. |
@scottybrisbane Your work is really interesting! The use case is fascinating. |
KIP team(@KIPFoundation) is working on one of the reference implementation that may align with Scotty's work. Look forward to adding HDFS support together! |
@scottybrisbane Great work! Are you going to publish your code? I would really like to make use of it in my own thesis, so if you need some help, just hit me up :) |
Note: Discussion on applications of IPFS are happening over in the IPFS Forums now ... please continue the discussion there! This issue is being moved over to the archived repo https://github.com/ipfs/apps/ for reference. |
Work in progress - please contribute. See #40.
Essential Use Case: when running data processing/analysis workflows, use IPFS as the storage layer. This allows your workflows to be agnostic about where the data are stored -- pulling all the source data onto a local node before running a workflow is an optimisation choice that can be done on the fly with zero impact on the code. Likewise, the results of the workflows can be written to IPFS and moved around as needed without impacting the referential integrity of your data.
The text was updated successfully, but these errors were encountered: