Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extra Beam Sources or Sinks #706

Closed
3 tasks
nevillelyh opened this issue Jun 29, 2017 · 3 comments
Closed
3 tasks

Extra Beam Sources or Sinks #706

nevillelyh opened this issue Jun 29, 2017 · 3 comments
Labels
enhancement New feature or request

Comments

@nevillelyh
Copy link
Contributor

nevillelyh commented Jun 29, 2017

  • Elasticsearch source
  • Cassandra source
  • HadoopOutputFormat sink (Beam really)
@nevillelyh nevillelyh added the enhancement New feature or request label Jun 29, 2017
@kubum
Copy link

kubum commented Mar 12, 2018

Hi @nevillelyh

I would like to know more about Cassandra source. How do you see it is working? Any potential problems or ideas?

Thank you!

@nevillelyh
Copy link
Contributor Author

Don't have any good idea for this. The sink we have is optimized for throughput in batch mode and doesn't work so well in streaming. It uses BulkWriter under the hood which exposes some Cassandra internals and there isn't an equivalent BulkReader.

Upstream has CassandraIO but it has different API (annotated POJO which may not work in Scala) and doesn't seem to be particularly optimized.
We don't read from Cassandra often so this hasn't been a priority.

@jbx jbx added the P2 label Jul 17, 2018
@jbx jbx changed the title Nice to haves Extra Beam Sources or Sinks Aug 1, 2018
@regadas
Copy link
Contributor

regadas commented Sep 10, 2018

Moved to spotify/scio-contrib#7

@regadas regadas closed this as completed Sep 10, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants