Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Source and shuffle parallelism settings don't seem to work #58

Open
kkrugler opened this issue Apr 28, 2016 · 0 comments
Open

Source and shuffle parallelism settings don't seem to work #58

kkrugler opened this issue Apr 28, 2016 · 0 comments

Comments

@kkrugler
Copy link
Contributor

Fabian had described the two configuration parameters:

The parameters are:

  • flink.num.sourceTasks to specify the parallelism of source tasks.
  • flink.num.shuffleTasks to specify the parallelism of all shuffling tasks (GroupBy, CoGroup).

I tried setting these in the conf that’s passed to the Flink FlowConnector, but they don’t seem to be working as expected.

I know that something is happening, as the first time I tried this (setting flink.num.sourceTasks to 40, and flink.num.shuffleTasks to 20) I forgot to remove the -p 20 parameter from the CLI.

This caused the job to fail, with an error about not having enough slots.

When I removed the -p 20 parameter, then the job runs, but I see everything using 40 slots (source/map tasks and groups)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant