Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Running sparklint on Databricks #77

Open
damanp opened this issue May 2, 2018 · 1 comment
Open

Running sparklint on Databricks #77

damanp opened this issue May 2, 2018 · 1 comment

Comments

@damanp
Copy link

damanp commented May 2, 2018

Hi,
So I've just started using Databricks and I'm pretty stoked about being able to use sparklint and checking on my jobs.

I currently upload my jars to Databricks and then invoke the methods that I am interested in.

I tried including sparklint from the sbt file as mentioned, but the compilation always fails.
(Let me know if you would want to see the logs in detail. I will upload them if you would like to)

I then compiled only the sparklint code and uploaded it as a separate jar, attached it to my cluster, and in the other jar that holds my code, I added the lines:
spark.conf.set("spark.extraListeners", "com.groupon.sparklint.SparklintListener") spark.conf.set("spark.sparklint.port", "4242")

However, I am unable to access the port or see any sort of indication in the logs that sparklint has been active during the job execution.

Can you please help me get this going on Databricks?

@mrayandutta
Copy link

I am also interested to use Sparklint with Databricks notebooks .Can anybody provide any help on this ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants