chore: replace selenium user with fixed user #31844

villebro · 2025-01-14T20:54:34Z

SUMMARY

This PR does the following:

- Remove the THUMBNAIL_SELENIUM_USER config parameter. To execute as a fixed user, use the new fixed user executor (FixedExecutor), which can assume the role of any user. This means you can use a different fixed user for thumbnails and Alerts & Reports if necessary.
- Rename ALERT_REPORTS_EXECUTE_AS and THUMBNAIL_EXECUTE_AS to ALERT_REPORTS_EXECUTORS and THUMBNAIL_EXECUTORS for improved clarity.
- Add CACHE_WARMUP_EXECUTORS to make it possible to specify executors for cache warmup tasks. With this change, all async tasks can be configured to use arbitrary executors, as was already possible for Thumbnails and Alerts & Reports.
- Change the cache warmup strategy interface to expose a get_tasks method instead of get_payloads. This is needed because each cache warmup task must now carry the username it executes as.
- Update docs to reflect the changes.
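
To illustrate the per-task executor lists described above, here is a minimal, self-contained sketch. It does not import Superset: ExecutorType and FixedExecutor below are simplified stand-ins, pick_username is a hypothetical helper, and the usernames thumbnail_bot and report_bot are made up for the example. It only shows the idea that each list is tried in order and that each task type can fall back to a different fixed user.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional, Union


class ExecutorType(str, Enum):
    """Stand-in for the executor enum; only the members used in this sketch."""

    OWNER = "owner"
    CURRENT_USER = "current_user"


@dataclass(frozen=True)
class FixedExecutor:
    """Stand-in for the fixed user executor: always runs as `username`."""

    username: str


Executor = Union[ExecutorType, FixedExecutor]

# One list per task type, so each can fall back to a different fixed user.
# The usernames are invented for illustration.
THUMBNAIL_EXECUTORS: list[Executor] = [
    ExecutorType.CURRENT_USER,
    FixedExecutor("thumbnail_bot"),
]
ALERT_REPORTS_EXECUTORS: list[Executor] = [
    ExecutorType.OWNER,
    FixedExecutor("report_bot"),
]
CACHE_WARMUP_EXECUTORS: list[Executor] = [ExecutorType.OWNER]


def pick_username(
    executors: list[Executor],
    owners: list[str],
    current_user: Optional[str] = None,
) -> Optional[str]:
    """Hypothetical helper: return the first username the list can supply."""
    for executor in executors:
        if isinstance(executor, FixedExecutor):
            return executor.username
        if executor == ExecutorType.CURRENT_USER and current_user:
            return current_user
        if executor == ExecutorType.OWNER and owners:
            return owners[0]
    return None
```

With these lists, a thumbnail requested without a logged-in user would run as thumbnail_bot, while an ownerless report would run as report_bot. A cache warmup for an ownerless chart would resolve to no user at all, since CACHE_WARMUP_EXECUTORS has no fixed fallback in this sketch.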

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

TESTING INSTRUCTIONS

ADDITIONAL INFORMATION

Has associated issue: closes 76 - Remove falling back to fixed selenium executor for thumbnails by default #31843
Required feature flags:
Changes UI
Includes DB Migration (follow approval process in SIP-59)
- Migration is atomic, supports rollback & is backwards-compatible
- Confirm DB migration upgrade and downgrade tested
- Runtime estimates and downtime expectations provided
Introduces new feature or API
Removes existing feature or API

korbit-ai

Review by Korbit AI

Korbit automatically attempts to detect when you fix issues in new commits.

Category	Issue	Fix Detected
	Insecure default admin user for async tasks ▹ view	✅

superset/config.py

codecov · 2025-01-14T20:59:52Z

Codecov Report

Attention: Patch coverage is 80.00000% with 19 lines in your changes missing coverage. Please review.

Project coverage is 83.46%. Comparing base (76d897e) to head (5054fac).
Report is 1339 commits behind head on master.

Files with missing lines	Patch %	Lines
superset/tasks/cache.py	60.86%	18 Missing ⚠️
superset/tasks/utils.py	95.65%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           master   #31844       +/-   ##
===========================================
+ Coverage   60.48%   83.46%   +22.97%     
===========================================
  Files        1931      546     -1385     
  Lines       76236    39323    -36913     
  Branches     8568        0     -8568     
===========================================
- Hits        46114    32822    -13292     
+ Misses      28017     6501    -21516     
+ Partials     2105        0     -2105

Flag	Coverage Δ
hive	`48.81% <33.68%> (-0.36%)`	⬇️
javascript	`?`
mysql	`76.00% <73.68%> (?)`
postgres	`76.06% <73.68%> (?)`
presto	`53.30% <33.68%> (-0.51%)`	⬇️
python	`83.46% <80.00%> (+19.97%)`	⬆️
sqlite	`75.54% <73.68%> (?)`
unit	`61.07% <44.21%> (+3.44%)`	⬆️

Flags with carried forward coverage won't be shown.

korbit-ai

Review by Korbit AI

Korbit automatically attempts to detect when you fix issues in new commits.

Category	Issue	Fix Detected
	Silent Executor Error Handling Skips Cache Warmup ▹ view
	Invalid FIXED_USER Executor Error ▹ view
	Silent Failure on Missing Executor ▹ view
	Silent Error Handling ▹ view
	Handling ExecutorNotFoundError in get_dashboard_digest and get_chart_digest. ▹ view

Files scanned

File Path	Reviewed
superset/tasks/exceptions.py	✅
superset/tasks/types.py	✅
superset/thumbnails/digest.py	✅
superset/tasks/thumbnails.py	✅
superset/commands/report/alert.py	✅
superset/tasks/utils.py	✅
superset/models/slice.py	✅
superset/models/dashboard.py	✅
superset/tasks/cache.py	✅
superset/commands/report/execute.py	✅
superset/config.py	✅

korbit-ai · 2025-01-18T21:04:33Z

superset/tasks/cache.py

+    except (ExecutorNotFoundError, InvalidExecutorError):
+        username = None


Silent Executor Error Handling Skips Cache Warmup

What is the issue?

The code silently handles ExecutorNotFoundError and InvalidExecutorError by setting username to None, which leads to skipping cache warmup for affected charts.

Why this matters

This can result in some charts not being warmed up properly, reducing the effectiveness of the cache warmup strategy.

Suggested change ∙ Feature Preview

Log the specific error and consider alternative executor fallbacks:

    try:
        executor = get_executor(executors, chart)
        username = executor[1]
    except ExecutorNotFoundError as e:
        logger.error(f"No executor found for chart {chart.id}: {str(e)}. Using default executor.")
        username = current_app.config.get("DEFAULT_CACHE_WARMUP_USER")
    except InvalidExecutorError as e:
        logger.error(f"Invalid executor for chart {chart.id}: {str(e)}. Using default executor.")
        username = current_app.config.get("DEFAULT_CACHE_WARMUP_USER")
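
As an alternative to falling back to a default user, the concern above could also be addressed by keeping the skip but logging it. The following is a minimal sketch of that idea and not the code in this PR: WarmupTask, NoExecutorError, build_tasks, and demo_resolver are hypothetical names, and the task shape (a payload plus a username) is an assumption based on the PR summary.

```python
import logging
from typing import Callable, TypedDict

logger = logging.getLogger(__name__)


class WarmupTask(TypedDict):
    """Hypothetical task shape: what to warm up and which user to run as."""

    payload: dict
    username: str


class NoExecutorError(Exception):
    """Stand-in for the executor lookup errors discussed above."""


def build_tasks(
    chart_ids: list[int],
    resolve_username: Callable[[int], str],
) -> list[WarmupTask]:
    """Build one task per chart, logging every chart that has to be skipped."""
    tasks: list[WarmupTask] = []
    for chart_id in chart_ids:
        try:
            username = resolve_username(chart_id)
        except NoExecutorError as ex:
            # Record the skip instead of dropping the chart silently.
            logger.warning("Skipping cache warmup for chart %s: %s", chart_id, ex)
            continue
        tasks.append({"payload": {"chart_id": chart_id}, "username": username})
    return tasks


def demo_resolver(chart_id: int) -> str:
    """Toy resolver: chart 2 has no usable executor."""
    if chart_id == 2:
        raise NoExecutorError("no owner and no fixed user configured")
    return "admin"
```

Calling build_tasks([1, 2, 3], demo_resolver) yields tasks for charts 1 and 3 and emits one warning for chart 2, so an administrator can see which charts were left cold.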

Chat with Korbit by mentioning @korbit-ai, and give a 👍 or 👎 to help Korbit improve your reviews.

korbit-ai · 2025-01-18T21:04:33Z

superset/tasks/utils.py

+        if executor == ExecutorType.FIXED_USER:
+            raise InvalidExecutorError()


Invalid FIXED_USER Executor Error

What is the issue?

The code raises InvalidExecutorError when encountering ExecutorType.FIXED_USER, but this type should be valid according to the developer's intent of replacing selenium executor with fixed_user executor.

Why this matters

This will prevent the system from using the FIXED_USER executor type directly, contradicting the main purpose of the changes being made to replace selenium with fixed user executors.

Suggested change ∙ Feature Preview

Remove this validation check since FIXED_USER is now a valid executor type:

    # Remove these lines
    if executor == ExecutorType.FIXED_USER:
        raise InvalidExecutorError()
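
One possible reading of the check in the diff, which this thread does not confirm, is that a fixed user is meant to be configured through FixedExecutor("admin") as in the config example, so the bare enum member carries no username and cannot be resolved. The sketch below illustrates that reading with simplified stand-ins; resolve is a hypothetical function, not the real get_executor.

```python
from dataclasses import dataclass
from enum import Enum


class ExecutorType(str, Enum):
    """Stand-in enum with only the members needed for this sketch."""

    OWNER = "owner"
    FIXED_USER = "fixed_user"


@dataclass(frozen=True)
class FixedExecutor:
    """Stand-in: a fixed user executor that carries its username."""

    username: str


class InvalidExecutorError(Exception):
    """Stand-in for the exception raised in the diff."""


def resolve(executors, owners):
    """Return (executor type, username) for the first usable entry, else None."""
    for executor in executors:
        if isinstance(executor, FixedExecutor):
            return ExecutorType.FIXED_USER, executor.username
        if executor == ExecutorType.FIXED_USER:
            # The bare enum member names no user, so there is nothing to run as.
            raise InvalidExecutorError("use FixedExecutor('<username>') instead")
        if executor == ExecutorType.OWNER and owners:
            return ExecutorType.OWNER, owners[0]
    return None
```

Under this reading, resolve([FixedExecutor("admin")], owners=[]) succeeds, while resolve([ExecutorType.FIXED_USER], owners=[]) raises, which would make the check a guard against misconfiguration and not a rejection of fixed users as such.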

korbit-ai · 2025-01-18T21:04:33Z

superset/thumbnails/digest.py

+    try:
+        executor_type, executor = get_executor(
+            executors=config["THUMBNAIL_EXECUTORS"],
+            model=dashboard,
+            current_user=get_current_user(),
+        )
+    except ExecutorNotFoundError:
+        return ""


Silent Failure on Missing Executor

What is the issue?

Silently returning an empty string when no executor is found could lead to unexpected behavior in thumbnail generation.

Why this matters

Without proper error handling or logging, debugging issues with missing executors becomes difficult, and the system's behavior becomes opaque to administrators.

Suggested change ∙ Feature Preview

Add logging before returning empty string to help with debugging:

    try:
        executor_type, executor = get_executor(
            executors=config["THUMBNAIL_EXECUTORS"],
            model=dashboard,
            current_user=get_current_user(),
        )
    except ExecutorNotFoundError:
        logger.warning("No valid executor found for dashboard thumbnail generation")
        return ""

korbit-ai · 2025-01-18T21:04:33Z

superset/thumbnails/digest.py

+    except ExecutorNotFoundError:
+        return ""


Silent Error Handling

What is the issue?

The error handling silently returns an empty string without logging the error or providing context about what went wrong.

Why this matters

Silent error handling makes it difficult to debug issues when they occur, as there's no trace of what caused the ExecutorNotFoundError.

Suggested change ∙ Feature Preview

Add error logging before returning empty string:

    except ExecutorNotFoundError as ex:
        logger.warning(f"Failed to get executor for thumbnail generation: {ex}")
        return ""

korbit-ai · 2025-01-18T21:04:33Z

superset/thumbnails/digest.py

+    except ExecutorNotFoundError:
+        return ""


Handling ExecutorNotFoundError in get_dashboard_digest and get_chart_digest.

In the get_dashboard_digest and get_chart_digest functions, when an ExecutorNotFoundError is caught, the code currently returns an empty string. This can potentially lead to security issues if the application does not handle the case when an executor is not found properly. Consider raising the exception or returning a default thumbnail image instead to ensure the application behaves as expected and avoids potential vulnerabilities.

UPDATING.md

michael-s-molina · 2025-01-20T11:44:59Z

superset/config.py

+# from superset.tasks.types import ExecutorType, FixedExecutor
+#
+# CACHE_WARMUP_EXECUTORS = [ExecutorType.OWNER, FixedExecutor("admin")]
+CACHE_WARMUP_EXECUTORS = [ExecutorType.OWNER]


Would it be possible to have a more generic configuration that accommodates future async tasks and reduces the number of configuration keys? Something like:

ASYNC_TASKS_EXECUTORS: dict[AsyncTaskType, ExecutorType] = {}

@michael-s-molina I kinda like this idea. However, I'm not fully clear on what the exact pattern will look like for the forthcoming AsyncTaskType. Similarly, a while back when I was implementing a custom KeyValue resource for an internal use case, I ran into issues where it was difficult to extend the existing KeyValueResource type. In the end I ended up doing a bunch of funky casting to avoid linting issues. So to get around this, we would need to make sure the type (or its equivalent) supports introducing new types beyond the built-in ones.

Thoughts? I'm ok implementing an AsyncTaskType enum (or similar) now, but we may still need to refactor it later, causing yet another breaking change.

How about:

    class AsyncTaskType:
        """Base class for async task types."""
        pass

    class CacheWarmUpTask(AsyncTaskType):
        pass

    class AlertsReportsTask(AsyncTaskType):
        pass

    # Usage
    ASYNC_TASKS_EXECUTORS: dict[AsyncTaskType, ExecutorType] = {
        CacheWarmUpTask(): ExecutorType.OWNER,
        AlertsReportsTask(): FixedExecutor("admin"),
    }

I'd prefer something that serializes more easily. String based enums are good in this regard, but introduce the pluggability issue outlined above.

Would a class, with predefined values that can be extended, be serializable enough? Something like:

    class AsyncTaskType:
        # Plain strings (no trailing commas, which would create tuples)
        CACHE_WARMUP = "CACHE_WARMUP"
        ALERTS_REPORTS = "ALERTS_REPORTS"
        _additional_types = set()

        @classmethod
        def add_type(cls, type):
            if type not in cls._additional_types:
                cls._additional_types.add(type)

        @classmethod
        def get_types(cls):
            return {cls.CACHE_WARMUP, cls.ALERTS_REPORTS}.union(cls._additional_types)

This would essentially mean get_types just returns set[str], which means ASYNC_TASKS_EXECUTORS would be dict[str, list[ExecutorType]]. I'm sure there's some good pattern for solving this, but I still feel that's best taken care of in a follow-up PR that implements the new async task framework.
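
To make the trade-off in this exchange concrete, here is a minimal sketch of a string-keyed mapping that stays extensible and serializes without custom encoders. Everything in it is hypothetical: TaskTypeRegistry, the CUSTOM_EXPORT task type, and the string encoding of executors (such as "fixed_user:admin") are invented for illustration and are not part of this PR or of any planned async task framework.

```python
import json

# Built-in task type names; plain strings so the mapping serializes as-is.
BUILTIN_TASK_TYPES = frozenset({"CACHE_WARMUP", "ALERTS_REPORTS"})


class TaskTypeRegistry:
    """Hypothetical registry of task type names that extensions can add to."""

    def __init__(self) -> None:
        self._types = set(BUILTIN_TASK_TYPES)

    def add_type(self, name: str) -> None:
        self._types.add(name)

    def get_types(self) -> set[str]:
        # Return a copy so callers cannot mutate the registry by accident.
        return set(self._types)

    def validate(self, executors_by_task: dict[str, list[str]]) -> None:
        """Reject configuration keys that no one has registered."""
        unknown = set(executors_by_task) - self._types
        if unknown:
            raise ValueError(f"Unknown task types: {sorted(unknown)}")


registry = TaskTypeRegistry()
registry.add_type("CUSTOM_EXPORT")  # a made-up, third-party task type

# Executors are encoded as strings here purely for the sake of the example.
ASYNC_TASKS_EXECUTORS: dict[str, list[str]] = {
    "CACHE_WARMUP": ["owner"],
    "ALERTS_REPORTS": ["owner", "fixed_user:admin"],
    "CUSTOM_EXPORT": ["fixed_user:export_bot"],
}

registry.validate(ASYNC_TASKS_EXECUTORS)
round_tripped = json.loads(json.dumps(ASYNC_TASKS_EXECUTORS))
```

The cost of this design is that the keys are only checked at runtime by validate, which is the typing weakness raised earlier in the thread.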

villebro · 2025-01-21T21:54:34Z

@michael-s-molina is there anything else that needs to be addressed, or is this good to go, assuming the async task type change can wait until the GATF feature?

michael-s-molina · 2025-01-22T11:46:53Z

superset/thumbnails/digest.py

+            current_user=get_current_user(),
+        )
+    except ExecutorNotFoundError:
+        return ""


Maybe None will be a better representation? If you agree, please change all digest generation methods.
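
A minimal sketch of what the None variant could look like, assuming the digest is a hash over some model key and the resolved username. The names compute_digest and resolve_username, the choice of SHA-256, and the inputs to the hash are all assumptions for illustration and do not reflect the actual digest functions.

```python
import hashlib
import logging
from typing import Callable, Optional

logger = logging.getLogger(__name__)


class ExecutorNotFoundError(Exception):
    """Stand-in for the exception caught in the diff above."""


def compute_digest(
    model_key: str,
    resolve_username: Callable[[], str],
) -> Optional[str]:
    """Return a digest for the thumbnail, or None when no executor resolves."""
    try:
        username = resolve_username()
    except ExecutorNotFoundError as ex:
        logger.warning("No executor for %s, skipping digest: %s", model_key, ex)
        return None
    unique = f"{model_key}\n{username}"
    return hashlib.sha256(unique.encode("utf-8")).hexdigest()
```

Returning None lets callers distinguish "no digest could be computed" from a real string value with a plain `is None` check, which an empty string makes less explicit.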

pull-request-size bot added the size/M label Jan 14, 2025

korbit-ai bot reviewed Jan 14, 2025

pull-request-size bot added size/L and removed size/M labels Jan 14, 2025

justinpark marked this pull request as draft January 15, 2025 18:19

justinpark added the review:draft label Jan 15, 2025

villebro force-pushed the villebro/selenium-user branch 6 times, most recently from 06e7f74 to ce4330b Compare January 18, 2025 17:43

github-actions bot added the doc Namespace | Anything related to documentation label Jan 18, 2025

villebro force-pushed the villebro/selenium-user branch from 9b07b8e to 9fc1e15 Compare January 18, 2025 18:26

villebro added risk:breaking-change Issues or PRs that will introduce breaking changes v5.0 Label added by the release manager to track PRs to be included in the 5.0 branch labels Jan 18, 2025

villebro requested a review from michael-s-molina January 18, 2025 19:34

villebro marked this pull request as ready for review January 18, 2025 20:58

korbit-ai bot reviewed Jan 18, 2025

michael-s-molina reviewed Jan 20, 2025

villebro added 8 commits January 21, 2025 09:45

chore: replace selenium user with fixed user

3b17380

fix digest

44ce221

fix digest tests

4b79c30

add variable fixed user

96f5dd9

use executor on cache warmup tasks

27d3ece

lint

0fbe4d2

fix tests

5f30177

rename config flags

230799f

villebro force-pushed the villebro/selenium-user branch from 9fc1e15 to ec338af Compare January 21, 2025 17:46

update docs

5054fac

villebro force-pushed the villebro/selenium-user branch from ec338af to 5054fac Compare January 21, 2025 17:47

michael-s-molina approved these changes Jan 22, 2025
