Qualx model updates from weekly KPI run 2025-01-10 #1495

nvauto · 2025-01-10T12:40:26Z

Description:

The latest /ssd0/qual/spark-rapids-tools-private/qual-kpis/kpi_summary_xgboost-2025-01-10.csv:

platform,tp_count,fp_count,tn_count,fn_count,precision,recall

databricks-aws,15,8,26,1,65.22,93.75

databricks-aws_photon,10,18,18,4,35.71,71.43

databricks-azure,18,1,21,8,94.74,69.23

databricks-azure_photon,20,7,6,15,74.07,57.14

dataproc,34,20,29,2,62.96,94.44

emr,13,9,25,4,59.09,76.47

onprem,27,19,28,2,58.7,93.1

Description:\n\nThe latest /ssd0/qual/spark-rapids-tools-private/qual-kpis/kpi_summary_xgboost-2025-01-10.csv:\n\nplatform,tp_count,fp_count,tn_count,fn_count,precision,recall\n\ndatabricks-aws,15,8,26,1,65.22,93.75\n\ndatabricks-aws_photon,10,18,18,4,35.71,71.43\n\ndatabricks-azure,18,1,21,8,94.74,69.23\n\ndatabricks-azure_photon,20,7,6,15,74.07,57.14\n\ndataproc,34,20,29,2,62.96,94.44\n\nemr,13,9,25,4,59.09,76.47\n\nonprem,27,19,28,2,58.7,93.1 Signed-off-by: nvauto <[email protected]>

mattahrens · 2025-01-10T15:26:37Z

KPIs look good on the updates....only minor changes is one more TN and one less FP for dataproc and onprem (which is good).

@leewyang any concerns on your side?

leewyang · 2025-01-10T17:19:56Z

It's interesting that while only dataproc and onprem had KPI deltas, all of the model binaries were different than before.

Since there were no dataset changes, these changes were likely due to:

the code commits since the last model build, i.e. presumably this PR?
build environment differences.

For (2), I had been using a fairly fixed conda environment on a dev box, so I think building from a controlled CI/CD environment is actually better. Note that I had seen some binary deltas when building models in different environments before, hence the use of a fixed conda env.

I will run the old process today just to confirm the expected deltas for dataproc and onprem and see if the other models change as well (or if they stay the same for my build environment).

parthosa · 2025-01-10T19:10:03Z

Question on this CI pipeline: Can we use a bash script in the pipeline that converts CSV to markdown format so that it renders better in Github?

leewyang · 2025-01-10T22:34:05Z

OK, finished running through the old process and comparing results.

All models did, in fact, change due to the recent code changes, but the model metrics were very similar to before (since the changes were minor). Note that my models were still slightly different.
For the KPI deltas, it looks like they were mostly due to data points that were very close to the threshold line, which subsequently moved across the line slightly one way or the other.

So, basically, I think these models would be fine to merge now.

Ideally, it would be nice to also auto-generate a PR for the evaluate metrics for the internal repo, but I think those are more of a nice-to-have at this point, since we can theoretically generate them offline, if needed.

@mattahrens should we address @parthosa's comment first before committing these models?

mattahrens · 2025-01-10T22:36:02Z

Question on this CI pipeline: Can we use a bash script in the pipeline that converts CSV to markdown format so that it renders better in Github?

Did you specifically mean the .metrics files as changed in this PR? We can do that separately.

parthosa · 2025-01-10T22:36:17Z

We can address this in the next weekly update as it is more of a cosmetic feature.

mattahrens · 2025-01-10T22:36:21Z

@mattahrens should we address @parthosa's comment first before committing these models?

No need to block this PR to make that change.

leewyang · 2025-01-10T23:13:35Z

Also, one other note. The script doesn't build/evaluate the combined model, but that's not really used, so again, it's something we could always do offline if needed.

amahussein requested review from leewyang and mattahrens January 10, 2025 15:01

amahussein added the user_tools Scope the wrapper module running CSP, QualX, and reports (python) label Jan 10, 2025

leewyang approved these changes Jan 10, 2025

View reviewed changes

leewyang merged commit dd0b336 into dev Jan 10, 2025
14 checks passed

leewyang deleted the qualx-model-update-weekly-2025-01-10 branch January 10, 2025 23:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qualx model updates from weekly KPI run 2025-01-10 #1495

Qualx model updates from weekly KPI run 2025-01-10 #1495

nvauto commented Jan 10, 2025

mattahrens commented Jan 10, 2025

leewyang commented Jan 10, 2025

parthosa commented Jan 10, 2025

leewyang commented Jan 10, 2025

mattahrens commented Jan 10, 2025

parthosa commented Jan 10, 2025

mattahrens commented Jan 10, 2025

leewyang commented Jan 10, 2025

Qualx model updates from weekly KPI run 2025-01-10 #1495

Qualx model updates from weekly KPI run 2025-01-10 #1495

Conversation

nvauto commented Jan 10, 2025

mattahrens commented Jan 10, 2025

leewyang commented Jan 10, 2025

parthosa commented Jan 10, 2025

leewyang commented Jan 10, 2025

mattahrens commented Jan 10, 2025

parthosa commented Jan 10, 2025

mattahrens commented Jan 10, 2025

leewyang commented Jan 10, 2025