reasoning output #9

lingwei-gu · 2025-09-30T00:26:46Z

This PR enables reasoning output.

ronakice

Most of the code looks fine some comments, I will review again after you resolve. I think next time we should also try to break PRs up, 500 something lines is a bit harder to review (but okay this time :))

ronakice · 2025-10-23T20:09:42Z

src/nuggetizer/core/llm.py

+                # print(f"🔍 DEBUG LLM: API call completed successfully") # not
+                # removed because it's very helpful for debugging


I think we should remove these even if it is helpful for debugging, coding practice wise it is for the developer to debug.

We could have a --debug mode where we expose such things, but that is a separate PR. And we shouldn't be doing print within function methods, logging is a better alternative.

I see let me remove this then. We can do debug mode in another PR if needed

ronakice · 2025-10-23T20:09:48Z

src/nuggetizer/core/llm.py

+                # print(f"🔍 DEBUG LLM: Full response: {completion}") # not
+                # removed because it's very helpful for debugging


ronakice · 2025-10-23T20:10:34Z

src/nuggetizer/core/llm.py

-                    response = reasoning_content if reasoning_content else ""
+                    reasoning_content = message["reasoning_content"]
+                else:
+                    print(f"No reasoning found in response from {self.model}")


should this be printed? Feels spammy. if the model is a reasoning model and we hit here, we reach some error state and should warn. else this should not be printing

this is also debugging print. removed

ronakice · 2025-10-23T20:11:34Z

src/nuggetizer/core/llm.py

+                        "qwen" in self.model.lower()
+                        or "qwen2" in self.model.lower()
+                        or "qwen3" in self.model.lower()
+                    ):
+                        # Use cl100k_base for Qwen models as they typically use
+                        # similar tokenization
+                        encoding = tiktoken.get_encoding("cl100k_base")


let's not conflate this into this PR, if we don't know the model just don't include the encoding

sounds good. removed

ronakice · 2025-10-23T20:13:38Z

src/nuggetizer/models/nuggetizer.py

-        if self.log_level >= 1:
-            self.logger.info(
-                f"Initialized Nuggetizer with models: {creator_model}, {scorer_model}, {assigner_model}"
-            )
+        if log_level >= 1:
+            self.logger.setLevel(logging.INFO)
+        if log_level >= 2:
+            self.logger.setLevel(logging.DEBUG)


what's this code doing, we don't update log_level?

we can specify log level in the input. same debugging logs appear only if we specify log level to be 2

lingwei-gu · 2025-10-24T21:20:44Z

Most of the code looks fine some comments, I will review again after you resolve. I think next time we should also try to break PRs up, 500 something lines is a bit harder to review (but okay this time :))

I see thanks for pointing out. I will break them into smaller PRs

ronakice

LGTM! Thank you :)

lingwei-gu and others added 29 commits September 29, 2025 20:26

reasoning output

6d5ae27

reasoning output

9a3e9ae

trace class

fcad88d

trace class

6a0ab23

trace class

7bfec5e

trace class

4bd4917

reasoning with debug prints

8e5c598

trace class

fda9989

trace class

7d42ad9

trace class

2f57beb

trace class

3dabfe6

Merge branch 'main' into reasoning

c6889ba

fix lint & merge

213643c

fix lint & merge

cacb544

fix lint & merge

3e2a8d1

fix lint & merge

2d9f9a2

fix lint & merge

702242d

fix lint & merge

c56c49a

fix lint & merge

4b1f527

fix lint & merge

f72106e

fix lint & merge

85ea34c

fix lint & merge

4fae12f

fix lint & merge

9522800

fix lint & merge

7cb106b

fix lint & merge

4084b2d

fix lint & merge

51564eb

Merge branch 'main' into reasoning

118da1f

Merge branch 'main' into reasoning

3fccb91

fix lint

430f577

ronakice reviewed Oct 23, 2025

View reviewed changes

addressed comments

d9e4e1c

lingwei-gu requested a review from ronakice October 24, 2025 21:31

ronakice approved these changes Oct 24, 2025

View reviewed changes

ronakice merged commit 7ca223b into castorini:main Oct 24, 2025
1 check passed

		# print(f"🔍 DEBUG LLM: API call completed successfully") # not
		# removed because it's very helpful for debugging

		# print(f"🔍 DEBUG LLM: Full response: {completion}") # not
		# removed because it's very helpful for debugging

reasoning output #9

reasoning output #9

Uh oh!

Conversation

lingwei-gu commented Sep 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ronakice left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lingwei-gu commented Oct 24, 2025

Uh oh!

ronakice left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

lingwei-gu commented Sep 30, 2025 •

edited

Loading

ronakice left a comment •

edited

Loading