jinliyl commented Jan 14, 2026

No description provided.

…ation

- Integrated tqdm library for progress tracking
- Added progress bar display for user sessions processing
- Implemented session-by-session progress updates with custom description
- Maintained existing session processing logic while adding visual feedback
- Preserved all original functionality including logging and data handling
…ocessing

- Increased max_rps from 9 to 20 in default config
- Removed tqdm progress bar for session processing
- Implemented asyncio semaphore for session concurrency control
- Added parallel processing of sessions within each user
- Updated logging to show session count per user
- Changed user processing from concurrent to sequential
- Added completion tracking with progress indicators
- Modified output formatting for better readability
- Lowered max_rps from 20 to 2 to prevent rate limiting issues
- Reduced rps_window from 10 to 1 for stricter request throttling
- Updated default configuration values for more stable performance
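
The scheduling change described in this commit (users handled one after another, sessions within a user run in parallel under an asyncio semaphore, with a running completion count) could look roughly like the sketch below. This is an illustration only, not the code from the PR: `process_session`, `process_user`, `process_all`, and the `max_concurrency` argument are hypothetical names chosen for the example.

```python
import asyncio


async def process_session(user_id: str, session_id: int) -> str:
    """Hypothetical stand-in for the real per-session work (e.g. an LLM call)."""
    await asyncio.sleep(0.01)  # simulate an I/O-bound request
    return f"{user_id}/session-{session_id}"


async def process_user(user_id: str, session_ids: list[int], max_concurrency: int) -> list[str]:
    """Run one user's sessions in parallel, at most max_concurrency at a time."""
    semaphore = asyncio.Semaphore(max_concurrency)
    completed = 0

    async def run_one(session_id: int) -> str:
        nonlocal completed
        async with semaphore:  # blocks while max_concurrency sessions are in flight
            result = await process_session(user_id, session_id)
        completed += 1
        print(f"[{user_id}] {completed}/{len(session_ids)} sessions done")
        return result

    print(f"[{user_id}] processing {len(session_ids)} sessions")
    # gather returns results in the order the awaitables were passed in
    return await asyncio.gather(*(run_one(s) for s in session_ids))


async def process_all(users: dict[str, list[int]], max_concurrency: int = 4) -> dict[str, list[str]]:
    """Process users sequentially; only sessions within a user overlap."""
    results: dict[str, list[str]] = {}
    for user_id, session_ids in users.items():
        results[user_id] = await process_user(user_id, session_ids, max_concurrency)
    return results


if __name__ == "__main__":
    demo_users = {"user-a": [1, 2, 3], "user-b": [1, 2]}
    print(asyncio.run(process_all(demo_users, max_concurrency=2)))
```

Keeping users sequential bounds the total number of in-flight requests by the per-user semaphore alone, which is one plausible reason for the switch away from concurrent user processing.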
…oncurrency control

- Add new analyze_dataset_stats.py script for comprehensive HaluMem dataset analysis
- Include statistics for user sessions, dialogues, content lengths and chunk distributions
- Replace rate limiting with concurrency control in BaseLLM using semaphore mechanism
- Update configuration to use max_concurrency instead of max_rps and rps_window
- Modify dialogue formatting to include only user messages in evaluation
- Add percentile calculations and detailed content size distribution metrics
- Implement session splitting logic based on character length thresholds
- Provide per-user statistics and summary tables for dataset analysis
- Refactor BaseLLM to use internal _chat_impl and _stream_chat_impl methods
- Remove rate limiting locks and timestamps from LLM initialization
- Add command-line interface for dataset statistics analysis tool
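
As a rough illustration of the BaseLLM change listed above (a semaphore replacing the rate-limiting locks and timestamps, with public methods delegating to `_chat_impl` and `_stream_chat_impl`), a minimal sketch might look like this. Only the names `BaseLLM`, `max_concurrency`, `_chat_impl`, and `_stream_chat_impl` come from the commit notes; the public method names `chat` and `stream_chat`, the signatures, and the `EchoLLM` subclass are assumptions made for the example.

```python
import asyncio
from typing import AsyncIterator


class BaseLLM:
    """Hypothetical sketch: caps concurrent calls instead of requests per second."""

    def __init__(self, max_concurrency: int = 4):
        self.max_concurrency = max_concurrency
        # Create instances inside a running event loop so this also works on
        # older Python versions that bind the semaphore to a loop at creation.
        self._semaphore = asyncio.Semaphore(max_concurrency)

    async def _chat_impl(self, prompt: str) -> str:
        raise NotImplementedError

    def _stream_chat_impl(self, prompt: str) -> AsyncIterator[str]:
        raise NotImplementedError

    async def chat(self, prompt: str) -> str:
        async with self._semaphore:  # one slot per in-flight request
            return await self._chat_impl(prompt)

    async def stream_chat(self, prompt: str) -> AsyncIterator[str]:
        async with self._semaphore:  # slot is held until the stream is exhausted
            async for chunk in self._stream_chat_impl(prompt):
                yield chunk


class EchoLLM(BaseLLM):
    """Toy backend (assumption) that records how many calls overlap."""

    def __init__(self, max_concurrency: int = 4):
        super().__init__(max_concurrency)
        self.in_flight = 0
        self.peak = 0

    async def _chat_impl(self, prompt: str) -> str:
        self.in_flight += 1
        self.peak = max(self.peak, self.in_flight)
        await asyncio.sleep(0.01)  # simulate network latency
        self.in_flight -= 1
        return prompt.upper()

    async def _stream_chat_impl(self, prompt: str) -> AsyncIterator[str]:
        for word in prompt.split():
            await asyncio.sleep(0)  # yield control between chunks
            yield word


async def demo() -> tuple[list[str], int, list[str]]:
    llm = EchoLLM(max_concurrency=2)
    replies = await asyncio.gather(*(llm.chat(f"msg {i}") for i in range(6)))
    chunks = [c async for c in llm.stream_chat("hello streaming world")]
    return list(replies), llm.peak, chunks


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

Compared with a requests-per-window limiter, a semaphore needs no timestamps or locks: throughput adapts to how long each call takes, and the only tunable is the number of simultaneous requests.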
jinliyl merged commit 12efcc9 into main Jan 14, 2026
1 of 5 checks passed

2 participants