Stars
DolbyUUU / verl
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
风云天气是Android 平台开源天气 App,采用Kotlin、Room、OKHttp3、 协程等框架实现。
Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
Fully open reproduction of DeepSeek-R1
Curated list of surveys and papers on the applications of satellite data (satellite imagery) in finance and economics.
A curated list of academic papers focused on AI model pricing, model valuation (on model marketplaces).
Collection of papers on Virtual Reality (VR) and Augmented Reality (AR) applications in field of economics and business.
Curated list of papers on cryptocurrency and blockchain from top Economics, Finance, and Information Systems journals.
Collection of resources for economists and economics PhD students, including tips for academic writing, research, publishing, job market preparation, referee reports, and more.
Comparative analysis of stereo matching techniques for disparity map reconstruction, including traditional block matching, OpenCV StereoSGBM, and PSMNet.
Collection of materials on experimental economics and economic experiments, covering experimental approaches, methodologies, design, and econometrics.
Collection of materials and papers on eye tracking in field of economics and business, covering the basics of eye tracking technology and its application to visual attention studies.
Collection of materials on academic writing in economics, business, and social science.
A Virtual Reality-based Balloon Analogue Risk Task (VR-BART) built with Unreal Engine 5 to study risk-taking behavior in immersive environments.
Economic preference clustering analysis using generative and deep learning models, including Gaussian Mixture Models (GMM), Wishart Mixture Models (WMM), and Variational Deep Embedding (VaDE).
A toy system for generating event timelines from social media data, specifically focusing on the Olympic Game medalist events.
Reinforcement Calibration SimCSE, combining contrastive learning, artificial potential fields, perceptual loss, and RLHF to achieve improved Semantic Textual Similarity (STS) embeddings. PyTorch-ba…
Byte-Pair Encoding (BPE) (subword-based tokenization) algorithm implementaions from scratch with python
Regression algorithm implementaion from scratch with python (OLS, LASSO, Ridge, robust regression)
Clustering algorithm implementaions from scratch with python (k-means, EM-GMM, mean-shift, agglomerative)