A self-rescue guide for graduate students

217 42 Updated Dec 18, 2023

A fork to add multimodal model training to open-r1

Python 1,143 58 Updated Feb 8, 2025

A dataset of complex questions on semi-structured Wikipedia tables

HTML 160 30 Updated Mar 19, 2021

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Vue 7,131 483 Updated Mar 30, 2025

Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 6,517 739 Updated Oct 22, 2024

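Interview questions (with answers) for LLM algorithm roles: common questions and concept explanations. "LLM interview questions", "algorithm role interviews", "common interview questions", "LLM algorithm interviews", "LLM application fundamentals"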

Jupyter Notebook 762 57 Updated Oct 7, 2024

Toolkit for linearizing PDFs for LLM datasets/training

Python 10,687 714 Updated Mar 28, 2025

A naive implementation of the TableRag Paper

Python 29 4 Updated Oct 22, 2024

An awesome resume template.

TeX 110 5 Updated Feb 21, 2025

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Jupyter Notebook 2,412 247 Updated Feb 6, 2024

Makes the RAG pipeline work better on table data

Python 41 5 Updated Oct 16, 2024

Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

Python 63 14 Updated Jun 18, 2024
Python 855 107 Updated Oct 26, 2024

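This project collects open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), converts the raw data into instruction-tuning format, and fine-tunes LLMs on it to strengthen their understanding of tabular data, with the goal of building a large language model dedicated to table intelligence tasks.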

552 43 Updated Apr 22, 2024

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 13,251 1,877 Updated Mar 29, 2025

📝 An Awesome Collection of Chinese Legal Datasets and Relevant Resources. Dedicated to collecting comprehensive Chinese legal data sources.

847 80 Updated Jun 20, 2023

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,830 710 Updated Mar 10, 2025

An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)

Python 309 864 Updated Dec 2, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 24,035 2,403 Updated Mar 27, 2025

A distributed, fast open-source graph database featuring horizontal scalability and high availability

C++ 11,198 1,221 Updated Mar 19, 2025

Organizes the best currently open-source table recognition models, improves their pre-processing and post-processing, and converts the models to ONNX.

Python 611 52 Updated Mar 30, 2025

[IEEE VIS 2024] LLaVA-Chart: Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning

Python 66 5 Updated Jan 22, 2025

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 7,476 715 Updated Mar 28, 2025

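Contains courseware, labs, exam questions, and MOOC materials for the 2019 graduate cohort of the School of Computer Science at Harbin Institute of Technology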

23 6 Updated Sep 24, 2020

Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tab…

Python 192 7 Updated Sep 27, 2024

🔥 Curated Chinese prompts 🔥, a ChatGPT usage guide that makes ChatGPT more fun and more useful! 🚀

4,073 353 Updated Jan 10, 2025

Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"

Python 80 9 Updated May 11, 2023

[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".

Python 127 13 Updated May 14, 2024

Chinese text classification with TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, and Transformer; based on PyTorch and ready to use out of the box.

Python 5,519 1,243 Updated Sep 23, 2020