Highlights
- Pro
Stars
A fork to add multimodal model training to open-r1
A dataset of complex questions on semi-structured Wikipedia tables
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
Toolkit for linearizing PDFs for LLM datasets/training
A naive implementation of the TableRag Paper
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
📝 An Awesome Collection of Chinese Legal Dataset and Relevant Resources. 致力于收集全面的中文法律数据源
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)
A modular graph-based Retrieval-Augmented Generation (RAG) system
A distributed, fast open-source graph database featuring horizontal scalability and high availability
整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX Organize the currently open-source optimal table recognition models, improve pre-processing and post-processing, and convert the models to ONNX.
[IEEE VIS 2024] LLaVA-Chart: Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tab…
🔥中文 prompt 精选🔥,ChatGPT 使用指南,提升 ChatGPT 可玩性和可用性!🚀
Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"
[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".
中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。